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I. REAL PARTY IN INTEREST 

The real party in interest is Genentech, Inc., South San Francisco, California, by an 
assignment of the parent application, U.S. Patent Application Serial No. 09/665,350 recorded 
July 9, 2001, at Reel 01 1964 and Frame 0181. The present application is a continuation of U.S. 
Serial No. 09/665,350. 

II. RELATED APPEALS AND INTERFERENCES 

The claims pending in the current application are directed to antibodies to a polypeptide 
referred to herein as "PR0269." There exist two related patent applications, (1) U.S. Patent 
Application Serial No. 09/907,841, now Patent No. 7, 033, 825, issued 04-25-2006 (containing 
claims directed to nucleic acids encoding PR0269 polypeptides), and (2) U.S. Patent Application 
Serial No. 09/904,766, filed November 12, 2001 (containing claims directed to PR0269 
polypeptides). Related U.S. Patent Application Serial No. 09/904,766 application is also under 
final rejection by the same Examiner, and based upon the same outstanding rejections, is being 
appealed independently and concurrently herewith. Although there exist several applications 
directed to the "gene amplification" utility, in general, under Appeal, none of these are related to 
PR0269 molecules or antibodies binding to it. 

III. STATUS OF CLAIMS 

Claims 39-43 are in this application. 
Claims 1-38 and 44 have been canceled. 

Claims 39-43 stand rejected and Appellants appeal the rejection of these claims. 

IV. STATUS OF AMENDMENTS 

A summary of the prosecution history for this case is as follows: 

Previously, in response to a Final Office Action mailed on January 13, 2005, a Notice of 

Appeal was filed on June 13, 2005 and an Appeal Brief was filed on September 13, 2005 . An 

Examiner's Answer was mailed on November 15, 2005 which cited new references; hence, a 

Reply Brief was filed January 17, 2006 with a Petition for Designation of New Grounds of 

Rejection and with a request to withdraw finality of the rejection under 37 C.F.R. §1.181 . The 

Decision on the Petition granting the Appellants' request to have the finality withdrawn was 
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mailed on January 30, 2006. Therefore, a Supplemental Response with additional references and 
affidavits supporting Appellants' arguments was filed March 30, 2006. In response to a Final 
Office Action dated April 21, 2006, an RCE response was filed on August 21, 2006 with further 
references and affidavits supporting Appellants' arguments. Two Final Office Actions were 
mailed on September 27, 2006 and October 13, 2006, and a Response and a Notice of Appeal 
was filed March 13, 2007. An Advisory Action was mailed April 2, 2007. 

No claim amendments have been submitted after the Response mailed March 13, 2007. 

A copy of the rejected claims in the present Appeal is provided in Section VIII. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

The invention claimed in the present application is related to an isolated antibody that 
specifically binds to the polypeptide of SEQ ID NO: 96 (Claim 39). The invention is further 
directed to an antibody that specifically binds to the polypeptide of SEQ ID NO: 96 which is a 
monoclonal antibody, a humanized antibody or an antibody fragment. (Claims 40, 41 and 42). 
The invention is further directed to an isolated antibody that specifically binds to the polypeptide 
of SEQ ID NO: 96 which is labeled. (Claim 43) 

The full-length PR0269 polypeptide having the amino acid sequence of SEQ ID NO:96 
is described in the specification at, for example, page 12, line 30 to page 13, line 1, page 40, lines 
1-11, page 103, lines 4-12, in Figure 36 and in SEQ ID NO:96. PR0269 is described as a novel 
polypeptide having a signal peptide sequence and a transmembrane domain (see, for example, 
Example 15 and Figure 36). As shown in Example 92 and Table 9 of the specification, PR0269 
showed approximately 2-3.5 fold amplification in 8 primary lung tumors and tumor cell lines, 
(see Table 9). The cDNA nucleic acid encoding PR0269 is described in the specification at, for 
example, Example 15, in Figure 35 and in SEQ ID NO:95. Page 60, lines 18-22 of the 
specification provides the description for Figures 35 and 36. Examples 53-56 describe the 
expression of PRO polypeptides in various host cells, including E. coli, mammalian cells, yeast 
and Baculo virus-infected insect cells. The preparation of antibodies that bind PRO polypeptides 
is set forth in the specification at pages 139-147 and Example 57. 
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VI. GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

1 . Whether Claims 39-43 satisfy the utility/ enablement requirement under 35 
U.S.C. §§101/112, first paragraph. 

VII. ARGUMENTS 
Summary of the Arguments: 
Issue 1: Utility/ Enablement 

As a preliminary matter, Appellants note that the Examiner refers to the microarray assay 
in several instances in several Office Actions, and cites references like Lilley et aL, King et al. 9 
Wildsmith et aL, Nagaraja et aL, Waghray et al. 9 Sagynaliev et aL, Chen et al etc. which use and 
analyze the microarray assay. On the other hand, Appellants rely upon the gene amplification 
data of the PR0269 gene for patentable utility of the antibodies to PR0269 polypeptides in the 
present application, and would like to distinguish between the gene amplification and the 
microarray assay. The gene amplification assay measures the level at which a certain gene (i.e. 
DNA) is amplified in the genome, whereas the microarray assay measures the level of expression 
of a mRNA encoding for a certain polypeptide in a sample. Throughout prosecution, the 
Examiner fails to distinguish between these two techniques, but Appellants submit that the two 
assays, although similar, are not the same. Appellants would like to bring to the Examiner's 
attention a recent decision in a microarray case by the Board of Patent Appeals and Interferences 
(Decision on Appeal No. 2006-1469). In its decision, the Board reversed the utility rejection, 
acknowledging that "there is a strong correlation between mRNA levels and protein expression, 
and the Examiner has not presented any evidence specific to the PR01866 polypeptide to refute 
that." (Page 9). Appellants submit that, in the instant application, the Examiner has likewise not 
presented any evidence specific to the PR0269 polypeptide to refute Applicant's assertion of a 
correlation between DNA levels, mRNA levels and protein expression. Appellants add that they 
analyze the microarray assay only in response to the Examiner's cited references. 

Appellants rely upon the gene amplification data of the PR0269 gene for patentable 
utility of the PR0269 polypeptides and their antibodies. This data is clearly disclosed in the 
instant specification in Example 92 which discloses that the gene encoding PR0269 showed 
significant amplification, ranging from 2.00-3.5-fold amplification in eight primar y lung tumors 
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or cell lines . Appellants have submitted, in their Response filed March 31, 2003, a Declaration 
by Dr. Audrey Goddard, which explains that a gene identified as being amplified at least 2-fold 
by the disclosed gene amplification assay in a tumor sample relative to a normal sample is useful 
as a marker for the diagnosis of cancer , and for monitoring cancer development and/or for 
measuring the efficacy of cancer therapy. Therefore, such a gene is useful as a marker for the 
diagnosis of lung cancer , and for monitoring cancer development and/or for measuring the 
efficacy of cancer therapy. 

Appellants have also submitted, in their Responses filed November 3, 2004, and August 
21, 2006, ample evidence to show that, in general, if a gene is amplified in cancer, it is more 
likely than not that the encoded protein will be expressed at an elevated level. For instance, the 
articles by Orntoft et aL 9 Hyman et al. 9 and Pollack et al collectively teach that in general gene 
amplification increases mRNA expression . Second, the Declarations of Dr. Paul Polakis: 
(Polakis I and II- made of record in Appellants* Responses filed October 3, 2004 and March 30, 
2006, respectively), shows that, in general there is a correlation between mRNA levels and 
polypeptide levels . Third, Appellants further submit that even if there were no correlation 
between gene amplification and increased mRNA/protein expression, (which Appellants 
expressly do not concede to), a polypeptide encoded by a gene that is amplified in cancer would 
still have a specific, substantial, and credible utility. Appellants submit that, as evidenced by the 
Ashkenazi Declaration (made of record in Appellants' Response filed October 16, 2003) and the 
teachings of Hanna and Mornin (made of record in Appellants' Response filed May 21, 2004), 
simultaneous testing of gene amplification and gene product over-expression enables more 
accurate tumor classification , even if the gene-product, the protein, is not over-expressed. This 
leads to better determination of a suitable therapy for the tumor, as demonstrated by a real-world 
example of the breast cancer marker HER-2/neu. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip arrays in 2004. Clearly, the research community believes that the 
information obtained from these chips is useful (i.e., that it is more likely than not informative of 
the protein level). Therefore, as a general rule, one skilled in the art would find it more likely 
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than not that PR0269 and antibodies binding to the PR0269 polypeptides are useful as a 
diagnostic tools for detecting lung tumors. 

Appellants submit that the teachings of the Examiner's cited references do not 
conclusively establish a prima facie case for lack of utility (as will be discussed in detail below). 
In particular, contrary to the Examiner's interpretation, the data of Haynes et al. (see Figure 1) 
and Chen et al (see Tables I and II of the paper) suggest that a positive correlation does exist 
between gene and protein expression. In addition, the teachings of Hu et ai, Lian et al. and 
Fessler et al., Greenbaum et al. do not show a lack of correlation between mRNA and protein 
expression for genes in general . In fact, these cited references make clear references to various 
limitations in their studies and to their conclusions that were drawn by excluding certain data 
points. Appellants respectfully submit that such conclusions cannot be used to validate the 
Examiner's conclusions regarding the correlation between gene and mRNA/ protein expression 
in general . Since the Examiner has not cited evidence that clearly addresses gene and mRNA/ 
protein expression in general a prima facie case for lack of utility has not been made. Taken 
together, although there are some examples in the scientific art that do not fit within the central 
dogma of molecular biology that there is generally a positive correlation between DNA, mRNA, 
and polypeptide levels, in general, in the majority of amplified genes , as exemplified by the 
teachings of Orntoft et al, Hyman et al, Pollack et al. f the two Polakis Declarations, the art 
overwhelmingly show that gene amplification influences gene expression at the mRNA and 
protein levels . The widespread, art accepted use of information obtained from array chips for 
detecting diagnostic markers lend further support that in general, one of skill in the art would 
reasonably expect in this instance, based on the amplification data for the PR0269 gene, that the 
PR0269 polypeptide is concomitantly overexpressed and has utility in the diagnosis of lung 
cancer or for individuals at risk for developing lung cancer. 

Accordingly, Appellants submit that when the proper legal standard is applied, one 
should reach the conclusion that the present application discloses at least one patentable utility 
for the claimed PR0269 polypeptides and its antibodies thereof. Accordingly, one of ordinary 
skill in the art would also understand how to make and use the recited antibodies for the 
diagnosis of lung cancer without any undue experimentation. 

These arguments are all discussed in further detail below under the appropriate headings. 
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Response to Rejections 

ISSUE 1. Claims 39-43 are Supported bv a Credible, Specific and Substantial Asserted 
Utility, and Thus Meet the Utility Requirement of 35 U.S.C. §101 and the "How to Use 
Prong" of the Enablement Requirement of 35 U.S.C. SI 12, First Paragraph 

The sole basis for the Examiner's rejection of Claims 39-43 under these sections is that 
the data presented in Example 92 of the present specification is allegedly insufficient under 
applicable legal standards to establish a patentable utility under 35 U.S.C. § 1 01 for the presently 
claimed subject matter, and further, since a patentable utility has not been established, one would 
not know how to use the claimed invention. 

Appellants strongly disagree and respectfully traverse the rejection. 

A. The Legal Standard For Utility Under 35 U.S.C. §101 

According to 35 U.S.C. §101: 

Whoever invents or discovers any new and useful process, machine, manufacture, 
or composition of matter, or any new and useful improvement thereof, may obtain 
a patent therefor, subject to the conditions and requirements of this title. 
(Emphasis added). 

In interpreting the utility requirement, in Brenner v. Manson, 1 the Supreme Court held 
that the quid pro quo contemplated by the U.S. Constitution between the public interest and the 
interest of the inventors required that a patent Applicant disclose a "substantial utility" for his or 

2 

her invention, i.e., a utility "where specific benefit exists in currently available form." The 
Court concluded that "a patent is not a hunting license. It is not a reward for the search, but 
compensation for its successful conclusion. A patent system must be related to the world of 

3 

commerce rather than the realm of philosophy." 

Later, in Nelson v. Bowler* the C.C.P.A. acknowledged that tests evidencing 
pharmacological activity of a compound may establish practical utility, even though they may 



Brenner v. Manson, 383 U.S. 519, 148 U.S.P.Q. (BNA) 689 (1966). 

2 Id. at 534, 148 U.S.P.Q. (BNA) at 695. 

3 Id. at 536, 148 U.S.P.Q. (BNA) at 696. 

4 Nelson v. Bowler, 626 F.2d 853, 206 U.S.P.Q. (BNA) 881 (C.C.P.A. 1980). 
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not establish a specific therapeutic use. The Court held that "since it is crucial to provide 
researchers with an incentive to disclose pharmaceutical activities in as many compounds as 
possible, we conclude adequate proof of any such activity constitutes a showing of practical 

utility." 5 

In Cross v. Iizuka^ the C.A.F.C. reaffirmed Nelson, and added that in vitro results might 
be sufficient to support practical utility, explaining that "in vitro testing, in general, is relatively 
less complex, less time consuming, and less expensive than in vivo testing. Moreover, in vitro 
results with the particular pharmacological activity are generally predictive of in vivo test results, 
i.e., there is a reasonable correlation there between." 7 The Court perceived, "No insurmountable 
difficulty" in finding that, under appropriate circumstances, "in vitro testing, may establish a 

8 

practical utility." 

The case law has also clearly established that Appellants 1 statements of utility are usually 

9 

sufficient, unless such statement of utility is unbelievable on its face. The PTO has the initial 

10 

burden to prove that Appellants' claims of usefulness are not believable on their face. In 
general, an Appellant's assertion of utility creates a presumption of utility that will be sufficient 
to satisfy the utility requirement of 35 U.S.C. §101, "unless there is a reason for one skilled in 

11 12 

the art to question the objective truth of the statement of utility or its scope." ' 



Id. at 856, 206 U.S.P.Q. (BNA) at 883. 

6 Cross v. lizuka, 753 F.2d 1047, 224 U.S.P.Q. (BNA) 739 (Fed. Cir. 1985). 

7 Id. at 1050, 224 U.S.P.Q. (BNA) at 747. 



9 In re Gazave, 379 F.2d 973, 154 U.S.P.Q. (BNA) 92 (C.C.P.A. 1967). 

10 

Ibid 

11 In reLanger, 503 F.2d 1380,1391, 183 U.S.P.Q. (BNA) 288, 297 (C.C.P.A. 1974). 

12 See also In re Mies, 628 F.2d 1322, 206 USPQ 885 (C.C.P.A. 1980); In re Irons, 340 F.2d 974, 144 
USPQ351 (1965); In re Sichert, 566F.2dll54, 1159, 196 USPQ 209, 212-13 (C.C.P.A. 1977). 
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13 

Compliance with 35 U.S.C. §101 is a question of fact. The evidentiary standard to be 
used throughout ex parte examination in setting forth a rejection is a preponderance of the 

14 

totality of the evidence under consideration. Thus, to overcome the presumption of truth that 
an assertion of utility by the Appellant enjoys, the Examiner must establish that it is more likely 
than not that one of ordinary skill in the art would doubt the truth of the statement of utility. 
Only after the Examiner made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Appellant. The issue will then be decided on the totality of evidence. 

The well established case law is clearly reflected in the Utility Examination Guidelines 
("Utility Guidelines"), 15 which acknowledge that an invention complies with the utility 
requirement of 35 U.S.C. §101, if it has at least one asserted "specific, substantial, and credible 
utility" or a "well-established utility." Under the Utility Guidelines, a utility is "specific" when 
it is particular to the subject matter claimed. For example, it is generally not enough to state that 
a nucleic acid is useful as a diagnostic without also identifying the conditions that are to be 
diagnosed. 

In explaining the "substantial utility" standard, M.P.E.P. §2107.01 cautions, however, 
that Office personnel must be careful not to interpret the phrase "immediate benefit to the 
public" or similar formulations used in certain court decisions to mean that products or services 
based on the claimed invention must be "currently available" to the public in order to satisfy the 
utility requirement. "Rather, any reasonable. use that an applicant has identified for the invention 
that can be viewed as providing a public benefit should be accepted as sufficient, at least with 
regard to defining a 'substantial utility.'" 16 Indeed, the Guidelines for Examination of 

17 

Applications for Compliance With the Utility Requirement, gives the following instruction to 

13 Raytheon v. Roper, 724 F.2d 951, 956, 220 U.S.P.Q. (BNA) 592, 596 (Fed. Cir. 1983) cert denied, 469 
US 835 (1984). 

H In re Oetiker, 977 F.2d 1443, 1445, 24 U.S.P.Q.2d (BNA) 1443, 1444 (Fed. Cir. 1992). 

15 66 Fed. Reg. 1092(2001). 

16 M.P.E.P. §2107.01. 

17 M.P.E.P. §2107 11(B)(1). 
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patent examiners: "If the Applicant has asserted that the claimed invention is useful for any 
particular practical purpose . . . and the assertion would be considered credible by a person of 
ordinary skill in the art, do not impose a rejection based on lack of utility." 

B. Proper Application of the Legal Standard 

Appellants submit that the evidentiary standard to be used throughout ex parte 
examination of a patent application is a preponderance of the totality of the evidence under 
consideration. Thus, to overcome the presumption of truth that an assertion of utility by the 
Appellant enjoys, the Examiner must establish that it is more likely than not that one of ordinary 
skill in the art would doubt the truth of the statement of utility. Only after the Examiner has 
made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
Appellant. 

Appellants respectfully submit that the data presented in Example 92 of the specification 
and the cumulative evidence of record support a "specific, substantial and credible" asserted 
utility for the presently claimed invention. 

Patentable utility for the PR0269 polypeptides and their antibodies is based upon the 
gene amplification data for the gene encoding the PR0269 polypeptide of SEQ ID NO: 96. 
Example 92 describes the results obtained using a very well-known and routinely employed 
polymerase chain reaction (PCR)-based assay, the TaqMan™ PCR assay, also referred to herein 
as the gene amplification assay. This assay allows one to quantitatively measure the level of 
gene amplification in a given sample, say, a tumor extract, or a cell line. It was well known in 
the art at the time the invention was made that gene amplification is an essential mechanism for 
oncogene activation. Appellants isolated genomic DNA from a variety of primary cancers and 
cancer cell lines that are listed in Table 9, including primary lung cancers of the type and stage 
indicated in Table 8 of the specification. The tumor samples were tested in triplicates with 
Taqman™ primers and with internal controls, beta-actin and GADPH in order to quantitatively 
compare DNA levels between samples. As a negative control, DNA was isolated from the cells 
often normal healthy individuals, which was pooled and used as a control and also, no-template 
controls (pages 222-234). The results of TaqMan™ PCR are reported in ACt units, as explained 
in the passage on pages 222-223. One unit corresponds to one PCR cycle or approximately a 2- 
fold amplification, relative to control, two units correspond to 4-fold, 3 units to 8-fold 
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amplification and so on. Using this PCR-based assay, Appellants showed that the gene encoding 

for PR0269 was amplified, that is, it showed approximately 1.04-1.60 ACt units which 

corresponds to 2.00-3.5-fold amplification in eight primary lung tumors or cell lines . 

Appellants submitted a Declaration by Dr. Audrey Goddard which provides a statement 

by an expert in the relevant art that "fold amplification" values of at least 2-fold are considered 

significant in the TaqMan™ PCR gene amplification assay. Appellants particularly draw the 

Board's attention to page 3 of the Goddard Declaration which clearly states that: 

It is further my considered scientific opinion that an at least 2-fold increase in 
gene copy number in a tumor tissue sample relative to a normal (i.e., non-tumor) 
sample is significant and useful in that the detected increase in gene copy number 
in the tumor sample relative to the normal sample serves as a basis for using 
relative gene copy number as quantitated by the TaqMan PCR technique as a 
diagnostic marker for the presence or absence of tumor in a tissue sample of 
unknown pathology. Accordingly, a gene identified as being amplified at least 2- 
fold by the quantitative TaqMan PCR assay in a tumor sample relative to a normal 
sample is useful as a marker for the diagnosis of cancer, for monitoring cancer 
development and/or for measuring the efficacy of cancer therapy. (Emphasis 
added). 

Accordingly, the 2.00-3.5-fold amplification in eight primary lung tumors or cell lines 
would be considered significant and credible by one skilled in the art, based upon the facts 
disclosed in the Goddard Declaration. 

Further, as discussed in detail below, Appellants have provided ample evidence in the 
form of articles from the art, like Orntoft et al. 9 Hyman et al. 9 Pollack et al, and over a 100 
references (see Evidence List items 14-145) and Declarations by experts in the field of oncology 
and gene expression, i.e.: the Declarations by Dr. Paul Polakis (I and II) and by Dr. Avi 
Ashkenazi , to show that, in general, if a gene is amplified in cancer, it is "more likely than not" 
that the encoded protein will also be expressed at an elevated level. 

C. A prima facie case of lack of utility has not been established 

As discussed above, the increase in DNA copy number for the PR0269 gene is 
significant. Further, the evidentiary standard to be used throughout ex parte examination of a 
patent application is a preponderance of the totality of the evidence under consideration. Thus, 
to overcome the presumption of truth that an assertion of utility by the applicant enjoys, the 
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Examiner must establish that it is more likely than not that one of ordinary skill in the art would 
doubt the truth of the statement of utility. 

Accordingly, it is not a legal requirement to establish a necessary correlation between an 
increase in the copy number of the DNA and protein expression levels that would correlate to the 
disease state or that it is imperative to find evidence that DNA amplification is " necessarily " or 
"always" associated with overexpression of the gene product. Appellants respectfully submit 
that when the proper evidentiary standard is applied, a correlation must be acknowledged. Only 
after the Examiner has made a proper prima facie showing of lack of utility, does the burden of 
rebuttal shift to the Applicant. 

Previously, the Examiner has indicated based on references Pennica et al, Konopka et 
al and Haynes et al to show that gene amplification data cannot reliably predict protein levels. 
Appellants have argued the references in great detail throughout prosecution, and these 
arguments are incorporated by reference herein for brevity. Appellants summarize the rejections 
and the arguments submitted below. 

The teachings of Pennica et al are specific to WISP genes, a specific class of closely 
related molecules. Pennica et al showed that there was good correlation between DNA and 
mRNA expression levels for the WISP A gene but not for WISP-2 and WISP-3 genes. WISPs 1- 
3 have no structural relationship to the PR0269 polypeptides of the present application. The 
apparent finding that for two out of three specific molecules, that are related to each other but 
have no relationship to PR0269, there was no correlation between gene amplification and the 
level of mRNA/protein expression does not establish, in general, that it is more likely than not 
that such correlation does not exist, and has no bearing whatsoever on determining the question 
whether such correlation is likely to exist between PR0269 gene amplification and 
mRNA/protein expression levels. As discussed above, the standard is not absolute certainty . 
Pennica et al has no teaching whatsoever about the correlation of gene amplification and protein 
expression for genes in general or PR0269 or related molecules in particular. 

Similarly, in Konopka et al, the Examiner has generalized a very specific result disclosed 
by Konopka et al to cover all genes. Konopka et al actually state that "[p]rotein expression is 
not related to amplification of the abl gene but to variation in the level of bcr-ab\ mRNA 
produced from a single Ph 1 template." (See Konopka et al ., Abstract, emphasis added). The 
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paper does not teach anything whatsoever about the correlation of protein expression and gene 
amplification in general and provides no basis for the generalization that apparently underlies 
the present rejection. The statement of Konopka et al that "[p]rotein expression is not related to 
amplification of the abl gene ..." is not sufficient to establish a prima facie case of lack of 
utility. Therefore, the combined teachings of Pennica et al and Konopka et al are not directed 
towards genes in general but to a single gene or genes within a single family and thus, their 
teachings cannot support a general conclusion regarding correlation between gene amplification 
and mRNA or protein levels. In addition, the abl gene has no structural relationship to the 
PR0269 gene of the present application and thus Konopka et al provides no information of 
specific relevance to the question whether for PR0269 there is a reasonable expectation that 
correlation between gene amplification and mRNA/protein expression levels is likely to exist. 

The Examiner also cited Haynes et al to show that transcript levels and protein levels do 
not correlate. However, Appellants had shown that Haynes themselves admit that " there was a 
general trend, although no strong correlation between protein [expression] and transcript levels" 
(see Figure 1 and page 1863, paragraph 2.1, last line). Therefore, when the proper legal standard 
is used, Haynes clearly supports the Appellants' position that in general, a positive correlation 
exists between mRNA and protein expression levels . Since accurate prediction is not the 
standard , a prima facie case of lack of utility has not been met based on the cited references 
Pennica et al, Konopka et al and Haynes et al Appellants respectfully submit that, contrary to 
the Examiner's assertion, none of the cited reference conclusively establish a prima facie case 
for lack of utility for the PR0269 molecule. 

Appellants have already discussed references Hu et al, et al, Chen et al, Lian et al, 
Fessler et al, Greenbaum et al in great detail in their previous responses (see Appeal brief filed 
September 13, 2005 and also the Reply brief filed January 17, 2006), and these arguments are 
hereby incorporated by reference for brevity. 

Briefly, the analytical methods utilized by Hu et al have certain statistical drawbacks, as 
the authors themselves admit, and Hu et al 's conclusions only apply to a specific type of breast 
tumor (estrogen receptor (ER)-positive breast tumor) and cannot be generalized to breast cancer 
genes in general, let alone to cancer genes in general. 
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Regarding Lian et al, Appellants respectfully submit that Lian et al only teach that 
protein expression may not correlate mRNA level in differentiating myeloid cells, and not about 
genes in general . In fact, the authors themselves admit that there were a number of problems 
with their data . For instance, at page 520 of this article, the authors explicitly express their 
concerns regarding the methods they utilized and the interpretation of their data stating that 
" [f|hese data must be considered with several caveats: membrane and other hydrophobic proteins 
and very basic proteins are not well displayed by the standard 2DE approach, and proteins 
presented at low level will be missed. In addition, to simplify MS analysis, we used a Coomassie 
dye stain rather than silver to visualize proteins, and this decreased the sensitivity of detection of 
minor proteins. " (Emphasis added). Besides, Lian et al 's conclusions are based on the 
Coomassie dye staining method, which is not a very sensitive method of measuring protein. 

Similarly, in Fessler et al, examined lipopoysaccharide-activated neutrophilins, in 
response to LPS stimulation. Fessler et al also used the Coomassie Blue dye staining method, 
and concede that it is known to have a limited protein binding range and a non-linear curve for 
protein detection. Protein identification in their study was also done using two-dimensional 
PAGE, which is, by their own admission, limited only to well-resolved regions of the gel , and 
therefore, may have performed less well with hydrophobic and high molecular weight proteins 
(see page 31301, col. 1). 

In addition, Chen et al also concede that there are problems with 2D gel protein 
detection and therefore, cannot accurately predict protein levels . For instance, Chen et al says 
that, "(i)t is apparent that without prior enrichment only a relatively small and highly selected 
population of long-lived, highly expressed proteins is observed. There are many more proteins in 
a given cell which are not visualized by such methods. Frequently it is the low abundance 
proteins that execute key regulatory functions " (page 1870, col. 1, emphasis added). Thus, Chen 
et al, concede that by selecting proteins visualized by 2D gels, they are likely to have excluded 
in their analysis many key regulatory proteins which could be candidate cancer markers. 

Further, the microarray analysis provided by Chen et al in fact support the Appellants 
general proposition that, even if protein levels cannot be accurately predicted (which is not 
required by the utility standard), in the majority of the proteins studied, it is most likely than not 
that an increase in gene amplification or mRNA levels generally correlates well with increased 
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protein levels. In Table 1, which lists 66 genes [the paper incorrectly states there are 69 genes 
listed] for which only one protein isoform is expressed, Chen et al show that 40 genes out of 66 
had a positive correlation between mRNA expression and protein expression , which clearly 
meets the standard for "more likely than not". Similarly, in Table II , 22 genes out of 30 [again 
the paper incorrectly states there are 29] and at least one isoform showed a positive correlation 
between mRNA expression and protein expression. Furthermore, 12 genes out of 29 showed a 
strong positive correlation [as determined by the authors] for at least one isoform. Therefore, the 
authors of the Chen paper clearly agreed that microarrays provided a reliable measure of the 
expression levels of the gene and could be used to identify genes whose overexpression is 
associated with tumors. 

Appellants note that Greenbaum is compares the expression of a number of different 
mRNAs and their corresponding proteins in yeast cells and not in human cells or in cancer cells 
versus normal cells. Greenbaum states that logically "we would assume that those ORFs that 
show a large degree of variation in their expression are controlled at the transcriptional level. 
The variability of the mRNA expression is indicative of the cell controlling the mRNA 
expression at different points of the cell cycle to achieve the resulting and desired protein. Thus, 
we would expect and we found a high degree of correlation (r-0.89) between the reference 
mRNA and protein levels for these particular ORFs: the cell has already put significant 
energy into dictating the final level of protein through tightly controlling the mRNA 
expression, (page 117.5 1 st column). Furthermore, Greenbaum states: "we found that ORFs 
that have higher than average levels of ribosomal occupancy - that is that a large percentage of 
their cellular mRNA concentration is associated with ribosomes (being translated) - have well 
correlated mRNA and protein expression levels. (Figure 2)." Therefore, contrary to the 
Examiner's assertion, Greenbaum does find high levels of correlation between mRNA and 
protein expression in yeast cells. 

In summary, Hu et al, Lian et al Chen et al, and Fessler et al Greenbaum et al, do not 
conclusively teach that, in general, protein levels cannot be accurately predicted from mRNA/ 
gene amplification levels. These authors concede that either due to insensitive protein detection 
methods or due their methodologies utilized in their protocols, some protein species may have 
been underrepresented over others. Therefore, the teachings of these references cannot be relied 
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upon to establish a prima facie showing of lack of utility. On the other hand, as noted even in 
Haynes et al and Chen et al, most genes showed a positive correlation between increased gene 
amplification, mRNA and translated protein. 

Alberts and Lewin et al (Items 15 and 16 of the Evidence List) 

The Examiner acknowledges that " the teachings of Alberts and Lewin (which) disclose 
that initiation of transcription is a common point for a cell to regulate the gene expression" but 
asserts that "it is not the only means of regulating gene expression". (Final Office Action of 
September 27, 2006). 

Appellants respectfully disagree and submit that the utility standard is not absolute 
certainty. Therefore, Appellants do not need to establish that the production of RNA is 
inevitably equated with production of protein in order to meet the utility standard. Instead, as 
long as it is more likely than not that a change of the transcription level of a gene gives rise to a 
change in translation level of a gene, the utility standard is met. 

Meric et al. (Item 92 of the Evidence List) 

The Examiner asserts that Meric teaches that "gene expression is quite complicated and is 
also regulated at the level of mRNA stability, mRNA translation, and protein stability" (Final 
Office Action of September 27, 2006). 

Appellants respectfully submit that Meric simply summarizes the translation regulation of 

cancer cells. Meric indicates that translation initiation is regulated in response to nutrient 

availability and mitogenic stimulation and is coupled with cell cycle progression and cell growth. 

Meric further discusses that alteration in translation control occur in cancer. For example, 

variant mRNA sequences can alter the translational efficiency of individual mRNA molecule. 

(See Abstract). Meric further teaches that the changes of the translational efficiency of a mRNA 

transcript depend on the mutation of a specific mRNA sequence. (Page 973, column 2 to 

page 91556, column 1). Meric never suggest that the translation of a cancer gene is suppressed 

in cancer in general and therefore, an increased mRNA levels will not yield an increased protein 

levels. To the contrary, Meric teaches that the translation efficiency of a number of cancer genes 

is enhanced in cancer cells compared to its normal counterpart. For instance, in patient with 

multiple myeloma, a C-T mutation in the c-myc IRES was identified and found to cause an 

enhanced initiation of translation. (Page 91556, column 1). Therefore, the level of proteins 
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encoded by these genes increases in cancer cells at an even higher magnitude than the mRNA 
level. 

Appellants emphasize that it is not a legal requirement to establish an absolute correlation 
between an increase in the mRNA level and protein expression levels that would correlate to the 
disease state nor is it imperative to find evidence that protein levels can be accurately predicted. 
Therefore, the Examiner has misinterpreted the teaching of Meric and applied improperly high 
evidentiary standard. Therefore, the Examiner has failed to establish a prima facie showing of 
lack of utility. 

Godbout et al (Item 46 of Evidence List) 

The Examiner asserts that of the references cited by Appellants in their IDS filed August 
21, 2006, only "Godbout is drawn to the predictability of protein levels based upon genomic 
DNA amplification" (Final Office Action of October 13, 2006 and page 10 of Advisory Action 
of April 2, 2007). 

Appellants respectfully submit that, as discussed in the RCE response of August 21, 
2006, there are several other references; for instance, Bea et al. investigated gene amplification, 
mRNA expression, and protein expression of the putative oncogene BMI-1 in human lymphoma 
samples, and supports Appellants' assertion that gene amplification is correlated with both 
increased mRNA and protein expression. 

The Examiner further asserts that unlike Godbout et al, the instant specification does not 
teach structure/ function analysis and the Examiner questions whether the level of genomic 
amplification of DDX1 gene is comparable to the disclosed PR0269. 

Appellants respectfully submit that it was never claimed that PR0269 is similar in any 
way to the DDX1 gene of Godbout et al \ in fact the Godbout reference was submitted to show 
good correlation between protein levels based upon genomic DNA amplification, which the 
Examiner clearly agrees with. Structure/ function data, which the Examiner requests, is not a 
requirement for the utility requirement, hence this rejection is improper. 

Lillev et al* King et al and Wildsmith et ah 

The Examiner cites Lilley et al to show that "the extrapolation that changes in transcript 

level will also result in corresponding changes in protein amount or activity cannot always be 

made." (Final Office Action of September 27, 2006). 
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Appellants repeat that it is not a legal requirement for utility, to establish a necessary 
correlation between an increase in the mRNA level and protein expression levels, or to show that 
changes in transcript level should always result in corresponding changes in protein amount or 
activity. Accordingly, the question is not whether a correlation between an increase in mRNA 
and protein expression levels always exists, rather if it is more likely than not that a person of 
ordinary skill in the pertinent art would recognize such a positive correlation. Nowhere in the 
Lilley paper does the author suggest that it is more likely than not that altered mRNA levels 
does not correlate with altered protein levels. On the contrary, the statement that "changes in 
transcript level will also result in corresponding changes in protein amount or activity cannot 
always be made" implies that the mRNA/protein correlation exists in most cases. 

The Examiner cites King et al to show that "mRNA levels do not necessarily correlates 
with protein levels." (Final Office Action of September 27, 2006). 

First of all, Appellants note that the instant application is not directed to the microarray 
assay as in King et al . As discussed above, the utility standard is not absolute certainty. 
Therefore, Appellants should not be required to show a "necessary correlation" between mRNA 
and protein levels in order to establish a patentable utility. King never indicates that it is more 
likely than not that a general correlation between the mRNA and protein levels for a gene does 
not exist, this paper alone does not suffice to establish a prima facie showing of lack of utility. 

Appellants also note that the author discussed numerous advantages of the microarray 
technology, which offers tremendous advantages in the study of human diseases. For instance, 
on page 2287, the author states that " microarrays can be expected to prove extremely valuable 
as tools for the study of the generic basis of complex diseases. The ability to measure expression 

profiles across entire genomes provides a level of information not previously attainable 

Microarrays make it possible to investigate differential gene expression in normal vs. diseased 
tissue, in treated vs. non-treated tissue, and in different stages during the natural course of the 
disease, all on a genomic scale. Gene expression profiles may help to unlock the molecular basis 

of phenotype, response to treatment, and heterogeneity of disease " Therefore, if anything, 

the King reference supports the use of the microarray in the diagnosis of human diseases, which 
silently assumes that, most probably, increases in mRNA levels correlate well with increases in 
protein levels which in turn impacts disease. 
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Similarly, Wildsmith never indicated that it is more likely than not that a general 
correlation between the mRNA and protein levels for a gene does not exist. Therefore, this paper 
is not sufficient to establish a prima facie showing of lack of utility. In fact, the Wildsmith paper 
discusses examples of a number of successes of microarray applications in the detection of 
human diseases (see Page 284). For instance, the author has pointed out that "one area of rapid 
progress using microarray technology is the increased understanding of cancer. Molecular 
pathologies are subgrouping cancers of tissues such as blood, skin, and breast, based on 
differential gene expression patterns. For example, within a small group of breast cancer tissue 
samples, Perou et al. distinguished two broad subgroups representing those expressing or 
alternatively lacking expression of the oestrogen receptor- y- gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared with the previous 
methods of gene amplification. Another example of the impact of this technology is in the 
identification of two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran et 
al, 2001). Microarray technology has also accelerated the understanding of the molecular events 
surrounding pulmonary fibrosis. Specially, two distinct clusters of genes associated with 
imflammation and fibrosis have been identified in a disease where, for years, the pathogenesis 
and treatment have remained unknown (Katsuma et al 9 2001)." 

Therefore, collectively, the references Lilley et al, Wildsmith et al, and King et al 
references cited by the Examiner show that the art indicates that, generally, if a mRNA is 
overexpressed in cancer, it is more likely than not that the encoded protein will also be expressed 
at an elevated level. 

Madoz Gurpide et al and Celts et al 

The Examiner cites Madoz-Gurpide et al and Celis et al to show that one cannot 
accurately predict protein levels based on mRNA levels (made of record by the Examiner in the 
Final Office Action mailed September 27, 2006). 

Madoz-Gurpide et al. explains that mRNA expression alone does not provide 
information regarding the "activation state, post-translational modification or localization of 
corresponding proteins" (emphasis added; page 168, col. 1). That is, Madoz-Gurpide et al 
explain that mechanisms are not apparent from mRNA expression alone. Madoz Gurpide et al 
further state that, it is "unclear" how well the reported RNA levels correlate with protein levels. 
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In support of this assertion, the authors cite only a single reference, namely, the Chen et al, 
which was discussed above and in detail in the Appeal Brief filed September 13, 2005. Madoz 
Gurpide et al also acknowledge that the DNA microarray studies, such as those carried out by 
Beer et al (specifically cited -by the authors at page 52), "justify the use of this technology for 
uncovering patterns of gene expression that are clinically informative" (emphasis added; 
page 53). Thus, while Madoz-Gurpide et al note that it is "more difficult to develop an 
understanding of disease at a mechanistic level using DNA microarrays," (emphasis added; 
page 53), Appellants respectfully point out that that "understanding of a disease at the 
mechanistic level" is not relevant to Appellants' assertions of utility, as discussed above. 
Accordingly, a prima facie case cannot be made based on the teachings within the Madoz- 
Gurpide et al. reference. 

Like Madoz-Gurpide et al, Celis et al. discuss that mechanisms are not apparent from 
mRNA expression alone. Celis et al note that "proteomics addresses problems that cannot be 
approached by DNA analysis, namely, relative abundance of the protein product, post- 
radiational modification, subcellular localization, turnover, interaction with other proteins as 
well as functional aspects" (page 6, col. 2). However, in their discussion, Celis et al cite Orntoft 
et al (Item 103 of Evidence List) and note that "in most cases there was a good correlation 
between transcript and protein levels." Celis et al further explain that those few cases which 
showed apparent discrepancies may have been due to other causes, such as post : transcriptional 
processing or degradation of the protein, or the choice of methods used to assess protein 
expression levels. Celis et al further note that the observation that there is often more change in 
mRNAs as compared to the proteins may be due to the fact that current technologies detect 
mainly high abundance proteins, while most of the changes affecting protein levels may involve 
low abundance proteins. Thus, the correlation between mRNA and protein levels may be even 
higher than typically observed, given these factors. Celis et al explain that proteomics is useful 
in combination with arrays "for the entire process of drug development and evaluation." (page 6; 
col. 1). Appellants further submit that significant correlations between gene and protein 
expression are most likely to be observed for genes associated with cancer, since as Celis et al. 
note, "transformation resulted in the abnormal expression of normal genes, rather than in the 
expression of new ones" (page 11, col. 1). Accordingly, alterations in gene amplification or 
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expression are more likely to be associated with altered protein expression in the case of cancer 
than in other cases where DNA microarrays are used, because, as explained by Celis et al, the 
alterations in expression levels of certain normal proteins are part of the process that leads to 
cancer. 

Accordingly, a prima facie case cannot be made based on the teachings within the Celis 
et al. reference. In fact, Appellants submit that, Celis et al support the Appellants contention 
that it is more likely than not that changes in mRNA levels reflect changes in protein levels, in 
general. 

Nagaraja et al* Waghrav et al* Saeanaliev et al 

The Examiner also asserts that "[comprehensive studies where significantly large 
numbers of transcripts and proteins were examined report that increases in mRNA and protein 
samples are not correlated." > (Final Office Action of September 27, 2006). In support of this 
assertion, the Examiner cites three new references, by Nagaraja et al., Waghray et al., and 
Sagynaliev et al. 

The Examiner cites Nagaraja et al as allegedly teaching that "the proteomic profiles 
indicated altered abundance of fewer proteins as compared to transcript profiles." (Final Office 
Action of September 27, 2006). 

Appellants respectfully submit that the fact that many more transcripts than proteins were 
found to be differentially expressed does not mean that most mRNA changes did not result in 
correlating protein changes, but merely reflects the fact that expression levels were only 
measured at all for many fewer proteins than transcripts . In particular, the total number of 
proteins whose expression levels could be visualized on silver- stained gels was only about 300 
(page 2332, col. 1), as compared to the approximately 14,500 genes on the microarray chips for 
which mRNA levels were measured (page 2336, col. 1). Since the expression levels of so many 
fewer proteins than transcripts were measured, it is hardly surprising that a smaller absolute 
number of proteins than mRNAs were found to be overexpressed, because the protein products 
of most of the overexpressed mRNAs would not have been among the small number of proteins 
identified on the gels. 

The Examiner next cites Waghray et a/., to the effect that "for most of the proteins 
identified, there was no appreciable concordant change at the RNA level," and that "[t]he change 
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in intensity for most of the affected proteins identified could not be predicted based on the level 
of the corresponding RNA." (Final Office Action of September 27, 2006). Appellants reiterate 
that they need only show that there is a correlation between mRNA and protein levels, such that 
mRNA overexpression generally predict protein overexpression. A showing that mRNA levels 
can be used to "accurately predict" the precise levels of protein expression is not required . 

Appellants also emphasize that Appellants are asserting that a measurable change in 
mRNA level generally leads to a corresponding change in the level of protein expression, not 
that changes in protein level can be used to predict changes in riiRNA level. Waghrav et ah did 
not take genes which showed significant mRNA changes and check the correspo nding protein 
levels . Instead, the authors looked at a small and unrepresentative number of proteins, and 
checked the corresponding mRNA levels. Waghray et ah acknowledge that only "[a] relatively 
small set of genes could be analyzed at the protein level, largely due to the limited sensitivity of 
2-D PAGE" (page 1337, col. i). In particular, while the authors examined the expression levels 
of 16,570 genes (page 1329, col 2), they were able to measure the expression levels of only 
1031 proteins (page 1333, col. 2). Waghray et ah does not teach that changes in mRNA 
expression were not correlated with changes in expression of the corresponding protein. All 
Waghray et ah state is that "for most of the proteins identified, there was no appreciable 
concordant change at the mRNA level" (page 1337, col. 2). This statement is not relevant to 
Appellants' assertion of utility, since Appellants are not asserting that changes in mRNA levels 
are the only cause of changes in protein levels. Waghray et ah do not contradict Appellants' 
assertion that changes in mRNA expression, in general, correspond to changes in expression of 
the corresponding protein. 

Lastly, the Examiner cites Sagynaliev et ah, as allegedly teaching that "it is also difficult 
to reproduce transcriptomics results with proteomics tools." In particular, the Examiner notes 
that according to Sagynaliev et ah, of 982 genes found to be differentially expressed in human 
CRC, only 177 (18%) have been confirmed using proteomics technologies. (Final Office Action 
of September 27, 2006). 

The Sagynaliev et al reference, titled " Web-based data warehouse on gene expression in 
human colorectal cancer" (emphasis added), drew conclusions based upon a literature survey of gene 
expression data published in human CRC , and not from experimental data. While a literature survey 
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can be a useful tool to assist researchers, the results may greatly over-represent or under-represent 
certain genes, and thus the conclusions may not be generally applicable. In particular, Appellants 
note that, as evidenced by Lian et al, Nagaraja et al, and Waghray et al, discussed above, the 
number of mRNAs examined in transcriptomics studies is typically much larger than the number of 
proteins examined in corresponding proteomics studies , due to the difficulties in detecting and 
resolving more than a small minority of all expressed proteins on 2D gels. Thus the fact that only 
18% of all genes found to be differentially expressed in human CRC have been confirmed using 
proteomics technologies does not mean that the corresponding proteins are not also differentially 
expressed, but is most likely due to the fact that the corresponding proteins were not identified on 2D 
gels, and thus their expression levels remain unknown. 

The authors of Sagynaliev et al acknowledge the many technical problems in finding 
pfoteomic data for CRC that can be matched to transcriptomic data to see if the two correlate. The 
authors state that "results have been obtained using heterogeneous samples in particular cell lines, 
whole tissue biopsies, and epithelial cells purified from surgical specimens." However, "Results 
obtained in cell lines do not allow accurate comparison between normal and cancer cells, and the 
presence/absence of proteins of interest has to be confirmed in biopsies." (Page 3072, left column.) 
In particular, the authors specifically note that "only a single study [1] provided differential display 
protein expression data obtained in the human patient, using whole tissue biopsy." (Page 3068, left 
column, second paragraph; see also, Table 2.) 

Appellants further note that Table 2 shows that 6 out of 8 published proteomics studies were 
done using 2-D PAGE. However, the authors state that "2-D PAGE or 2-D DIGE have well-known 
technological limitations . . . even under well-defined experimental conditions, 2-D PAGE parallel 
analysis of paired CRC samples is hampered by a significant variability." (Page 3077, left column, 
third paragraph.) Therefore, Appellants respectfully submit that it is well known in the art that there 
are problems associated with selecting only those proteins detectable by 2D gels. 

Li et ah 

The Examiner cites new reference Li et al as teaching that "68.8% of the genes showing 
over-representation in the genome did not show elevated transcript- levels." (Final Office Action 
of October 13, 2006). 
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Appellants respectfully point out that Li et al acknowledge that their results differed 
from those obtained by Hyman et al and Pollack et al (of record), who found a substantially 
higher level of correlation between gene amplification and increased gene expression. The 
authors note that "[t]his discordance may reflect methodologic differences between studies or 
biological differences between breast cancer and lung adenocarcinoma" (page 2629, col. 1). For 
instance, as explained in the Supplemental Information accompanying the Li article, genes were 
considered to be amplified if they had a copy number ratio of at least 1.40 . In the case of 
PR0269, as discussed in previously filed responses and in the Goddard Declaration (of record), 
an appropriate threshold for considering gene amplification to be significant is a copy number of 
at least 2.0 (which is a higher threshold). The PR0269 gene showed significant amplification of 
2.00-3.5-fold amplification in eight primary lung tumors or cell lines , and thus fully meets this 
standard. It is not surprising that, in the Li et al reference, by using a lower threshold of 1.4 for 
considering gene amplification, a higher number of genes not showing corresponding increases 
in mRNA expression were found. Nevertheless, the results of Li et al do not conclusively 
disprove that a gene with a substantially higher level of gene amplification, such as PRQ269 , 
would be expected to show a corresponding increase in transcript expression. 

Therefore, the Patent Office has failed to meet its initial burden of proof that Appellants' 
claims of utility are not substantial or credible. The arguments presented by the Examiner based 
on references Pennica et al, Konopka et al, Haynes et al } Hu et al, Chen et al, Lian et al, 
Fessler et al , Greenbaum et al , Lilley et al , King et al , Wildsmith et al , Nagaraja et al , 
Sagynaliev et al, Waghray et al, Madoz-Gurpide et al, Celis et al, and Li et al, do not provide 
sufficient reasons to doubt the statements by Appellants that PR0269 has utility as a diagnostic 
marker for lung cancer. Appellants once again remind the Examiner that only after the Examiner 
has made a proper prima facie showing of lack of utility, does the burden of rebuttal shift to the 
Appellant. Based on the above discussions, such a showing has not been made. Accordingly, the 
instant rejection should be withdrawn for the Examiner's lack of establishment of a prima facie 
showing. 

Moreover, Appellants acknowledge that, in certain instances, DNA/ mRNA and protein 
levels do not correlate. In fact, Appellants have included several such references directed 
towards a single gene or genes that lack a correlation in their IDS filed August 21, 2006. The 
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references discussed in the Preliminary Amendment filed August 21, 2006, and the arguments 
therein are hereby incorporated by reference for brevity. The IDS included references that 
studied single genes or gene families, multiple or large families of genes, and included studies 
that a wide variety of techniques, including gene amplification and microarray. Regardless of 
the techniques employed, by and large, increased genes/ transcripts levels mostly corr elated with 
increased protein levels, even if accurate predictions of proteins could not be made . As 
discussed throughout prosecution, the law does not require the existence of a "necessary" 
correlation between DNA/mRNA and protein levels, or that protein levels be "accurately 
predicted." In fact, authors in several of the cited references (cited both, by the Examiner, and 
by Appellants) themselves acknowledge that there is a general correlation between protein 
expression and transcript levels and DNA levels, which meets the "more likely than not 
standard." 

In summary, Appellants maintain that even though there are certain instances where a 
correlation, between DNA/mRNA and protein levels do not exist, in most cases , there is 
generally good correlation between them, and this was collectively demonstrated in the more 
than 100 references submitted by the Appellants in the IDS filed August 21, 2006. 

D. The Gene Amplification Data Establishes Credible, Substantial and Specific 
Patentable Utility for the PRQ269 Polypeptide and its Antibodies 

On the other hand, as discussed throughout prosecution, Appellants submit that Example 
92 of the specification further discloses that, "(a)mplification is associated with overexpression 
of the gene product, indicating that the polypeptides are useful targets for therapeutic 
intervention in certain cancers such as lung, colon, breast and other cancers and diagnostic 
determination of the presence of those cancers" (Emphasis added). Appellants have also 
submitted ample evidence to show that, in general, if a gene is amplified in cancer, it is "more 
likely than not" that the encoded protein will also be expressed at an elevated level. 

Besides the reference, the Declaration by Dr. Paul Polakis (Polakis I - made of record in 
Appellants 1 Response filed October 16, 2003), principal investigator of the Tumor Antigen 
Project of Genentech, Inc., the assignee of the present application, explains that in the course of 
Dr. Polakis' research using microarray analysis, he and his co-workers identified approximately 
200 gene transcripts that are present in human tumor cells at significantly higher levels than in 
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corresponding normal human cells. Appellants submit that Dr. Polakis' Declaration was 
presented to support the position that there is a correlation between mRNA levels and 
polypeptide levels. The second Declaration by Dr. Polakis (Polakis II) presented evidentiary 
data in Exhibit B. Exhibit B of the Declaration identified 28 gene transcripts out of 31 gene 
transcripts (i.e., greater than 90%) that showed good correlation between tumor mRNA and 
tumor protein levels. As Dr. Polakis' Declaration (Polakis II) says "[a]s such, in the cases where 
we have been able to quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast majority of cases, there is a 
very strong correlation between increases in mRNA expression and increases in the level of 
protein encoded by that mRNA." Accordingly, Dr. Polakis has provided the facts to enable the 
Examiner to draw independent conclusions regarding protein data. Appellants further emphasize 
that the opinions expressed in the Polakis Declaration, including in the above quoted statement, 
are all based on factual findings. For instance, antibodies binding to about 30 of these tumor 
antigens were prepared and mRNA and protein levels were compared. In approximately 80% of 
the cases , the researchers found that increases in the level of a particular mRNA correlated with 
changes in the level of protein expressed from that mRNA when human tumor cells are 
compared with their corresponding normal cells . Therefore, Dr. Polakis* research, which is 
referenced in his Declaration, shows that, in general there is a correlation between increased 
mRNA and polypeptide levels . Hence, one of skill in the art would reasonably expect that, based 
on the gene amplification data of the PR0269 gene, the PR0269 polypeptide is concomitantly 
overexpressed in lung tumors studied as well. 

Appellants further note that the sale of gene expression chips to measure mRNA levels is 
a highly successful business, with a company such as Affymetrix recording 168.3 million dollars 
in sales of their GeneChip® arrays in 2004. Clearly, the research community believe that the 
information obtained from these chips is useful (i.e., that it is more likely than not that the results 
are informative of protein levels). 

Thus, based on the asserted utility for PR0269 in the diagnosis of lung tumors, the 
reduction to practice of the instantly claimed protein sequence of SEQ ID NO: 96 in the present 
application (see page 12, line 30 to page 13, line 1, page 40, lines 1-11, page 103, lines 4-12), the 
disclosure of the step-by-step protocols for making chimeric PRO polypeptides, including those 
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wherein the heterologous polypeptide is an epitope tag or an Fc region of an immunoglobulin in 
the specification (at page 1 16, lines 12 to 359), the disclosure of a step-by-step protocol for 
making and expressing PR0269 in appropriate host cells (in Examples 53-56), the step-by-step 
protocol for the preparation, isolation and detection of monoclonal, polyclonal and other types of 
antibodies against the PR0269 protein in the specification (in Examples 57-59), for example, the 
preparation of antibodies that bind PRO polypeptides is set forth in the specification at pages 
139-147 and Example 57, and the disclosure of the gene amplification assay in Example 92, the 
skilled artisan would know exactly how to make and use the claimed antibodies for the diagnosis 
of lung cancers. Appellants submit that based on the detailed information presented in the 
specification and the advanced state of the art in oncology, the skilled artisan would have found 
such testing routine and not 'undue.' 

Therefore, Appellants respectfully request reconsideration and reversal of this 
outstanding rejections under 35 U.S.C. §101 and §112, First Paragraph to Claims 39-43. 
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CONCLUSION 

For the reasons given above, Appellants submit that present specification clearly 
describes, details and provides a patentable utility for the claimed invention. Moreover, it is 
respectfully submitted that based upon this disclosed patentable utility, the present specification 
clearly teaches "how to use" the presently claimed polypeptide and its antibodies. As such, 
Appellants respectfully request reconsideration and reversal of the outstanding rejection of 
Claims 39-43. 

The Commissioner is authorized to charge any fees which may be required, including 
extension fees, or credit any overpayment to Deposit Account No. 08-1641 (referencing 
Attorney's Docket No. 39780-1618 P2C34) . 

Respectfully submitted, 

Date: August 13, 2007 

HELLER EHRMAN LLP 

275 Middlefield Road 
Menlo Park, California 94025-3506 
Telephone: (650) 324-7000 
Facsimile: (650) 324-0638 



GmgerR. Dreger (Reg. No. 33,055) 
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VIII. CLAIMS APPENDIX 

Claims on Appeal 

39. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO: 96. 

40. The antibody of Claim 39 which is a monoclonal antibody. 

41 . The antibody of Claim 39 which is a humanized antibody. 

42. The antibody of Claim 39 which is an antibody fragment. 

43. An isolated antibody that specifically binds to the polypeptide of SEQ ID NO: 96 
which is labeled. 
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RELATED PROCEEDINGS APPENDIX 

None- no decision rendered by a Court or the Board in any related proceedings identified 
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DECLARATION OF AUDREY D. GODDARD, Ph.D UNDER 37 CF.IL § 1.132 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

1, Audrey D. Goddard, Ph.D. do hereby declare and say as follows: 

1 . I am a Senior Clinical Scientist at the Experimental Medicine/BioOncology, Medical 
Affairs Department of Genentech, Inc., South San Francisco, California 94080. 

2. Between 1993 and 200 1 , 1 headed the DNA Sequencing Laboratory at the Molecular 
Biology Department of Genentech, Inc. During this time, my responsibilities included the 
identification and characterization of genes contributing to the oncogenic process, and determination 
of the chromosomal localization of novel genes. 

3. My scientific Curriculum Vitae, including my list of publications, is attached to and 
forms part of this Declaration (Exhibit A). 



\ 




Serial No.: * 
Filed: * 

4. I am familiar with a variety of techniques known in the art for detecting and 
quantifying the amplification of oncogenes in cancer, including the quantitative TaqMan PCR (i.e., 
"gene amplification") assay described in the above captioned patent application. 

5. The TaqMan PCR assay is described, for example, in the following scientific 
publications: Higuchi et al. 9 Biotechnology 10:413-417 (1992) (Exhibit B); Livak et aL, PCR 
Methods AppL 4:357-362 (1995) (Exhibit C) and Heid et al., Genome Res. 6:986-994 (1996) - 
(Exhibit D). Briefly, the assay is based on the principle that successful PCR yields a fluorescent 
signal due to Taq DNA polymerase-mediated exonuclease digestion of a fluorescently labeled 
oligonucleotide that is homologous to a sequence between two PCR primers. The extent of 
digestion depends directly on the amount of PCR, and can be quantified accurately by measuring the 
increment in fluorescence that results from decreased energy transfer. This is an extremely sensitive 
technique, which allows detection in the exponential phase of the PCR reaction and, as a result, 
leads to accurate determination of gene copy number. 

6. The quantitative fluorescent TaqMan PCR assay has been extensively and 
successfully used to characterize genes involved in cancer development and progression. 
Amplification of protooncogenes has been studied in a variety of human tumors, and is widely 
considered as having etiological, diagnostic and prognostic significance. This use of the quantitative 
TaqMan PCR assay is exemplified by the following scientific publications: Pennica et aL, Proc. 
Natl. Acad. Sci. USA . 95(25): 147 17- 14722 (1998) (Exhibit E); Pitti et a/., Nature 
396(6712);699-703 (1998) (Exhibit F) and Bieche et a/.. Int. J. Cancer 78:661-666 (1998) (Exhibit 
G), the first two of which I am co-author. In particular, Pennica et aL have used the quantitative 
TaqMan PCR assay to study relative gene amplification of WISP and c-myc in various cell lines, 
colorectal tumors and normal mucosa. Pitti et aL studied the genomic amplification of a decoy 
receptor for Fas ligand in lung and colon cancer, using the quantitative TaqMan PCR assay. Bieche 
et aL used the assay to study gene amplification in breast cancer. 
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7. It is my personal experience that the quantitative TaqMan PCR technique is 
technically sensitive enough to detect at least a 2-fold increase in gene copy number relative to 
control It is further my considered scientific opinion that an at least 2-fold increase in gene copy 
number in a tumor tissue sample relative to a normal (i.e., non-tumor) sample is significant and 
useful in that the detected increase in gene copy number in the tumor sample relative to the normal 
sample serves as a basis for using relative gene copy number as quantitated by the TaqMan PCR 
technique as a diagnostic marker for the presence or absence of tumor in a tissue sample of unknown 
pathology. Accordingly, a gene identified as being amplified at least 2-fold by the quantitative 
TaqMan PCR assay in a tumor sample relative to a normal sample is useful as a marker for the 
diagnosis of cancer, for monitoring cancer development and/or for measuring the efficacy of cancer 
therapy. 

8. I declare further that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true. I declare that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereoa 

Date Audrey D. Goddard, PLD. 
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AUDREY D. GODDARD, Ph.D. 



Genentech, Inc. 110 Congo St. 

1 DNA Way San Francisco, CA, 94131 

South San Francisco, CA, 94080 415.841.9154 

650.225.6429 415.819.2247 (mobile) 

goddarda@gene.com agoddard@pacbell.net 



PROFESSIONAL EXPERIENCE 

Genentech, Inc. 1993-present 
South San Francisco, CA 

2001 - present Senior Clinical Scientist 

Experimental Medicine / BioOncology, Medical Affairs 

Responsibilities: 

• Companion diagnostic oncology products 

• Acquisition of clinical samples from Genentech's clinical trials for translational research 

• Translational research using clinical specimen and data for drug development and 
diagnostics 

• Member of Development Science Review Committee, Diagnostic Oversight Team, 21 CFR 
Part 1 1 Subteam 

Interests: 

• Ethical and legal implications of experiments with clinical specimens and data 

• Application of pharmacogenomics in clinical trials 



1998-2001 Senior Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities: 

• Management of a laboratory of up to nineteen -including postdoctoral fellow, associate 
scientist senior research associate and research assistants/associate levels 

• Management of a $750K budget 

• DNA sequencing core facility supporting a 350+ person research facility. 

• DNA sequencing for high throughput gene discovery, - ESTs, cDNAs, and constructs 

• Genomic sequence analysis and gene identification 

• DNA sequence and primary protein analysis 

Research: 

• Chromosomal localization of novel genes 

• Identification and characterization of genes contributing to the oncogenic process 

• Identification and characterization of genes contributing to inflammatory diseases 

• Design and development of schemes for high throughput genomic DNA sequence analysis 

• Candidate gene prediction and evaluation 




Audrey D. Goddard, P/i.D page 2 of 9 

1993-1998 Scientist 

Head of the DNA Sequencing Laboratory, Molecular Biology Department, Research 
Responsibilities 

• DNA sequencing core facility supporting a 350+ person research facility 

• Assumed responsibility for a pre-existing team of five technicians and expanded the group 
into fifteen, introducing a level of middle management and additional areas of research 

• Participated in the development of the basic plan for high throughput secreted protein 
discovery program - sequencing strategies, data analysis and tracking, database design 

• High throughput EST and cDNA sequencing for new gene identification. 

• Design and implementation of analysis tools required for high throughput gene identification. 

• Chromosomal localization of genes encoding novel secreted, proteins. 

Research: 

• Genomic sequence scanning for new gene discovery. 

• Development of signal peptide selection methods. 

• Evaluation of candidate disease genes. 

• Growth hormone receptor gene SNPs in children with Idiopathic short stature 



Imperial Cancer Research Fund 1989-1992 
London, UK with Dr. Ellen Solomon 

6/89 -12/92 Postdoctoral Fellow 

• Cloning and characterization of the genes fused at the acute promyelocyte leukemia 
translocation breakpoints on chromosomes 17 and 15. 

• Prepared a successfully funded European Union multi-center grant application 

McMaster University 1983 
Hamilton, Ontario, Canada with Dr. G. D. Sweeney 

5/83 - 8/83: NSERC Summer Student 

• In vitro metabolism of p-naphthoflavone in C57BI/6J and DBA mice 



EDUCATION 

University of Toronto 

Toronto, Ontario, Canada. 1989 
Department of Medical 
Biophysics. 

Honours B.Sc McMaster University, 

"The 'in vitro metabolism of the cytochrome P-448 Hamilton, Ontario, Canada. 1983 
inducer p-naphthoflavone in C57BL/6J mice." Department of Biochemistry 
Supervisor: Dr. G. D. Sweeney 



Ph.D. 

"Phenotypic and genotypic effects of mutations in 
the human retinoblastoma gene." 
Supervisor: Dr. R. A. Phillips 
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ACADEMIC AWARDS 



Imperial Cancer Research Fund Postdoctoral Fellowship 1989-1992 

Medical Research Council Studentship 1983-1988 

NSERC Undergraduate Summer Research Award 1983 

Society of Chemical Industry Merit Award (Hons. Biochem.) 1983 

Dr. Harry Lyman Hooker Scholarship 1981-1983 

J.LW. Gill Scholarship 1981-1982 

Business and Professional Women's Club Scholarship 1980-1981 

Wyerhauser Foundation Scholarship 1979-1980 



INVITED PRESENTATIONS 

Genentech's gene discovery pipeline: High throughput identification, cloning and 
characterization of novel, genes. Functional Genomics: From Genome to Function, Litchfield 
Park, AZ, USA. October 2000 

High throughput identification, cloning and characterization of novel genes. G2K:Back to 
Science, Advances in Genome Biology and Technology I. Marco Island, FL, USA. February 
2000 

Quality control in DNA Sequencing: The use of Phred and Phrap. Bay Area Sequencing 
Users Meeting, Berkeley, CA, USA April 1999 

High throughput secreted protein identification and cloning. Tenth International Genome 
Sequencing and Analysis Conference, Miami, FL, USA September 1998 

The evolution of DNA sequencing: The Genentech perspective. Bay Area Sequencing Users 
Meeting, Berkeley, CA, USA. May 1998 

Partial Growth Hormone Insensitivity: The role of GH-receptor mutations in Idiopathic Short 
Stature. Tenth Annual National Cooperative Growth Study Investigators Meeting, San 
Francisco, CA, USA. October, 1996 

Growth hormone (GH) receptor defects are present in selected children with non-GH-deficient 
short stature: A molecular basis for partial GH-insensitivity. 76 th Annual Meeting of The 
Endocrine Society, Anaheim, CA, USA. June 1994 

A previously uncharacterized gene, myl, is fused to the retinoic acid receptor alpha gene in 
acute promyelocyte leukemia. XV International Association for Comparative Research on 
Leukemia and Related Disease, Padua, Italy. October 1991 
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PATENTS 

Goddard A, Godowski PJ, Gurney AL NL2 Tie ligand homologue polypeptide. Patent 
Number: 6,455,496. Date of Patent: Sept. 24, 2002. 

Goddard A, Godowski PJ and Gurney AL. NL3 Tie ligand homologue nucleic acids. Patent 
Number: 6,426,218. Date of Patent: July 30, 2002. 

Godowski P, Gurney A, Hillan KJ, Botstein D, Goddard A, Roy M, Ferrara N, Tumas D, 
Schwall R. NL4 Tie ligand homologue nucleic acid. Patent Number: 6,4137,770. Date of 
Patent: July 2, 2002. 

Ashkenazi A, Fong S, Goddard A, Gurney AL, Napier MA, Tumas D, Wood Wl. Nucleic acid 
encoding A-33 related antigen poly peptides. Patent Number: 6,410,708. Date of Patent:: 
Jun. 25, 2002. 

Botstein DA, Cohen RL, Goddard AD, Gurney AL, Hillan KJ, Lawrence DA, Levine AJ, 
Pennica D, Roy MA and Wood Wl. WISP polypeptides and nucleic acids encoding same. 
Patent Number: 6,387,657. Date of Patent: May 14, 2002. 

Goddard A, Godowski PJ and Gurney AL. Tie ligands. Patent Number: 6,372,491. Date of 
Patent: April 16, 2002. 

Godowski PJ, Gurney AL, Goddard A and Hillan K. TIE ligand homologue antibody. Patent 
Number: 6,350,450. Date of Patent: Feb. 26, 2002. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Tie 
receptor tyrosine kinase ligand homologues. Patent Number: 6,348,351 . Date of Patent: 
Feb. 19, 2002. 

Goddard A, Godowski PJ and Gurney AL. Ligand homologues. Patent Number: 6,348,350; 
Date of Patent: Feb. 19, 2002. 

Attie KM, Carlsson LMS, Gesundheit N and Goddard A. Treatment of partial growth 
hormone insensitivity syndrome. Patent Number: 6,207,640. Date of Patent: March 27, 
2001. 

Fong S, Ferrara N, Goddard A, Godowski PJ, Gurney AL, Hillan K and Williams PM. Nucleic 
acids encoding NL-3. Patent Number: 6,074,873. Date of Patent: June 13, 2000 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,824,642. Date of Patent: October 20, 1998 

Attie K, Carlsson LMS, Gesunheit N and Goddard A. Treatment of partial growth hormone 
insensitivity syndrome. Patent Number: 5,646,1 13. Date of Patent: July 8, 1997 

Multiple additional provisional applications filed 
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PUBLICATIONS 

Seshasayee D, Dowd P, Gu Q, Erickson S, Goddard AD Comparative sequence analysis of 
the HER2 locus in mouse and man. Manuscript in preparation. 

Abuzzahab MJ, Goddard A, Grigorescu F, Lautier C, Smith RJ and Chernausek SD. Human 
IGF-1 receptor mutations resulting in pre- and post-natal growth retardation. Manuscript in 
preparation. 

Aggarwal S, Xie, M-H, Foster J, Frantz G, Stinson J, Corpuz RT, Simmons L, Hillan K, 
Yansura DG, Vandlen RL f Goddard AD and Gurney AL FHFR, a novel receptor for the 
fibroblast growth factors. Manuscript submitted. 

Adams SH, Chui C, Schilbach SL, Yu XX, Goddard AD, Grimaldi JC, Lee J, Dowd P, Colman 
S. ( Lewin DA. (2001) BFIT, a unique acyl-CoA thioesterase induced in thermogenic brown 
adipose tissue: Cloning, organization of the human gene, and assessment of a potential link 
to obesity. Biochemical Journal 360: 135-142. 

Lee J. Ho WH. Maruoka M. Corpuz RT. Baldwin DT. Foster JS. Goddard AD. Yansura DG. 
Vandlen RL. Wood Wl. Gurney AL. (2001) IL-17E, a novel proinflammatory ligand for the IL- 
17 receptor homolog IL-17RM. Journal of Biological Chemistry 276(2): 1660-1664. 

Xie M-H, Aggarwal S, Ho W-H, Foster J, Zhang Z, Stinson J, Wood Wl, Goddard AD and 
Gurney AL. (2000) Interleukin (IL)-22 f a novel human cytokine that signals through the 
interferon-receptor related proteins CRF2-4 and IL-22R. Journal of Biological Chemistry 275: 
31335-31339. 

Weiss GA, Watanabe CK, Zhong A, Goddard A and Sidhu SS. (2000) Rapid mapping of 
protein functional epitopes by combinatorial alanine scanning. Proc. Natl. Acad. Sci. USA 97: 
8950-8954. 

Guo S, Yamaguchi Y, Schilbach S t Wada T.;Lee J, Goddard A, French D , Handa H, 
Rosenthal A. (2000) A regulator of transcriptional elongation controls vertebrate neuronal 
development. Nature 408: 366-369. 

Yan M, Wang L-C, Hymowitz SG, Schilbach S, Lee J, Goddard A, de Vos AM, Gao WQ, Dixit 
VM. (2000) Two-amino acid molecular switch in an epithelial morphogen that regulates 
binding to two distinct receptors. Science 290: 523-527. 

Sehl PD, Tai JTN, Hillan KJ t Brown LA, Goddard A, Yang R, Jin H and Lowe DG. (2000) 
Application of cDNA microarrays in determining molecular phenotype in cardiac growth, 
development, and response to injury. Circulation 101: 1990-1999. 

Guo S, Brush J, Teraoka H, Goddard A, Wilson SW, Mullins MC and Rosenthal A. (1999) 
Development of noradrenergic neurons in the zebrafish hindbrain requires BMP, FGF8, and 
the homeodomain protein soulless/Phox2A. Neuron 24: 555-566. 

Stone D, Murone, M, Luoh, S, Ye W, Armanini P, Gurney A, Phillips HS, Brush, J, Goddard 
A, de Sauvage FJ and Rosenthal A, (1999) Characterization of the human suppressor of 
fused; a negative regulator of the zinc-finger transcription factor GIL J. Cell Sci. 112: 4437- 
4448. 

Xie M-H, Holcomb I, Deuel B, Dowd P, Huang A, Vagts A, Foster J, Liang J, Brush J, Gu Q, 
Hillan K, Goddard A and Gurney, A.L. (1999) FGF-19, a novel fibroblast growth factor with 
unique specificity for FGFR4. Cytokine 11: 729-735. 



Audrey 0. Goddard, Ph.D page 6 of 9 



Yan M, Lee J, Schilbach S, Goddard A and Dixit V. (1999) mE10, a novel caspase 
recruitment domain-containing proapoptotic molecule. J. Biol. Chem. 274(15): 10287-10292. 

Gurney AL, Marsters SA, Huang RM, Pitti RM, Mark DT, Baldwin DT, Gray AM, Dowd P, 
Brush J, Heldens S, Schow P, Goddard AD, Wood W I, Baker KP, Godowski PJ and 
Ashkenazi A. (1999) Identification of a new member of the tumor necrosis factor family and its 
receptor, a human ortholog of mouse GITR. Current Biology 9(4): 215-218. 

Ridgway JBB.'Ng E, Kern JA ,Lee J, Brush J, Goddard A and Carter P. (1999) Identification 
of a human anti-CD55 single-chain Fv by subtractive panning of a phage library using tumor 
and nontumor cell lines. Cancer Research 59: 2718-2723. 

Pitti RM, Marsters SA, Lawrence DA, Roy M, Kischkel FC, Dowd P, Huang A, Donahue CJ, 
Sherwood SW, Baldwin DT, Godowski PJ, Wood Wl, Gurney AL, Hillan KJ, Cohen RL, 
Goddard AD, Botstein D and Ashkenazi A. (1998) Genomic amplification of a decoy receptor 
for Fas ligand in lung and colon cancer. Nature 396(6712): 699-703. 

Pennica D, Swanson TA, Welsh JW, Roy MA, Lawrence DA ( Lee J, Brush J, Taneyhill LA, 
Deuel B, Lew M, Watanabe C T Cohen RL, Melhem MF, Finley GG, Quirke P, Goddard AD, 
Hillan KJ, Gurney AL, Botstein D and Levine AJ. (1998) WISP genes are members of the 
connective tissue growth factor family that are up-regulated in wnt-1 -transformed cells and 
aberrantly expressed in human colon tumors. Proc. Natl. Acad. ScL USA. 95(25): 14717- 
14722. 

Yang RB, Mark MR, Gray A, Huang A, Xie MH, Zhang M, Goddard A, Wood Wl, Gurney AL 
and Godowski PJ. (1998) Toll-like receptor-2 mediates lipopolysaccharide-induced cellular 
signalling. Nature 395(6699): 284-288. 

Merchant AM, Zhu Z, Yuan JQ, Goddard A, Adams CW, Presta LG and Carter P. (1998) An 
efficienUroute to human bispecific IgG. Nature Biotechnology 16(7): 677-681 . 

Marsters SA, Sheridan JP, Pitti RM, Brush J, Goddard A and Ashkenazi A. (1998) 
Identification of a ligand for the death-domain-containing receptor Apo3. Current Biology 8(9): 
525-528. 

Xie J, Murone M, Luoh SM, Ryan A, Gu Q, Zhang C, Bonifas JM, Lam CW, Hynes M, 
Goddard A, Rosenthal A, Epstein EH Jr. and de Sauvage FJ. (1998) Activating Smoothened 
mutations in sporadic basal-cell carcinoma. Nature. 391(6662): 90-92. 

Marsters SA, Sheridan JP, Pitti RM, Huang A, Skubatch M, Baldwin D, Yuan J, Gurney A, 
Goddard AD, Godowski P and Ashkenazi A. (1997) A novel receptor for Apo2l/TRAIL 
contains a truncated death domain. Current Biology. 7(12): 1003-1006. 

Hynes M t Stone DM, Dowd M, Pitts-Meek S, Goddard A, Gurney A and Rosenthal A. (1997) 
Control of cell pattern in the neural tube by the zinc finger transcription factor Gli-1. Neuron 
19: 15-26. 

Sheridan JP, Marsters SA, Pitti RM, Gurney A., Skubatch M, Baldwin D, Ramakrishnan L, 
Gray CL, Baker K, Wood Wl, Goddard AD, Godowski P, and Ashkenazi A. (1997) Control of 
TRAIL-lnduced Apoptosis by a Family of Signaling and Decoy Receptors. Science 277 
(5327): 818-821. 
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Goddard AD, Dowd P, Chernausek S t Geffner M, Gertner J, Hintz R, Hopwood N, Kaplan S, 
Plotnick L, Rogol A t Rosenfield R, Saenger P, Mauras N, Hershkopf R, Angulo M and Attie, K. 
(1997) Partial growth hormone insensitivity: The role of growth hormone receptor mutations in 
idiopathic short stature. J. Pediatr. 131: S51-55. 

Klein RD f Sherman D, Ho WH, Stone D, Bennett GL, Moffat B t Vandlen R, Simmons L, Gu Q, 
Hongo JA, Devaux B f Poulsen K, Armanini M, Nozaki C, Asai N, Goddard A, Phillips H, 
Henderson CE, Takahashi M and Rosenthal A. (1997) A GPI-linked protein that interacts with 
Ret to form a candidate neurturin receptor. Nature. 387(6634): 717-21. 

Stone DM, Hynes M, Armanini M, Swanson TA, Gu Q, Johnson RL, Scott MP, Pennica D, 
Goddard A, Phillips H, Noll M, Hooper JE f de Sauvage F and Rosenthal A. (1996) The 
tumour-suppressor gene patched encodes a candidate receptor for Sonic hedgehog. Nature 
384(6605): 129-34. 

Marsters SA, Sheridan JP, Donahue CJ, Pitti RM, Gray CL, Goddard AD, Bauer KD and 
Ashkenazi A. (1996) Apo-3, a new member of the tumor necrosis factor receptor family, 
contains a death domain and activates apoptosis and NF-kappa p. Current ' Biology 6(12): 
1669-76. 

Rothe M, Xiong J, Shu HB, Williamson K, Goddard A and Goeddel DV. (1996) l-TRAF is a 
novel TRAF-interacting protein that regulates TRAF-mediated signal transduction. Proc. Natl. 
Acad. ScL USA 93: 8241-8246. 

Yang M, Luoh SM, Goddard A, Reilly D, Henzel W and Bass S. (1 996) The bglX gene 
located at 47.8 min on the Escherichia coli chromosome encodes a periplasmic beta- 
glucosidase. Microbiology 142: 1659-65. 

Goddard AD and Black DM. (1996) Familial Cancer in Molecular Endocrinology of Cancer. 
Waxman, J. Ed. Cambridge University Press, Cambridge UK, pp. 187-21 5. 

Treanor JJS, Goodman L, de Sauvage F, Stone DM, Poulson KT, Beck CD, Gray C, Armanini 
MP, Pollocks RA, Hefti F, Phillips HS, Goddard A, Moore MW, Buj-Bello A, Davis AM, Asai N, 
Takahashi M, Vandlen R, Henderson CE and Rosenthal A. (1996) Characterization of a 
receptor for GDNF. Nature 382: 80-83. 

Klein RD, Gu Q, Goddard A and Rosenthal A. (1996) Selection for genes encoding secreted 
proteins and receptors. Proc. Natl. Acad. ScL USA 93: 7108-7113. 

Winslow JW, Moran P, Valverde J, Shih A, Yuan JQ, Wong SC, Tsai SP, Goddard A, Henzel 
WJ, Hefti F and Caras I. (1995) Cloning of AL-1, a ligand for an Eph-related tyrosine kinase 
receptor involved in axon bundle formation. Neuron 14: 973-981 . 

Bennett BD, Zeigler-FC, Gu Q, Fendly B, Goddard AD, Gillett N and Matthews W. (1995) 
Molecular cloning of a ligand for the EPH-related receptor protein-tyrosine kinase Htk. Proc. 
Natl. Acad. Sci. USA 92: 1866-1870. 

Huang X, Yuang J, Goddard A, Foulis A, James RF, Lernmark A, Pujol-Borrell R, 
Rabinovitch A, Somoza N and Stewart TA. (1995) Interferon expression in the pancreases of 
patients with type I diabetes. Diabetes 44: 658-664. 

Goddard AD, Yuan JQ, Fairbairn L, Dexter M, Borrow J, Kozak C and Solomon E. (1995) 
Cloning of the murine homolog of the leukemia-associated PML gene. Mammalian Genome 
6:732-737. 
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Goddard AD, Covello R, Luoh SM, Clackson T, Attie KM, Gesundheit N, Rundle AC, Wells 
JA, Carlsson LMTI and The Growth Hormone Insensitivity Study Group. (1995) Mutations of 
the growth hormone receptor in children with idiopathic short stature. N. Engl. J. Med. 333: 
1093-1098. 

Kuo SS ( Moran P, Gripp J f Armanini M, Phillips HS, Goddard A and Caras IW. (1994) 
Identification and characterization of Batk, a predominantly brain-specific non-receptor protein 
tyrosine kinase related to Csk. J. Neurosci. Res. 38: 705-715. 

Mark MR, Scadden DT, Wang Z f Gu Q, Goddard A and Godowski PJ. (1994) Rse, a novel 
receptor-type tyrosine kinase with homology to Axl/Ufo, is expressed at high levels in the 
brain. Journal of Biological Chemistry 269: 10720-10728. 

Borrow J, Shipley J, Howe K, Kiely F, Goddard A, Sheer D, Srivastava A, Antony AC, 
Fioretos T, Mitelman F and Solomon E. (1994) Molecular analysis of simple variant 
translocations in acute promyelocyte leukemia. Genes Chromosomes Cancer 9: 234-243. 

Goddard AD and Solomon E. (1993) Genetics of Cancer. Adv. Hum. Genet. 21: 321-376. 

Borrow J, Goddard AD, Gibbons B, Katz F, Swirsky D, Fioretos T, Dube I, Winfield DA, 
Kingston J, Hagemeijer A, Rees JKH, Lister AT and Solomon E. (1992) Diagnosis of acute 
promyelocytic leukemia by RT-PCR: Detection of PML-RARA and RARA-PML fusion 
transcripts. Br. J. Haematol 82: 529-540. 

Goddard AD, Borrow J and Solomon E. (1992) A previously uncharacterized gene, PML, is 
fused to the retinoic acid receptor alpha gene in acute promyelocytic leukemia. Leukemia 6 
Suppl3: 117S-119S. 

Zhu X, Dunn JM, Goddard AD, Squire JA, Becker A, Phillips RA and Gallie BL. (1992) 
Mechanisms of loss of heterozygosity in retinoblastoma. Cytogenet CelL Genet 59: 248-252. 

Foulkes W, Goddard A. and Patel K. (1991) Retinoblastoma linked with Seascale [letter]. . 
British Med. J. 302: 409. 

Goddard AD, Borrow J, Freemont PS and Solomon E. (1991 ) Characterization of a novel zinc 
finger gene disrupted by the t(15;17) in acute promyelocytic leukemia. Science 254: 1371- 
1374. 

Solomon E, Borrow J and Goddard AD. (1991) Chromosomal aberrations in cancer. Science 
254: 1153-1160. 

Pajunen L, Jones TA, Goddard A, Sheer D, Solomon E, Pihlajaniemi T and Kivirikko Kl. 
(1991) Regional assignment of the human gene coding for a multifunctional peptide (P4HB) 
■ acting as the p-subunit of prolyl-4-hydroxylase and the enzyme protein disulfide isomerase to 
17q25. Cytogenet Cell: Genet 56: 165-168. 

Borrow J, Black DM, Goddard AD, Yagle MK, Frischauf A.-M and Solomon E. (1991) 
Construction and regional localization of a Not\ linking library from human chromosome 17q. 
Genomics 10: 477-480. 

Borrow J, Goddard AD, Sheer D and Solomon E. (1990) Molecular analysis of acute 
promyelocytic leukemia breakpoint cluster region on chromosome 17. Science 249: 1577- 
1,580. 
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Myers JC, Jones TA, Pohjolainen E-R, Kadri AS, Goddard AD, Sheer D, Solomon E and 
Pihlajaniemi T. (1990) Molecular cloning of 5(IV) collagen and assignment of the gene to the 
region of the region of the X-chromosome containing the Alport Syndrome locus. Am. J. Hum. 
Genet 46:1024-1033. 

Gallie BL, Squire JA t Goddard A, Dunn JM, Canton M, Hinton D, Zhu X and Phillips RA. 
(1990) Mechanisms of oncogenesis in retinoblastoma. Lab. Invest 62: 394-408. 

Goddard AD, Phillips RA, Greger V, Passarge E, Hopping W, Gallie BL and Horsthemke B. 
(1990) Use of the RB1 cDNA as a diagnostic probe in retinoblastoma families. Clinical 
Genetics 37: 117-126. 

Zhu XP, Dunn JM, Phillips RA, Goddard AD, Paton KE, Becker A^and Gallie BL (1989) 
Germline, but not somatic, mutations of the RB1 gene preferentially involve the paternal 
allele. Nature 340: .312-314. 

Gallie BL, Dunn JM, Goddard A, Becker A and Phillips RA. (1988) Identification of mutations 
in the putative retinoblastoma gene. In Molecular Biology of The Eve: Genes, Vision and 
Ocular Disease . UCLA Symposia on Molecular and Cellular Biology, New Series, Volume 88. 
J. Piatigorsky, T. Shinohara and P.S. Zelenka, Eds. Alan R. Liss, Inc., New York, 1988, pp. 
427-436. 

Goddard AD, Balakier H, Canton M, Dunn J t Squire J, Reyes E, Becker A, Phillips RA and 
Gallie BL. (1988) Infrequent genomic rearrangement and normal expression of the putative 
RB1 gene in retinoblastoma tumors. Mol. Cell. Biol. 8: 2082-2088. 

Squire J t Dunn J, Goddard A, Hoffman T, Musarella M, Willard HF, Becker AJ, Gallie BL and 
Phillips RA. (1986) Cloning of the esterase D gene: A polymorphic gene probe closely linked 
to the retinoblastoma locus on chromosome 13. Proc. Natl. Acad. Sci. USA 83: 6573-6577. 

Squire J, Goddard AD, Canton M, Becker A, Phillips RA and Gallie BL (1986) Tumour 
induction by the retinoblastoma mutation is independent of N-myc expression. Nature 322: 
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Goddard AD, Heddle JA, Gallie BL and Phillips RA. (1985) Radiation sensitivity of fibroblasts 
of bilateral retinoblastoma patients as determined by micronucleus induction in vitro. Mutation 
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SIMULTANEOUS AMPLIFICATION AND DETECTION OF 
SPECIFIC DNA SEQUENCES 

Russell ffiguchi*, Gaviix Dollmger 1 , P. Sean WalSfa and Robert Griffith 

Roche Molecular System*. Inc., 1400 SStd St, Emeryville, CA 9460ft. 'Cbwoo Corporation, 1400 53rd St. Emeryville, CA 
9460ft, "KSorresponding author. 



We have enhanced the polymerase chain 
reaction (PGR) such that specific DNA 
sequences can be detected without open- 
ing die reaction tube. This enhancement 
requires the addition of ethidium bromide 
(EtBr) to a PGR- Since the fluorescence of 
EtBr increases in the presence of double* 
stranded (ds) DNA an Increase in fluores- 
cence in such a PGR indicates a positive 
amplification, which can be easily moni- 
tored externally. In tact, amplification can 
be continuously monitored in order to 
follow its progress. The ability to simulta- 
neously amplify specific DNA sequences 
and detect the product of the amplification 
both simplifies and improves PGR and 
may facilitate its automation and more 
widespread use in tiie clinic or in other 
situations requiring high sample tiirough- 
put 

Although the potential t*cnc6,u of PCR 1 to. dta- 
ical o^gnosdea arc we& kn^wn 2 * it b&uUoot 
widely used in this Jetting, even diough it is 
Font- year* doco thcxroovtAbfe ©MA po*y™»ei-* 
ase* 4 made PCR practical. Some of the reasons for its slow, 
acceptance are high cost, tack of automation of pre-? and 
post-PCR processing steps, and false positive results, from 
<arryovcTS^ntamination, The first two points arc related 
in that labor is the largest contributor to cost at the present 
stage of PCR development. Most Current assays require 
sonic form of "downstream" processing once the*mocy- 
ding k done in order in determine whether the target 
DNA sequence was present and has amplified. These 
include: DNA hybrkiisabbn** gel eJectropboresU with or 
without use of restriction digestion 7 '*/ H FIX?, or capillary 
electrophoresis 10 . These methods arc labor4ntense, have, 
low throughput, and arc difficult to automate. The third 
point is afao closely related to downstream processing. 
The handling of the PCR product in these downstream 
processes increases the chances that amplified DNA '.will 
spread through the typing lab, resulting in • a .risk of 



carryover" false positives in subsequent testing". 

These downstream processing steps would be elimi- 
nated if specific amplification and detection of amplified 
DMA took place simultaneously within an unopened re- 
action vessel Assays in wbkh such different processes take 
place without, the need to separate reaction components 
have been termed \homogeiMious\ No truly homogc-. 
tieous PCR assay has been demonstrated to date, akhough 
progress towards this end has been reported*- Chenab, et 
at 1 *, developed a FCR product detection scheme using 
fluorescent primers that resulted in a fluorescent PCR 
product AHdc-fipecific primers, each with different fluo- 
rescent tags, were used to indicate the genotype of the 
DNA. However, the unincorporated primers uiust still be 
removed in a downstream process in order to vfeuahVx the 
result Recently, Holland, et al/ 3 , developed an assay in 
whkh the endogenous 5 r <xdoudease assay of Taf DNA 
polymerase was exploited to cleave a labeled oligonucleo- 
tide probe. The probe would only dcave if PCR amplifi- 
cation had produced its complementary sequence. In 
order to detect the dcavage product?, however* a subse- 
quent process w again needed. 

We have developed a truly homogeneous assay for PCR 
and PGR product detection based upon tbc gready in- 
creased fluorescence that ethidiura broinide and other 
DNA binding dyes exhibit when they are bound .to.ds- 
DNA 14 - 1 *. As outlined in Figure h a prototype PCR 
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... 1 Mndplc of simultancoua ampB&atioti and- detection of 
PCR product: the components of a KiRtoattinzipg Etftr that art 
6 viorescent »re listed— EtBr itself, EtBr bound to other ssDNA ot 
dsDNA. There is ft large auoresceoce enhancanent when EtBr is 
bouod to 0NA and bmdingii grc&tly enhanced whea DNA is 
doabk-stranded After sufficient (nf. cycles of PGR, the .net 
increase 'in dapNA residts in additional -£t&r bjodang, and a net 
increase in total titioxcsccnce: 
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FWWJ 2 Gel electrophoresis of PGR amplification prodiicts of the 
human, mtefcar gene, HLA DQa, made in the presence of 
increasing amounts of EtBr (up to 8 H-g/tnl). The presence of 
EtBr has tio obvious effect on the yield or specificity of amplify . 
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% (A) Fluorescence measurements from PCRs that contain 
0.5 ugftn! EtBr and that are specific for Y^ifOtnosomc repeat 
$eo<ueticc*. Five replicate FCRj/^ere begun containing each of the 
DNA* specified. At each mdicttcd cytfe, one of the five replicate 
PCRs for each DNA -was removed from thcrmocydaig and its 
fluorescence measured. UniU of fluorcflccnce are arbitrary. (B) 
UV photography of PCR tubes (0.5 ml Eppcndor&tyfc, polypro- 
pylene microcentrifuge *ubca) conuniing reactions, those start* 
mg from 2 ng male DNA Mid control reactions without any DNA, 
from (AX 



begins with primers that are single-stranded DNA (ss- 
DNA), dNTPs, and DNA polymerase; An amount of 
dsDNA containing the target sequence (target DMA) is 
also typically present. This amount can vary* depending 
on the application, from single-cell amounts of DNA ir to 
micrograms per PCR^ 8 , If EtBr is present, the reagents 
that will fluoresce, in order of increasing fluorescence, are 
free EtBr itself, and EtBr bound to the single-stranded 
DNA primers and to the doublc^tranded target DNA (by 
its intercalation between the stacked bases of the DNA 
doublc^hcfix}. After the first denatu ration cycle, target 
DNA will be largely $mg*c-gtranded. After a PGR is 
completed, the most significant change is the increase in 
the amount of cbDNA (the PGR product itself) of up to 
several mkrd^ratns. Formerly free EtBr is bound to the 
additional dsDNA* resulting in an increase in fluores- 
cence. There is also some decrease -in the amount of 
ssDNA primer, but because the binding of EtBr to ssDNA 
is much |e$$ than to dsDNA, the effect of this change on 
the total flucn*e*ccucc of the sample is small. The fluores- 
cence increase can be measured by directing excitation 
illumination through the walls of the amplification vessel 
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before and after, or even continuously during, thermocy- 
enng. 

RESULTS 

PGR in the presence of EtBr. In order to assess the 
affect of EtBr in PGR, ampMncacions of the human HlA 
DQct gene** were performed with the dye present at 
concentrations from 0,06 to 8,0 ugfrnl (a typical concen- 
tration of EtBr used in staining of nudek aods foftowuig 
get electrophoresis is 0*5 pg/mf). As shown in Figure 2,gel 
electrophoresis revealed little or no difference in the yield 
or quality of the ampbneation product whether EtBr was 
absent or present at any of these concentrations, indicat- 
ing that EtBr does not inhibit PCR. 

Peleetiou of human Y-chrt>nKw>cwi*o specie 
enrcnees* $equerK*^pcriftc, fluorescence enhanoement of 
EtBr as a result of PGR was demonstrated in a scries of 
amplifications containing 0.5 ugfrn} EtBr and primers 
specific to repeat DNA sequences found on the human 
Y-chromoiojnc***. These PCRs initially contained either 
60 ng male, 60 ng female, 2 ng roak human or no DNA. 
Five replicate PCRs were begun for each DNA. After Q, 
17, 21 , 24 and 29 cycles of thermocyding, a PGR for each 
DNA was removed from the therancxyder, and its fluo- 
rescence measured in a spcctrouNacrometer and ' plotted 
vs, amplification cyde number (Fig. 3A). The shape of this 
curve reflects the fact that by the time an increase in 
fluorescence can be detected, the increase in DNA is 
becoming linear and not exrxinenual with cycle number. 
As shown, the fluorescence increased about three-fold 
over the background fluorescence for the PCRs contain- 
ing human male DNA, but did not significantly increase 
for negative control PCRs, which contained either no 
DNA or human female DNAl The more male DNA 
present to begin with— 60 ng versus 2 ng-nte fewer 
cycles were needed to give a detectable increase in fluo- 
rescence. Gel electixmSoresis oo the products of these 
amplifications showed that DNA fragments of the ex- 
pected skc were made in the male DNA containing 
reactions and that Utile DN A synthesis took place in the 
control samples. 

In addition, the increase in fluorescence was visualized 
by simply laying the completed, unopened PCRs on a UV 
tramuhirninatOT and photographing mem through a red 
filter. This is shown m figure SB tor the reactions thai 
began with 2 ng male DNA and those with no DNA- 

Detection of specific: aBcto of tire human p-globtD 
gene* In order to demonstrate that this approach has 
adequate spedfkfcy to allow genetic saeerung, a dttccrson 
of the $icldc-cdl anemia mutation was performed. Figure 
4 shows the fluorescence from completed ampMcatioxu 

containing EtBr (0_S Kg/ml) as detected by photography 
of the reaction tubes on a UV tranrillaniinator, Triese 
reactions were performed using prnncTs specific for ci- 
ther the wild-type or skkle-cell mutation of the. .human 
p^ohin gene* \ The specificity for each aBctc is imparted 
by placing the sickle-rjiutalion site at the terminal V 
nudeaddc of one primer. By using an appropriate primer 
annealing temperature, primer exten^on— and thus an> 
pUncatic^n— can take place only if the S' nucieodde of ^ 
primer is complementary to the p-globiri allele present 73 

Each pair of ampfficauons shown in Figure 4 consists of 
a reaction wkh either the wiMnypc allelt specific (left 
tube) or skkk-aUde spedfk (right tube) primers. Three 
difierent DN As were typed: DNA f^>m a homozygous, 
wfld-type p-globin mdividual (AA); from a heterozygous 
sickle p-giobin individual (AS); and from a homozygous 
sickle p-gk^o individual (SS). Each DNA (50 ng genomic 
DNA to start each PGR) was ar+atyzed m triplicate (3 pain 
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of reactions each). The DNA .type was reflected in the ' 
j^jative fluorescence intensities in each pair of completed 
Am p]uicatk>tis. There was a significant increase in fluores- 
cence only where a £-globm allele DNA matched the 
printer set. Whco measured- or a spectrofluororneter 
Mata not shown), this Buorosccnec was about three times 
present in a PGR where both p-globhi alleles were 
jtiisinatchcd to the primer set. Gel ciearophofe*^ (not 
(ihowii) established tkat this increase in fluorescence was 
due to the synthesis of nearly a microgram of a DNA 
fragment of the expected size for P-globin. There was 
little symhestt of dsDNA b reactions in which the aflete- 
jpedfic primer was mismatched to both alleles. 

Continuous imraftoribog <tf a PGR. Using a fiber optic 
devtcer K i* possible to direct excitation ittutmnauon from 
? qpectrofl uorometer to a PGR undergoing thcrmocycling 
an d to return its fluorescence to the Kpectroftuorotwetcr. 
The fluorescence readout of such an arrangement, di- 
rected at an EtBr-concaining amplificadon of Y-chromo- 
somc specific sequences from 25 ji£ of human male DNA, 
is shown in Figure 5, The readout from a control PCR 
whh no target DNA is also shown. Thirty cycles of PGR 
were monitored for each, 

The fluorescence trace as a function of time dearly 
shows the efFect of the thermocyeiing. Fluorescence inten- 
sity riaes and. fails inversely wttfc temperature. The fluo- 
rescence intensity is minimum at the denaturation tem- 
perature (94°C) and raaxirnurn at the annealing/extension 
temperature (50°C). In the negative-control FCR t these 
fluorescence maxima and minima do not change signifi- 
cantly over the thirty tbarroocycks, indicating that there is 
Ikttc dsDNA synthesis without the appropriate target 
DNA, and there is little if any bleaching of EtBr during 
(he continuous illumination of the sample. 

In the PGR containing male I>NA ( the fluorescence 
maxima at the annealing/extension temperature begin to 
increase at about 4000 second* of therroocycling, and 
continue to increase whh time, indicating that dsDNA is 
being produced at a detectable level* Note that the fluo- 
rescence minima at the denatuTatioa temperature do not 
MgniRcantly increase, presumably because at this temper- 
ature there is no dsDNA for EtBr to bind- Thus the course 
of the amplification is followed by tracking the fluorcs-. 
cence increase at the aancahng temperature. Analysis of 
the products of these two amplifications by gel eleetropho- 
few showed * ON A fragment of die expected aie for the 
male DNA containing sample and no detectable DNA 
synthesis for the control sample. 

DISCUSSION 

Downstream processes such as hybridization to a se- 
quenee-apedfic probe can' enhance die specificity of DNA 
deceuumi lij PGR. The cHnrioatkm of uSeac proecrecf 
means that - the srxxiftcuy of this homogeneous assay 
depends solely on that of PGR* la the case of rickle-celi 
disease, we have shown that PGR alone has sufficient DNA 
sequence specificity to permit genetic screening. Using 
appropriate amplification conditions, there is little non- 
Specific production of dsDNA in the absence of the 
appropriate target allele. 

The specificity required to detect pathogens can be 
more or less than that required to do genetic screening, 
depending on the number of pathogens in the sample and 
the amount of other DNA that must be taken with die 
sample. A difficult target is HIV, which requires detection 
of a viral genome that can be at the level of a few copies 
per thousands of host cells*. Compared with genedc 
screening, which is performed on ceils containing at least 
one copy of die target sequence, HIV detection requires 
both more spectfidty and the input of more total 
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mm 4 UV photography of FOR tubes containing tunp£fie*Uoiu 
using EtBr uwt are speonc to wild-type (A) or fteftk (S) alleles of 
the bunuo £-globin gene. The left ot each pSiir of tubes Contain* 
aMe-tpcdfic primers to the wild-type alleles, the right tube 
primers to the sfcWe aflefc. The phmograph was ta<wd after 30 
cycles of FOR. and the input DNA* and the alleles thev coatam 
are indicated. Fifty tog of DNA was used to begin PGR. Typtn# 
was done hi triplicate (8 pairs of POts) for each input DNA 
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RG0K5 Continuous rcaUhne monitoring of a FCR. A fiber optk 
was wed to carry, excitation fight to a FUR m progress and ako 
emitted light back to a ftaoromctcr (see Expenmcntal Protocol), 
Amptifiraaoa-wiog^uinan nudo-DNA specific primers in a PQ& 
Starting with 20 ng of human male DNA {top}* or in a control 
PCR without PNA (boitnm), were monfaored, Thirty cydes of 
PCR were followed for each. The temperature cycled between 
94*C {denaturauW) and 50*C (annealing and extension}- Note in 
the male DNA PCR*. the cycle (tunc) depietiSicot increase in 
fluorescence at the aaneafo^extensHm temperature. 
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DNA — up to microgram amounts— i a order to have suf- 
ficient numbers of target sequences. This large amount of 
starting DNA ta an amplification simi&catttly increases 
the background fluorescence over which any additional 
fluorescence produced by PCR must be detected. An 
additional complication that occurs with targets irt tow 
copy-number is the formation of the ^rimer-dimer" 
artifact. This ts the rcsvdt of the extension of one primer 
using the other primer as a template. Although this occurs 
infrequently, once it occur* the extension product is a 
substrate for PGR amplification, and can compete with 
true PGR targets if those targets are rare. The primer- 
dimer product i$ of course dsDNA and thus U a potential 
source of false signal in this homogeneous assay. 

To increase FCR specificity ana reduce the eflfect of 
priaier-dimcr arrtplifrcatroa, we are investigating a num- 
ber of approaches, including the use of nested-primer 
amplification* that take place in a single tube 9 , and the 
*hot-start** ( in which nonspecific amplification ts reduced 
by raising the temperature of the reaction before DNA 
synthesis begins 45 . Prelhninary results using these ap- 
proaches suggest that pirOTcrHiuTCr is effectively reduced 
and it is possible to detect the increase in Etfir fluores- 
cence in a PGR instigated by a single HIV genome in a 
background of 10* celts. With larger number* of cells, the 
background fluorescence contributed by genomic DNA 
becomes problematic- To reduce thw background, it may 
be possible to use sexruenee^pecific DNA4>inding dyes 
that can be made to preferentially bind PGR product over 
genomic DNA by incorporating the dye-binding DNA 
sequence into the PGR product through a 5' "add-on" to 
the eAironvdeotide primer* 4 . 

We nave shown that the detection of fluorescence 
generated by an EtBr-containing PCR is straightforward, 
both once FCR is completed and continuously during 
thermocyefing. The ease with which automation of spe- 
cific DNA detection can be accomplished is the most 
promising aspect of this assay. Hie Huoreseence analysis 
of completed PCRs is alreadypo^tblc with existing instru- 
mentauon in 96-well formar* In this format, the fluores- 
cence in each PCR can be cjuantitated before, after, and 
even at selected points during rJierraocyciing by moving 
the rack of PCRs to a 96HmierowcM plate fluorescence 
reader 20 , 4 

The instrumentation necessary to continuously monitor 
multiple PCRs simultaneously is also simple in principle. 
A direct extension of the apparatus used here is to have 
multiple fiberopdcs transmit the citation light and flu- 
orescent emissions to and from multiple PCRs. The ability 
to monitor multiple PCRs continuously may allow quan- 
titation of target DNA copy number. Figure 3 shows that 
the larger the amount of starting target DNA, the sooner 
during PCR a fluorescence increase is detected. Prcfirai- 
nary experiments <Biguchi and DolEnger, manuscript in 
preparation) with continuous monitoring' have shown a 
sensitivity to two-fold differences in initial target DNA 
concentration. 

ConvexselY, if the number of target molecules is 
known — a$ it can be in genetic screeriing— rcontinuo\is 
monitoring may provide a means of detecting fabc posi- 
tive and false negative result** With a known number of 
target roolecutes, a true positive would exhibit detectable 
fluorescence by a predictable number of cycles of PCR. 
Increases in fluorescence detected before or after that 
cycle would indicate potential artifacts* False negative 
results due to, for example,. inhibition of DNA polymer- 
ase, may be detected by including within each PCR an 
ineffietendy amplifying marker. This marker results in a 
fluorescence increase only after a large number of cy- 
cles— many more* than are necessary CO detect a true 
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positive. If a sample fails to have a fluorescence increase 
after this many cycles, tnhrbuton may be suspected. Since, 
in this assay, conclusions are drawn based on the presence 
or absence of fluorescence signal alone, such controls may 
be important. In any event, before any test based on this 
principle is ready for the dink, an assessment of its false 
positive/false negative rates will need to be obtained using 
a large number of known samples. 

In summary, the inclusion m PCR of dyes whose fluo- 
rescence is enhanced upon binding dsDNA makes it 
possible to detect specific DNA ampufieation from outside 
the PCR tube. In the future, instruments based upon this 
principle may facilitate the more widespread use of PCR 
ui applications that demand the high throughput of 
samples. 

EXPERIMENTAL PROTOCOt . . ' 

H«m*n HLA-DQn gene *mpHBcsUioifcs containing EtBr. 
PCRs were set up iniOO (d volumes containing 10 mM Tris-HQ, 
pH 8.3; 50 mM KCI; 4 mM MgO*: S3 units of To* DNA 
pc4yrocfa$C (P*rVm*£}mcr Ccrua, Norwalk, CT); 20 mriole caca 
of human HtA-DQa * gene specific oligonucleotide primers 
(>H26 and CH27" and approximately NT copies of DQfe PCR 
product diluted from a previous faction. Edjichum bromide 
(Etfcr; SigtwA) was used at the conccntrauonE indicated uj Figure 
2. Thcrmocyding proceeded for 20 evdes in a model 460 
ihcrmocycicr (Perkm-EJwer Ceu», rWatk, CT) roi^-stcp- 
cycJc" program of 94*C for 2 raia-deaatuntuon and WW 
sec flnncaEbng and 72°C for $0 tec extension. 

Y-chromosxnnc specific PCR- PCRs (100 fil total reaction 
volume) containing %J> yd^A EtBr were prepared as described 
for HLA-DQoi, except w*(h different primers and target DNAs. 
These PCRs conuUticd J $ pmolc each male DNA-»pccujc primer* 
YI.1 and Vi.* M , and either 60 ng male, €0 eefenwle, 2 ng mak. 
or no human UNA. Thcrmocydmg was M^Tor 1 min- and 6^C 
for I txiin using a "rtcp-eyde* program. The number of cycles for 
a sample were as indicated in Figure 3. Fluorescence measure- 
ment is described below. , 

Allck-raceific, human p-globift $co* PCR* Amplifications of 
100 p4 volume u«ng 0-5 f^Anl of ZtBr were prepared aj 
described for HLA-DQt* above except with different pfuncri and 
target DNAs. These PCto contained e5«W. pnmcr pair HGPi/ 
H6HA <w8dHype globm spechtc primers) or HGF2/H£14S (ajek- 
ie-giobin specific primers) at 10 pmole eac* pruncr pet -PCR. 
T&*t prfaners were deeped by Wu ct aL 21 . Three different 
tACgei IjNAa Were used in separate amplificauons-^50 ng cacti of 
human DNA that was hc^inozygous for the sickle trait (SS), DNA 
that was heterorrgous for the sickle traH (AS), or DNA that v«u 
honwwrgous for the w.i- globin (AA). ThcrraocycBug wj» F^rSO 
cycles at 9<TC for 1 mm. and 55*0 for 1 mm. itsut| a ^stcMycfc 
prosram. An anneaHog temperature of 55 q C been shown by 
Wn ei H** to provide aUelc<r>ecmc awpUfjca.Uon. Q^plctcd 
PCRs were photographed through a red filter <wmie*rz3A) 
after placing the reaction tubes atop a model TM*36 tranaiflulfli- 
nator <UV-pioducu San- Gabriel, CA>- 

Fhioreseeneetneasiwemem. ntiorescence roe^surcmenw were 
made on PCRs containing Et»r in * Fluorolog^ u^orornetcr 
(SPEX. Edison. NJ). Excitation was at the 500 nra band wuh 
ihout t ntn bond^dth with a GO 435 ntn cut^ftkerjMciles 
Crist. Inc^ Irvine. CA) to exclude second-otder Uglit vn™f* 
fight was detected at 570 nm with a bandwicUh of about 7 nm. An 
OG 530 pm cut-off filter Was used to remove the excxtaaon hgftt 
ContitHKKM ftDoreftcenee mtnutoring of FCR, Connnuous 
monitoring of a PCR in progress was accompJwbed using «iC 
spcctroHuoroineicr and setting* described above as well as a 
fSeroptie accessory (SPEX cat no. 1950) to both send cxotajiQu 
6ght to, and receive emitted light from, a PCR placed m a well of 
a model 480 rbexweyder (Pcrkin-Eltner CetMs)- The probe ena 
of the fiberoptic cable was attached with "5 fx^utCHcpoxy" to tjte 
open top of a PCR cube (a 0^ ml poifpropykaxc ccnirif^c tube 
with iw cap removed) effeaweW flcafiug The exposed^ ^ 
the PCR tube and the end of die fiberopuc eabk were rf>«W«« 
from room Ught and the room lights were kept dunmed dwrmg 
■ each m»- The monaorcd PCft was an ampufieldon of Y-cdto- 
rnosome^pcdfk repeat scsvences as o>»cribed above. c»ccpt 
us«5ff.an arincaKngfextenskm tetnperauirc of5VC The reacuoa 
was covered yrtth m\n&tf oil (2 drops) to prevent cvapora&on- 
Thenxjocycfing- and fluorescence rowuremcot wrc started ^ 
multanccisly. A time-base «c*rt W1(^ a 10 second integral t»nc 
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used and che emb&iott Stgnal was ratioed to' tbc excitation 
signal to « mtw> * fo€ change* in Ji^htMourcc intensity. Data were 
fleeted osing the droSOOOf, version 15 (SPEX) data system. 

Wc fhAfdc Bob lone* for help with th« sp^ctrofluormctnc 
UKywuremcitU^nd Hcalhcrhe!! Fon^for editing this manuscript. 
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Oligonucleotides with Fluorescent Dyes at 
Opposite Ends Provide a Quenched Probe 
System Useful for Detecting PCR Product 
and Nucleic Acid Hybridization 

Kenneth J. Livak, Susan J.A. Flood, Jeffrey Marmaro, William Ciusti, and Karin Deetz 

Perkin-EImer, Applied Biosystems Division, Foster City, California 94404 



The 5' nuclease PCR assay detects the 
accumulation of specific PCR product 
by hybridization and cleavage of a 
double-labeled fluorogenlc probe 
during the amplification reaction. 
The probe is an oligonucleotide with 
both a reporter fluorescent dye and a 
quencher dye attached. An increase 
|tn reporter fluorescence intensity In- 
dicates that the probe has hybridized 
to the target PCR product and has 
been cleaved by the 5' nucle- 
olytic activity of Taq DNA polymerase. 
In this study, probes with the 
quencher dye attached to an Internal 
nucleotide were compared with 
probes with the quencher dye at- 
tached to the 3 -end nucleotide. In all 
cases, the reporter dye was attached 
to the 5' end. All Intact probes 
showed quenching of the reporter 
fluorescence. In general, probes with 
the quencher dye attached to the 3'- 
end nucleotide exhibited a larger sig- 
nal In the 5' nuclease PCR assay than 
the Internally labeled probes. It is 
proposed that the larger signal Is 
caused by Increased likelihood of 
cleavage by Taq DNA polymerase 
when the probe Is hybridized to a 
template strand during PCR. Probes 
with the quencher dye attached to 
the 3 '-end nucleotide also exhibited 
an Increase in reporter fluorescence 
Intensity when hybridized to a com- 
plementary strand. Thus, oligonucle- 
otides with reporter and quencher 
es attached at opposite ends can 
le used as homogeneous hybridiza- 
tion probes. 



Jr\ homogeneous assay for detecting 
the accumulation of specific PCR prod- 
uct that uses a double-labeled fluoro- 
genic probe was described by Lee et al/ l) 
The assay exploits the 5' ->3' nucle- 
olytic activity of Taq DNA poly- 
merase (2 - 3) and Is diagramed in Figure 1. 
The fluorogenic probe consists of an oli- 
gonucleotide with a reporter fluorescent 
dye, such as a fluorescein, attached to 
the 5' end; and a quencher dye, such as a 
rhodamine, attached internally. When 
the fluorescein is excited by irradiation, 
its fluorescent emission will be 
quenched if the rhodamine is close 
enough to be excited through the pro- 
cess of fluorescence energy transfer 
(FET).< 4 ' 5 > During PCR, if the probe is hy- 
bridized to a template strand, Taq DNA 
polymerase will cleave the probe be- 
cause of its inherent 5' '-*> 3' nucleolytic 
activity. If the cleavage occurs between 
the fluorescein and rhodamine dyes, it 
causes an Increase in fluorescein fluores- 
cence Intensity because the fluorescein 
is no longer quenched. The increase in 
fluorescein fluorescence intensity indi- 
cates that the probe-specific PCR product 
has been generated. Thus, FET between a 
reporter dye and a quencher dye is criti- 
cal to the performance of the probe in 
the 5' nuclease PCR assay. 

Quenching is completely dependent 
on the physical proximity of the two 
dyes. (6) Because of this, it has been as- 
sumed that the quencher dye must be 
attached near the 5' end. Surprisingly, 
we have found that attaching a rho- 
damine dye at the 3' end of a probe 
still provides adequate quenching for 
the probe to perform in the 5' nuclease 



PCR assay. Furthermore, cleavage of this 
type of probe is hot required to achieve 
some reduction in quenching. Oligonu- 
cleotides with a reporter dye on the 5' 
end and a quencher dye on the 3' end 
exhibit a much higher reporter fluores- 
cence when double-stranded as com- 
pared with single-stranded. This should 
make it possible to use this type of dou- 
ble-labeled probe for homogeneous de- 
tection of nucleic acid hybridization. 



MATERIALS AND METHODS 
Oligonucleotides 

Table 1 shows the nucleotide sequence 
of the oligonucleotides used in this 
study. Linker arm nucleotide (LAN) 
phosphoramidite was obtained from 
Glen Research. The standard DNA phos- 
phoramidites, 6-carboxyfluorescein (6- 
FAM) phosphoramidite, 6-carboxytet- 
ramethylrhodamine succinimidyl ester 
(TAMRA NHS ester), and Phosphalink 
for attaching a 3'-blocking phosphate, 
were obtained from Perkin-EImer, Ap- 
plied Biosystems Division. Oligonucle- 
otide synthesis was performed using an 
ABI model 394 DNA synthesizer (Applied 
Biosystems). Primer and complement 
oligonucleotides were purified using 
Oligo Purification Cartridges (Applied 
Biosystems). Double-labeled probes were 
synthesized with 6-FAM-iabeied phos- 
phoramidite at the 5' end, LAN replacing 
one of the Ts in the sequence, and Phos- 
phalink at the 3' end. Following de- 
protection and ethanol precipitation, 
TAMRA NHS ester was coupled to the 
LAN-containing oligonucleotide in 250 
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FIGURE 1 Diagram of 5' nuclease assay. Stepwise representation of the 5' - 3' ni^eoiyttcac- 
tiviry of Taq DNA polymerase acting on a fluorogenic probe during one extension phase of PCR. 




mM Na-bicarbonate buffer (pH 9.0) at 
room temperature. Unreacted dye was 
removed by passage over a PD-10 Sepha- 
dex column. Finally, the double-labeled 
probe was purified by preparative high- 
performance liquid chromatography 
(HPLC) using an Aquapore C 8 220x4.6- 
mm column with 7-u.m particle size. The 
column was developed with a 24-min 
linear gradient of 8-20% acetonitrile in 
0.1 m TEAA (triethylamine acetate). 
Probes are named by designating the se- 
quence from Table 1 and the position of 
the LAN-TAMRA moiety. For example, 
probe Al-7 has sequence Al with LAN- 
TAMRA at nucleotide position 7 from the 
5' end. 



PCR Systems 

All PCR amplifications were performed 
in the Perkin-Elmer GeneAmp PCR Sys- 
tem 9600 using 50-u.l reactions that con- 
tamed 10 mM Tris-HCl (pH 8.3), 50 mM 
KC1, 200 pM dATP, 200 *jlm dCTP, 200 u-M 
dGTP, 400 jtM dUTP, 0.5 unit of AmpEr- 
ase uracil N-glycosylase (Perkin-Elmer), 
and 1.25 unit of AmpliTaq DNA poly- 
merase (Perkin-Elmer). A 295-bp seg- 
ment from exon 3 of the human p-actin 



gene (nucleotides 2141-2435 in the se- 
quence of Nakajlma-Uiima et al.) <7) was 
amplified using primers AFP and ARP 
(Table 1), which are modified slightly 
from those of du Breuil et al. (8> Actin am- 
plification reactions contained 4 mM 
MgC! 2 , 20 ng of human genomic DNA, 
SO nM Al or A3 probe, and 300 hm each 



TABLE 1 Sequences of Oligonucleotides 



primer. The thermal regimen was 50°C 
(2 min), 95°C (10 min), 40 cycles of 95°C 
(20 see), 60°C (1 min), and hold at 72°C 
A 515-bp segment was amplified from a 
plasmid that consists of a segment of X 
DNA (nucleotides 32,220-32,747) in- 
serted in the Smal site of vector pUCl 19. 
These reactions contained 3.5 mM 
MgCl 2 , 1 ng of plasmid DNA, 50 nM P2 or 
P5 probe, 200 nM primer F119, and 200 
nM primer R119. The thermal regimen 
was 50°C (2 min), 95°C (10 min), 25 cy- 
cles of 95°C (20 sec), 57°C (1 min), and 
hold at 72°C 



Fluorescence Detection 

For each amplification reaction, a 40-u,l 
aliquot of a sample was transferred to an 
individual well of a white, 96-weil micro- 
titer plate (Perkin-Elmer). Fluorescence 
was measured on the Perkin-Elmer Taq- 
Man LS-50B System, which consists of a 
luminescence spectrometer with plate 
reader assembly, a 485-nm excitation fil- 
ter, and a S15-nm emission filter. Excita- 
tion was at 488 nm using a 5-nm slit 
width. Emission was measured at 518 
nm for 6-FAM (the reporter or R value) 
and 582 nm for TAMRA (the quencher or 
q value) using a 10-nm slit width. To 
determine the increase in reporter emis- 
sion that is caused by cleavage of the 
probe during PCR, three normalizations 
are applied to the Taw emission data. 
First, emission intensity of a buffer blank 
is subtracted for each wavelength. Sec- 
ond, emission intensity of the reporter is 



Name 



Type 



Sequence 



F119 

R119 

P2 

P2C 

P5 

P5C 

AFP 

ARP 

Al 

A1C 

A3 

A3C 



primer 
primer 
probe 

complement 
probe 

complement 
primer 
primer 
probe 

complement 
probe 

complement 



ACCCACAGGAACTGAfCACCACTC 

ATGTCGCGTTCCGGCTGACGTTCTGC 

TCGCATTACI GATCGTrGCCAACCAGTp 

CTACTGGTTGGCAACGATCAGTAATGCGATG 

CGGAMTTGCTGGTATCrATGACAAGGATi) 

TTCATCCTrGTCATAGATACCAGCAAATCCG 

TCACCCACACTGTGCCCATCTACGA 

CAGCGGAACCGClX^ATTGCCAATGG 

ATGCCCTCCCCCATGCCATCCTGCGTp 

AGACGCAGGATGGCATGGGGGAGGGCATAC 

CGCCCrGGACTTCGAGCAAGAGATp 
CCATCTCTTGCTCGAAGTCCAGGGCGAC 



For each oligonucleotide used in this study, the nucleic acid sequence is given, written i in the 
5' -> y direction. There are three types of oligonucleotides: PCR primer, fluorogenlc probe u ea 
in the 5' nuclease assay, and complement used to hybridize to the corresponding probe, i-or in 
probes, the underlined base indicates a position where LAN with TAMRA attached was suDsn- 
tuted for a T. (p) The presence of a 3' phosphate on each probe. 
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A1-2 RaQgccctcccccatgccatcctgcgtp 

A1 -7 RATGCCCQCCCCCATGCCATCCTGCGTp 

A1 -1 4 RatcccctcccccaQgccatcctgcgtp 

A1-19 RatgccctcccccatgccaQcctgcgtp 

A1-22 RATGCCCTCCCCCATGCCATCCQGCGTp 

A1 -26 RatgccctcccccatgccatcctgcgQp 



Probe 


518 nm 


582 nm 


RQ- 


RQ+ 


ARQ 




no temp. 


+ temp. 


no temp. 


+ temp. 








A1-2 


25.5 ±2.1 


32.711.9 


38.2 ±3.0 


38.2 ±2.0 


0.67 + 0.01 


0.8610.06 


0.19+0.06 


A1-7 


53.5 ± 4.3 


395.1 ±21.4 


108.5 ±6.3 


110.3 ±5.3 


0.4910,03 


3.5810.17 


3.09 ±0.18 


A1-14 


127.0±4.9 


403.5 + 19.1 


109.7 ±5.3 


931 ±6.3 


1.1610.02 


4.3410.15 


3.1810.15 


AM 9 


187.5 ±17.9 


422.7 ±7.7 


70.3 ±7.4 


73.0 ±2.8 


2.6710.05 


5.8010.15 


. 3.13 + 0.16 


A1-22 


224.6 + 9.4 


482.2 ± 43.6 


100.0 ±4.0 


96219.6 


2.2510.03 


5.0210.11 


2.7710.12 


A1-26 


160.2 ±8.9 


454.1 ±18.4 


93.1 ±5.4 


90.7 ±3.2 


1.7210.02 


5.0110.08 


3.2910.08 



FIGURE 2 Results of 5' nuclease assay comparing (3-actin probes with TAMRA at different nucle- 
otide positions. As described in Materials and Methods, PCR amplifications containing the in- 
dicated probes were performed, and the fluorescence emission was measured at 518 and 582 nm. 
Reported values are the average ±1 s.d. for six reactions run without added template (no temp.) 
and six reactions run with template (+temp.). The RQ ratio was calculated for each individual 
reaction and averaged to give the reported RQ~ and RQ + values. 



divided by the emission intensity of the 
quencher to give an RQ ratio for each 
reaction tube. This normalizes for well- 
to-weil variations in probe concentra- 
tion and fluorescence measurement. Fi- 
nally, ARQ is calculated by subtracting 
the RQ value of the no-template control 
(RQ~) from the RQ value for the com- 
plete reaction including template 
(RQ + ). 

RESULTS 

A series of probes with increasing dis- 
tances between the fluorescein reporter 
and rhodamine quencher were tested to 
investigate the minimum and maximum 
spacing that would give an acceptable 
performance in the 5' nuclease PCR as- 
say. These probes hybridize to a target 



sequence in the human 3-actin gene. 
Figure 2 shows the results of an experi- 
ment in which these probes were in- 
cluded in PCR that amplified a segment 
of the 0-actin gene containing the target 
sequence. Performance in the 5' nu- 
clease PCR assay is monitored by the 
magnitude of ARQ, which is a measure 
of the increase in reporter fluorescence 
caused by PCR amplification of the 
probe target Probe Al-2 has a ARQ value 
that is close to zero, indicating that the 
probe was not cleaved appreciably dur- 
ing the amplification reaction. This sug- 
gests that with the quencher dye on the 
second nucleotide from the 5 r end, there 
is insufficient room for Taq polymerase 
to cleave efficiently between the reporter 
and quencher. The other five probes ex- 
hibited comparable ARQ values that are 



clearly different from zero. Thus, ail five 
probes are being cleaved during PCR am- 
plification resulting in a similar increase 
in reporter fluorescence. It should be 
noted that complete digestion of a probe 
produces a much larger increase in re- 
porter fluorescence than that observed 
in Figure 2 (data not shown). Thus, even 
in reactions where amplification occurs, 
the majority of probe molecules remain 
uncleaved. It is mainly for this reason 
that the fluorescence intensity of the 
quencher dye TAMRA changes little with 
amplification of the target. This is what 
allows us to use the 582-nm fluorescence 
reading as a normalization factor. 

The magnitude of RQ~ depends 
mainly on the quenching efficiency in- 
herent in the specific structure of the 
probe and the purity of the oligonucle- 
otide. Thus, the larger RQ" values indi- 
cate that probes Al-14, Al-19, Al-22, and 
A 1-26 probably have reduced quenching 
as compared with A 1-7. Still, the degree 
of quenching is sufficient to detect a 
highly significant increase in reporter 
fluorescence when each of these probes 
is cleaved during PCR. 

To further investigate the ability of 
TAMRA on the 3' end to quench 6-FAM 
on the 5' end, three additional pairs of 
probes were tested in the 5' nuclease 
PCR assay. For each pair, one probe has 
TAMRA attached to an internal nucle- 
otide and the other has TAMRA attached 
to the 3' end nucleotide. The results are 
shown in Table 2. For all three sets, the 
probe with the 3' quencher exhibits a 
ARQ value that is considerably higher 
than for the probe with the internal 
quencher. The RQ" values suggest that 
differences in quenching are not as great 
as those observed with some of the Al 
probes. These results demonstrate that a 
quencher dye on the 3' end of an oligo- 
nucleotide can quench efficiently the 



TABLE 2 Results of 5' Nuclease Assay Comparing Probes with TAMRA Attached to an Internal or 3'-terrninal Nucleotide 







518 nm 




582 nm 








Probe . 


no temp. 


+ temp. 


no temp. 


+ temp. 


RQ" 


RQ+ 


ARQ 


A3-6 
A3-24 


54,6 ± 3.2 
72.1 ± 2.9 


84.8 ± 3.7 
236,5 ± 11.1 


116.2 ±6.4 
84.2 ± 4.0 


115.6 ± 2.5 
90.2 ± 3.8 


0.47 ± 0.02 
0.86 ± 0.02 


0.73 ± 0.03 
2.62 ± 0.05 


0.26 ± 0.04 
1.76 ± 0.05 


P2-7 
tf2-27 

IsiO 

P5-28 


82.8 ± 4.4 
113.4 ± 6.6 


384.0 ±34.1 
555.4 ± 14.1 


105.1 ± 6,4 
140.7 ± 8.5 


120.4 ± 10.2 
118.7 ±4.8 


0.79 ± 0.02 
0.81 ± 0.01 


3.19 ±0,16 
4.68 ± 0.10 


2.40 ±0.16 
3.88 ± 0.10 


77.5 ± 6.5 
64.0 ± S.2 


244.4 ± 15.9 
333.6 ±12.1 


j 86.7 ± 4.3 
100.6 ±6.1 


95.8 ± 6.7 
94.7 ± 6.3 


0.89 ± 0.05 
0.63 ± 0.02 


2.55 ± 0.06 
3,53 ± 0.12 


1.66 ± 0.08 
2.89 ± 0.13 



Reactions containing the indicated probes and calculations were performed as described in Material and Methods and in the legend to Fig. 2. 
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fluorescence of a reporter dye on the 5' 
end. The degree of quenching is suffi- 
cient for this type of oligonucleotide to 
be used as a probe in the 5' nuclease PCR 
assay. 

To test the hypothesis that quenching 
by a 3' TAMRA depends on the flexibility 
of the oligonucleotide, fluorescence was 
measured for probes in the single- 
stranded and double-stranded states. Ta- 
ble 3 reports the fluorescence observed 
at 518 and 582 nm. The relative degree 
of quenching is assessed by calculating 
the RQ ratio. For probes with TAMRA 
6-10 nucleotides from the 5' end, there 
is little difference in the RQ values when 
comparing single-stranded with double- 
stranded oligonucleotides. The results 
for probes with TAMRA at the 3' end are 
much different. For these probes, hy- 
bridization to a complementary strand 
causes a dramatic increase in RQ, We 
propose that this loss of quenching is 
caused by the rigid structure of double- 
stranded DNA, which prevents the 5' 
and 3' ends from being in proximity. 

When TAMRA is placed toward the 3' 
end, there is a marked Mg 2 "*" effect on 
quenching. Figure 3 shows a plot of ob- 
served RQ values for the Al series of 
probes as a function of Mg 2 "*" concentra- 
tion. With TAMRA attached near the 5' 
end (probe Al-2 or Al-7), the RQ value at 
0 mM Mg 2 * Is only slightly higher than 
RQ at 10 mM Mg 2 *. For probes Al-19, 
Al-22, and Al-26, the RQ values at 0 mM 
Mg 2+ are very high, indicating a much 



reduced quenching efficiency. For each 
of these probes, there is a marked de- 
crease in RQ at 1 mM Mg 2 + followed by 
a gradual decline as. the Mg 2 * concen- 
tration increases to 10 mM. Probe Al-14 
shows an Intermediate RQ value at 0 mM 
Mg 2 * with a gradual decline at higher 
Mg 2+ concentrations. In a low-salt en- 
vironment with no Mg 2+ present, a sin- 
gle-stranded oligonucleotide would be 
expected to adopt an extended confor- 
mation because of electrostatic repul- 
sion. The binding of Mg 2+ ions acts to 
shield the negative charge of the phos- 
phate backbone so that the oligonucle- 
otide can adopt conformations where 
the 3' end is close to the 5' end. There- 
fore, the observed Mg 2 * effects support 
the notion that quenching of a 5' re- 
porter dye by TAMRA at or near the 3' 
end depends on the flexibility of the oli- 
gonucleotide. 

DISCUSSION 

The striking finding of this study is that 
it seems the rhodamine dye TAMRA, 
placed at any position In an oligonucle- 
otide, can quench the fluorescent emis- 
sion of a fluorescein (6-FAM) placed at 
the 5' end. This implies that a single- 
stranded, double-labeled oligonucle- 
otide must be able to adopt conforma- 
tions where the TAMRA is close to the 5' 
end. It should be noted that the decay of 
6-FAM in the excited state requires a cer- 
tain amount of time. Therefore, what 



TABLE 3 Comparison of Fluorescence Emissions of Single-stranded and 
Double-stranded Fluorogenic Probes ^ 



518 nm 



582 nm 



RQ 



Probe 


ss 


ds 


ss 


ds 


ss 


ds 


Al-7 


27.75 


68.53 


61.08 


138.18 


0.45 


0.50 


Al-26 


43.31 


509.38 


53.50 


93.86 


0.81 


5.43 


A3 -6 


16.75 


62.88 


39.33 


165.57 


0.43 


0.38 


A3-24 


30.05 


578.64 


67.72 


140.25 


0.45 


3.21 


P2-7 


35.02 


70.13 


54.63 


121.09 


0.64 


0.58 


P2-27 


39.89 


320.47 


65.10 


61.13 


0.61 


5.25 


PMO 


27.34 


144.85 


61.95 


165.54 


0.44 


0.87 


P5-28 


33.65 


462.29 


72.39 


104.61 


0.46 


4.43 



(ss) Single-stranded. The fluorescence emissions at 518 or 582 nm for solutions containing a final 
concentration of 50 nM indicated probe, 10 mM Tris-HCl (pH 83), 50 mM KC1, and 10 ihm MgCl 2 , 
(ds) Double-stranded. The solutions contained, in addition, 100 nM A1C for probes Al-7 and 
Al-26, 100 nM A3C for probes A3-6 and A3-24, 100 nM P2C for probes P2-7 and P2-27, or 100 nM 
PSC for probes PS-10 and P5-28. Before the addition of MgCl 2 . 120 pd of each sample was heated 
at 95°C for 5 min. Following the addition of 80 jjJ of 25 mM MgCl 2 , each sample was allowed to 
cool to loom temperature and the fluorescence emissions were measured. Reported values are 
the average of three determinations. 



matters for quenching is not the average 
distance between 6-FAM and TAMRA 
but, rather, how close TAMRA can get to 
6-FAM during the lifetime of the 6-FAM 
excited state. As long as the decay time of 
the excited state is relatively long com- 
pared with the molecular motions of the 
oligonucleotide, quenching can occur. 
Thus, we propose that TAMRA at the 3' 
end, or any other position, can quench 
6-FAM at the 5' end because TAMRA is in 
proximity to 6-FAM often enough to be 
able to accept energy transfer from an 
excited 6-FAM. 

Details of the fluorescence measure- 
ments remain puzzling. For example, Ta- 
ble 3 shows that hybridization of probes 
Al-26, A3-24, and P5-28 to their comple- 
mentary strands not only causes a large 
increase in 6-FAM fluorescence at 518 
nm but also causes a modest increase in 
TAMRA fluorescence at 582 nm. If 
TAMRA is being excited by energy trans- 
fer from quenched 6-FAM, then loss of ^ 
quenching attributable to hybridization 
should cause a decrease in the fluores- 
cence emission of TAMRA. The fact that 
the fluorescence emission of TAMRA in- 
creases indicates that the situation is 
more complex. For example, we have an- 
ecdotal evidence that the bases of the 
oligonucleotide, especially G, quench 
the fluorescence of both 6-FAM and 
TAMRA' to some degree. When double- 
stranded, base-pairing may reduce the 
ability of the bases to quench. The pri- 
mary factor causing the quenching of 
6-FAM in an intact probe is the TAMRA 
dye. Evidence for the importance of 
TAMRA is that 6-FAM fluorescence 
remains relatively unchanged when 
probes labeled only with 6-FAM are used 
in the 5' nuclease PCR assay (data not 
shown). Secondary effectors of fluores- 
cence, both before and after cleavage of 
the probe, need to be explored further. 

Regardless of the physical mecha- 
nism, the relative independence of posi- 
tion and quenching greatly simplifies 
the design of probes for the 5' nuclease 
PCR assay. There are three main factors 
that determine the performance of a 
double-labeled fluorescent probe in the 
5' nuclease PCR assay. The first factor is 
the degree of quenching observed in the 
intact probe. This is characterized by the 
value of RQ , which is the ratio of re- 
porter to quencher fluorescent emis- 
sions for a no template control PCR. In- 
fluences on the value of RQ" include 
the particular reporter and quencher 
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FIGURE 3 Effect of Mg 2+ concentration on RQ ratio for the Al series of probes. The fluorescence 
emission intensity at 518 and 582 nm was measured for solutions containing 50 nM probe, 10 mM 
Trls-HCI <pH 8.3), 50 mM KCl # and varying amounts (0-10 mM) of MgCl 2 . The calculated RQ 
ratios (518 nm intensity divided by 582 nm intensity) are plotted vs. MgCl 2 concentration (mM 
Mg). The key (upper right) shows the probes examined. 



dyes used, spacing between reporter and 
quencher dyes, nucleotide sequence 
context effects, presence of structure or 
k>ther factors that reduce flexibility of 
Rhe oligonucleotide, and purity of the 
probe. The second factor is the efficiency 
of hybridization, which depends on 
probe T m , presence of secondary struc- 
ture in probe or template, annealing 
temperature, and other reaction condi- 
tions. The third factor is the efficiency at 
which Taq DNA polymerase cleaves the 
bound probe between the reporter and 
quencher dyes. This cleavage is depen- 
dent on sequence complementarity be- 
tween probe and template as shown by 
the observation that mismatches in the 
segment between reporter arid quencher 
dyes drastically reduce the cleavage of 
probe. (1) 

The rise in RQ~ values for the Al se- 
ries of probes seems to indicate that the 
degree of quenching is reduced some- 
what as the quencher is placed toward 
the 3' end. The lowest apparent quench- 
ing is observed for probe Al-19 (see Fig. 
3) rather than for the probe where the 
TAMRA is at the 3' end (Al-26). This is 
understandable, as the conformation of 
the 3' end position would be expected to 
be less restricted than the conformation 

I of an internal position. In effect, a 
Quencher at the 3' end is freer to adopt 
conformations close to the 5' reporter 
dye than is an internally placed 
quencher. For the other three sets of 



probes, the interpretation of RQT values 
is less clear-cut. The A3 probes show the 
same trend as Al, with the 3' TAMRA 
probe having a larger RQ~ than the in- 
ternal TAMRA probe. For the P2 pair, 
both probes have about the same RQ" 
value. For the P5 probes, the RQ" for the 
3' probe is less than for the internally 
labeled probe. Another factor that may 
explain some of the observed variation is 
that purity affects the RQ" value. Al- 
though all probes are HPLC purified, a 
small amount of contamination with 
unquenched reporter can have a large ef- 
fect on RQ". 

Although there may be a modest ef- 
fect on degree of quenching, the posi- 
tion of the quencher apparently can 
have a large effect on the efficiency of 
probe cleavage. The most drastic effect is 
observed with probe Al-2, where place- 
ment of the TAMRA on the second nu- 
cleotide reduces the efficiency of cleav- 
age to almost zero. For the A3, P2, and P5 
probes, ARQ is much greater for the 3' 
TAMRA probes as compared with the in- 
ternal TAMRA probes. This is explained 
most easily by assuming that probes 
with TAMRA at the 3' end are more likely 
to be cleaved between reporter and 
quencher than are probes with TAMRA 
attached internally. For the Al probes, 
the cleavage efficiency of probe Al-7 
must already be quite high, as ARQ does 
not increase when the quencher is 
placed closer to the 3' end. This illus- 



trates the importance of being able to 
use probes with a quencher on the 3' 
end in the 5' nuclease PCR assay. In this 
assay, an increase in the intensity of re- 
porter fluorescence is observed only 
when the probe is cleaved between the 
reporter and quencher dyes. By placing 
the reporter and quencher dyes on the 
opposite ends of an oligonucleotide 
probe, any cleavage that occurs will be 
detected. When the quencher is attached 
to an internal nucleotide, sometimes the 
probe works well (Al-7) and other times 
not so well (A3-6). The relatively poor 
performance of probe A3-6 presumably 
means the probe is being cleaved 3' to 
the quencher rather than between the 
reporter and quencher. Therefore, the 
best chance of having a probe that reli- 
ably detects accumulation of PCR prod- 
uct in the 5' nuclease PCR assay is to use 
a probe with the reporter and quencher 
dyes on opposite ends. 

Placing the quencher dye on the 3' 
end may also provide a slight benefit in 
terms of hybridization efficiency. The 
presence of a quencher attached to an 
internal nucleotide might be expected to 
disrupt base-pairing and reduce the T m 
of a probe. In fact, a 2°C-3°C reduction 
in T m has been observed for two probes 
with internally attached TAMRAs. (9) This 
disruptive effect would be minimized by 
placing the quencher at the 3' end. Thus, 
probes with 3' quenchers might exhibit 
slightly higher hybridization efficiencies 
than probes with internal quenchers. 

The combination of Increased cleav- 
age and hybridization efficiencies means 
that probes with 3' quenchers probably 
will be more tolerant of mismatches be- 
tween probe and target as compared 
with internally labeled probes. This, tol- 
erance of mismatches can be advanta- 
geous, as when trying to use a single 
probe to detect PCR-amplified products 
from samples of different species. Also, it 
means that cleavage of probe during PCR 
is less sensitive to alterations in an- 
nealing temperature or other reaction 
conditions. The one application where 
tolerance of mismatches may be a disad- 
vantage is for allelic discrimination. Lee 
et al. (1> demonstrated that allele-specific 
probes were cleaved between reporter 
and quencher only when hybridized to a 
perfectly complementary target This al- 
lowed them to distinguish the normal 
human cystic fibrosis allele from the 
AF508 mutant. Their probes had TAMRA 
attached to the seventh nucleotide from 
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the 5' end and were designed so that any 
mismatches were between the reporter 
and quencher. Increasing the distance 
between reporter and quencher would 
lessen the disruptive effect of mis- 
matches and allow cleavage of the probe 
on the incorrect target. Thus, probes 
with a quencher attached to an internal 
nucleotide may still be useful for allelic 
discrimination. 

In this study loss of quenching upon 
hybridization was used to show that 
quenching by a 3' TAMRA is dependent 
on the flexibility of a single-stranded oli- 
gonucleotide. The increase in reporter 
fluorescence intensity, though, could 
also be used to determine whether hy- 
bridization has occurred or not. Thus, 
oligonucleotides with reporter and 
quencher dyes attached at opposite ends 
should also be useful as hybridization 
probes. The ability to detect hybridiza- 
tion in real time means that these probes 
could be used to measure hybridization 
kinetics. Also, this type of probe could be 
used to develop homogeneous hybrid- 
ization assays for diagnostics or other ap- 
plications. Bagwell et al. <10) describe Just 
this type of homogeneous assay where 
hybridization of a probe causes an in- 
crease in fluorescence caused by a loss of 
quenching. However, they utilized a 
complex probe design that requires add- 
ing nucleotides to both ends of the 
probe sequence to form two imperfect 
hairpins. The results presented here 
demonstrate that the simple addition of 
a reporter dye to one end of an oligonu- 
cleotide and a quencher dye to the other 
end generates a fluorogenic probe that 
can detect hybridization or PCR arriplifi- 
. cation. 
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We have developed a novel "real time" quantitative PCR method. The method measure* PCR product 
accumulation through a duaHahdtxI fluotogenle probe {i.e., TaqMan Prota). This method provides very 
accurate and reproducible quantitation of gene copies. Unlike other quantitative PCR methods, real-time PCR 
does nor require post-PCR sample handling, preventing potential PCR product carry-over contamination and 
resulting In much faster and higher throughput assays. The mMlm* PCR method has a very large dynamic 
ranee of starting target molecule determination (at least five orders of magnitude). Real-lime quantitative 
PCR is extremely accurate and tesslaboMn tensive than current quantitative PCR methods. 



Quantitative nucleic acid sequence analysis has 
had an important rote in many fields of biologi- 
cal research. Measurement of genu expression 
(UNA) has bee.n used extensively In monitoring 
biological responses to various stimuli (Tan el ai. 
1994; Huang et at. I995a,b; Prud'homme et al. 
1995), Quantitative gene analysis (DNA) has 
Jx/cn used to determine the genemu* quantity of 3 
particular gene, as in the case, of the human HKR2 
gene, which Is amplified in -30% of breast tu- 
mors (Slamon et aL 1987). Gene flnd genome 
quantitation (DNA and UNA) also have been used 
for analysis of human immunodeficiency virus 
UliV) bwden demonstrating changes in the lev- 
els of vi rus throughout the different phases of the 
disease (Connor et al. 19°3; Platak ct al. jvv.sh; 
Purtadn et al. 199S). 

Many methods have been described for the. 
quantitative analysis ot nucleic acid sequences 
(both for RNA and DNA; Southern 1 W6; Sharp el 
al. 19K0; Thomas 1980). Recently, PCR has 
proven to be a powerful tool fOT quantitative 
nucleic acid analysis. PCJR and reverse transcrip- 
tase (KT)-PCR have permitted the analysis of 
minimal starting quantities of nucleic acid (as 
little as one cell equivalent). This has made. i>os- 
siblc many experiments that could not have been 
performed with traditional methods. Although 
PCR has provided a powerful tool, it is imperative 



that it he U5«U properly for quantitation (tt»«y- 
maekers 1995). Many early reports of quantita- 
tive PCR and RT-PCR described quantitation of 
the VCR product but did not measure the initial 
target sequence quantity. It is essential to design 
proper controls for the quantitation of the initial 
target sequences (Pcrrc 1992; dementi et al. 
100?) 

Ke.Mhitfchers have developed several methods 
of quantitative PCR and RT-PCR. One approach 
measures PCR product quantity in the log phase 
of the read ton before the plateau (Kellogg et aL 
1990; Pang ct a). 1990). This method requires 
that each sample has equal Input amounts of 
nucleic- add and that each sample undev analysis 
amplifies with identical efficiency up to the. point 
of quantitative analysis. A gene sequence (con- 
tained in all samples at relatively constant quan- 
tity, such as p-aclin) can bo used for sample 
amplification efficiency normalization. Using 
conventional methods of PCR detection and 
quantitation (gel electrophoresis or plate capture 
hybridization), it is extremely laborious to assure 
that all samples are analyzed during the log phase 
of the reaction (for both the target gene, and the 
normalization gene). Another method, quantita- 
tive competitive (QQ'VCR, bas been developed 
and is used widely for PCR quantitation. QC-i'CR 
n:lies on the inclusion of an internal control 
competitor in each reaction (Becker-Andre 1991; 
Hatak cl al. 1993<*,1>). The efficiency of each re. 
action is normalized to the internal competitor. 
a wnnum airmuut oi Internal competitor can be 
aORH Z0S6 092, 6*6 Tfd 6S:*T Z00Z/S0/ZT 
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added to oach sample.. To obtain rotative rmani- 
ration, the unknown target PGR product is com- 
pared with the known competitor I'CU product. 
Success of a quantitative competitive i'CU assay 
relies on developing an internal control that am- 
l»1ifir.s with the same efficiency as the Uugm 
eculc. The design of the coiupctltoi *md the val- 
uation of amplification efficiencies jcqtdrc n 
dedicated effort. However, because QCM'CR does 
not require lhat PCK jnoducts be analyzed during 
the Joe, phase of Uiv. amplification, it is tin: easier 
Kit the two methods to use. 

Several detection rtystcius uiv used for quan 
Utative l^K and RT-!*CU analysis: (1) agin-oAc* 
ycis, (2) fluorescent labeling of Kill products and 
detection with InseT-indueed fluorcacvncc \isUi£ 
capillar)' electrophoresis (hwseo el al. 1995} Wil- 
liams ct al. 1996) or acrylamldc gels, and (3) jj1«1c 
captures and sandwich probe hybrid l*a I Jot t (Mul- 
der ei al. 1994). Although these ijicOuKiypmvwJ 
successful, each method requires post -PCR ma- 
nipulations That acid Time to the analysis ami 
may lead to labor a tut y t oiilrtui iiiation. The 
sample throughput of these iiiellitals i> limited 
(wil)t I he i'xccpllcm of the plate capture ap- 
proach)* jtuu\, therefore, these methods ore not 
well suited fin um,^ demanding high so m pic 
throughput (I.e., screening of large numbers of 
bli>tliwl«n.ulc:> 111 jiiitty/.ln^ samples fuj diagnos- 
tics or clinical tri«ils), 

1 1 ere. we report the development of ;t novel 
n*s&y for quantitative DNA analysis. The assay is 
hasted on the us*-, of the 5' nuclease assay first 
described by Holland ct al. (1993), The method 
uses the 5' tiuclca.se activity of 7W</ polymerase to 
cleave a noncxtcndlblc hybridisation probe dur- 
ing the extension phase of 1'OU. The approach 
uses dual-labcktd fluorogcnic hyhridi/.atlon 
probes (Lcc ct a). 1993} Gassier ct al. 1995; hivok 
et al. 199&o f b). One fluorescent dye serves «s a 
reporter |FAM (i.e., ^-carboxyHuorvsceio)! and its 
emission spectra is quenched by the second fluo- 
rescent dye, TAMRA (I.e., o-carboxy-tetramcthyl- 
lliodaminc). The nuclease degradation of the hy- 
hrtdixiitton probe releases the quenching of Die 
I'AM fluorescent emission, resulting in an In- 
crease hi peak fluorescent emission at Sjy run. 
The use of <i sequence detector (AUI Prism) allows 
measurement of fluorescent spectra of all 90 wells 
of the thermal cycler continuously during the 
l*OR amplification. Therefore, the reactions aje 
mouitorcU 111 real lime. The output data is de- 
scribed and quantitative analysis of input target 
I )N A sequences is discussed below. 



RESULTS 



PCR Product Derectlon in R«ai Time 

'Hie goal was to develop a high-throughput, sen- 
sitive, and accurate gene quant 11 at I on as«ay for 
use In monitoring lipid mediated tlwapcuTic 
gene delivery, A plasm kl encoding human factor 
V1I1 gene sequence, pl-'8TM (see. Methods). w;is 
used as a model ihcrapctitic gemi. The assay us^ 
fluorescent Taqman methodology and an instru- 
ment capable of measuring fluorescence- in real 
time (AUI Prism 7700 Sequence Ddcclor). The 
Taqman reaction requires a hybridization probe 
lalxled with two different fluorescent dyes. One 
dye is a reporter dy« (I'AM), the other ix X quench- 
ing dye (TAMRA). When the pruln: \s intact, fluo- 
icsccni energy transfer occurs and the reporter 
dye fluorescent emission is absorbed by the 
quenching dye (TAMRA). During Die extension 
phase of the VCR cycle, the fluorescent hybrid- 
IxAlion probe Is clcaveil by the 5'-3* nucleolytic 
aclivity of the ONA polymerase. On cleavage of 
the probe, the reporter dye emission is no longer 
transferred efficiently to the quenching dye, re 
suhiitK hi an increase- of the reporter dye fluores- 
cent cmtjuilcm apectra. VCIK primers uud probes 
were designed fm thu iiumao factor VJ1J se- 
quence and human p-aetln gene (as de.nenbed in 
Methods). Optimization reactions were per- 
formed to choose the appropriute probe and 
magnesium concentrations yielding the iijej«".$* 
Intently of re|>ortcr fluorescent signal without 
sacrificing specificity. The Instrumenl ur.es a 
charge -co 1 j pled device (i.e., CCD camera) for 
measuring the fluorescent emission apeetm from 
FMt to C,$0 nm. Kach VCM tube was monitored 
sequentially for 2f> nvsee with continuous moni- 
toring throughout the aniplificatiuii. Each tube 
wa.% rc.-exandncd every tl*5 see. Computer soft- 
ware, wan deigned to examine ilie fluorescent \n- 
tenslty of both the rej>orter dye (I'AM) and 
the quenching dye (TAM11A). *J*hc llu«resc.cnt 
intensity of the quenching dye, TAMUA, changes 
very Uul« over the course of the PCR ampHfl* 
cation (data not shown). Therefore.! the intensity 
of TAMKA dye emission serves hh an Internal 
standard with which lo nortnullyxs the reporter 
vlye (l ? AM) emission variations. The »oftw«re cal- 
culoles u value termed AUn (or ARQ) uslnj; the 
following equation; ARn - (Hn J ) (ft"*")! where 
Kn 4 urnlMKlou intensity of roiKirler/emissiou in- 
tensity of quencher at any given time In ft retie 
tlon tube, anil Ru - emission intemility of re- 
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porter/ omission hut-mi l y of quencher measured 
prior lo I'CH itmpiiiication in that same reaction 
tube. Yot tlie purpose of quantitation, the last 
three data points (ARns) collected d(jring the ex* 
tension step for each I J Ck cycle wore analyzed. 
The micleolytic degradation of the hybiidix-ition 
probe occurs during the extension phase- of niu,- 
and, therefore, reporter fluorescent cmi^juiH In- 
crease!; during this time, 'nit: iliiw data ]>olntM 
were averaged for each K-*K cycle and the mean 
value for each was plotted in an "amplification 
plot" shown In J'igurC 3 A- Tlie AKn mean value is 
plotted on the j^axis, and time, represented by 
cycle number, is plotted on the* A-axis. During the 
early cycles of the T'CU amplification, die AKn 



value remains at bast Jine When .sufficient hy- 
bridisation probe has been cleaved hy the 7wf 
polymerase miclwwe activity, the intensity of ro- 
porUr /lwor«<:crti emission in<:rew*vtM Most K'U 
anipliH^iions reach i» plateau phaNe of reporter 
fJuorcMxnit emission U the reaction 1* carried out 
to high cycle muiuVis. The amplification plot lit 
examined vuiiy in tlut reaction, at a point thai 
■ (.•presents the log phaw of .prixkicf arnnnula* 
lion. 'This Js done hy assigning an ai biliary 
'threshold thai in based on the variability of the 
base-line did*- In MgUTG 1A, the Ihfftshold was set 
at 10 standard deviations above, the mean of 
base linfl embwuiM talrulated from tiydcn 1 lo 1 5. 
Once the threshold is chosen, the point at which 
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Figure 1 PCR product detection in real time {A) The Model 7700 Mjftware wilt construct amplification ploti 
from the extension phase fluorescent emission data collected during the PCR amplification* The standard de- 
viation is determined lrom the data points collected from thfi base line of the amplification ploL C, values are 
calculated by determining the point at which the fluorescence exceeds a threshold limit (usually 10 times the 
Standard deviation of the base line), (fi) Overlay ot amplification plots of serially (1:2) diluted human genomic 
DNA samolcs amplified with p-actin primers. (Q Input DN/V concentration of the samples plotted versus C T . All 
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the amplification plot crosses the threshold' ivtlf 
fined as C P C, is reported an ihc cycle number at 
thi?; point. Ar vulll be demonstrated, flu* tl t value 
In piedicilve of the quantity of input target. 

C,- Values Provide a Quantitative Measurement* o>' 
Input Target Sequences 

Plgurc IB shows amplification plot* of li«>di¥fc»'- 

ent PGR amplifications overlaid, 'i*ho amplifcuv 
tlons were performed on a 1:2 serial dilution •«« 
human genomic PNA. i*hc amplified target w:u. 
human 3 actln. The- amplification plofti xhifl to 
the right (to higher threshold cycles) ns the input 
target quantity fc reduced. 'Jlw. is exacted ho- 
eauxu ruactlnnti with fewer starting eopuw of the 
target molecule require greater amplification to 
degrade enough probe to attain the Threshold 
fluorescence; An arbitrary threshold of 10 stan- 
dard deviations above the base Jine was used to 
determine the O r values. Figure 1C represents the 
C r values plotted versus ihe sample dilution 
value, Each dilution was amplified in triplicate 
VCR amplifications and plotted as mean values 
with error bars representing one standard devia- 
tion. The C T values decrease linearly wjtb increas- 
ing target quantity, Thus, C r values can be used 
as a quantitative measurement of the input target 
number. It should be noted that the amplifica- 
tion plot for the 15.6- ng sample shown In Plgure 
IB does not reflect the same fluorescent rate of 
Increase exhibited by most of the other samples. 
The 15«6-ng sample also achieves endpoinl pla- 
teau at a lower fluorescent value than would be 
expected based on the input DNA. This phcuom. 
cnon has been, observed occasionally with other 
samples (daia not shown) and may be attribut- 
able to late cycle inhibition; this hypothe sis is - 
sun under Investigation. It is important to note 
that the flattened slope and early plateau do not 
impact significantly the calculated O, value as 
demonstrated by the 111 on Ihe Hue shown In 
Figure l C. All triplicate amplifications resulted in 
very similar Or values— the standard deviation 
did not exceed 0.5 for any dilution. This experi- 
ment contains a >100,00G~fold range of input tar- 
get molecules. Using C v values for quantitation 
permits a much larger assay range than directly 
using total fluorescent emission intensity for 
quantitation. The -linear range. ol iluorcsccni in- 
tensity mea.Mire.meni of the All! I'rlam 7700 iic- 

aoroi 



rmm'tR over a very large ^n|»c of r*1ativf» siarring 
target quantities. 

Sample Preparation Validation 

Several parameters influence the ef'Hclenry nf 
PGR amplification; magnesium and sail cuiicen» 
nations, reaction conditions (i.e., time and tem- 
perature), PCH target size and composition, 
primer sequences, and sample purity. All of The 
above (actors are common to a single PGR assay, 
except sample to sample purity, in an effort to 
validate the method of sample preparation fox 
the iacior Viil assay, PGR amplification reprodnc.. 
ihility and eifldcncy ol 30 replicate sample 
pre] larati oris wen* examined. After genomic DNA 
was prepared from the 10 replicate samples, the 
DNA was qunnUtalcd by ultraviolet spectroscopy. 
Amplifications were performed analyzing p-acdn 
gene content in 100 and 25 ng of total genomic 
DNA. Each PGR amplification was performed in 
triplicate. Comparison of C r values for each trip, 
iicate sample show minimal variation based on 
standard deviation and coefficient of variance 
(Table 1). Itjercforc, each ol the triplicate PGR 
amplifications was highly reproducible, demon- 
strating that real time PGR using this instrumen- 
tation introduces minimal variation into the 
quantitative PGR analysis. Comparison of the 
mean C n values of the 10 replicate sample prepa- 
rations also showed minimal variability, indicat- 
ing that each sample preparation yielded similar 
results for H-acUn gene quantity. The highest in- 
difference between any of the samples was 0.f>5 
and 0.71 for the 100 and 25 ng samples, respec- 
tively. Additionally, the amplification of each 
sample exhibited an equivalent rate of fluorcv- 
cent emission intensity change per amount of 
DNA target ana)y7ed as indic&icd by similar 
slopes derived from Ihc sample dilutions (Pig. 2). 
Any sample containing an excess of a PC)< inhibi- 
tor would exhibit a greater measured 0-actln C r 
value for a given quantity of DNA. in addition, 
the inhibitor would be diluted along with the 
sample in the dilution analysis {H#. 2), altering 
the expected C, value change. Each sample am- 
plification yielded a similar result in the analysis, 
demonstrating that tins method of sample prepa- 
ration is highly reproducible with regard to 
sample purity. 

Quantitative Analysis of a Plasmid After 
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Toblo 1. Roprodu<JbMltv of S*mple Preparation Method 



1 

2 
3 
4 
5 



8 
9 
10 

Mean 



100 ng 



25 ng 



Samplo 

no. C T 



standard 
m^an deviation 



CV 



18,24 
18.23 
.10,33 
18.33 
18.35 
18.44 
18.3 
18.3 
1«M2 
18.15 
18.23 
1S.32 
18.4 
18.38 
18.46 
18.54 
18.67 

18.28 

18.36 

18.52 

18,45 

18.7 

18.73 

18.18 

18.34 

18.36 

18.42 

18.57 

0 10) 



1W.27 

0.06 

18.34 0.07 

18.23 0.08 

1B.42 0.04 

18.74 0.24 

18.39 0.12 

18.63 0.16 

18.29 0.1 

18.55 0.12 

18.12 0.17 



0.32 

03? 

0,36 

0.46 

0.23 

1.26 

0.66 

0.83 

0.M 

0.65 
0,90 



20.48 

20.55 

20,5 

20.61 

20.59 

20.41 

20.54 

20.6 

20.49 

20.48 

20.44 

20.38 

20.68 

20.87 

20.63 

21.09 

21.04 

21.04 

20.67 

20,73 

20.65 

20.98 

20.84 

20.75 

20.46 

20.54 

20.48 

20.79 

20.78 

20.62 



standard 
mean deviation CV 



20.51 0.03 0.17 

?0JH 0,11 0.54 

20.54 0.06 0,28 

20.43 0.05 0.26 

20.73 0.13 0.61 

21.06 0.03 0.15 

20.68 0.04 0.2 

20.86 0,12 0.57 

20.51 0,07 0.32 

20.73 0.1 0.16 

20.66 0.19 0,94 



tor containing a partial cUNA for human factor 
vm, pi-'oTM. a scries of tniosfcctions wa«t sei 
up using a decreasing Amount of ihc pIasmid*H0, 
4, U.5, and O.I y,g). Twenty-four hours posl- 
travttfertinn, total DMA was purified <tom each 
flask uf-irlh. p-Acliu gcue.quajilUy wai chUMrii a?> 
a value for normal i/al'itm of g^t juiftfc.ONA con- 
cenrraUon frum each sample. In lids cxpexjujcnt, 
fi-actin gene content should remain constant 
relative to total genomic UNA. Figure 3 shows the 
result of the p-actln JDNA measurement (100 ng 
total DNA determined hy ultraviolet spectros- 
copy) 01 each sample. Kadi sample was analyzed 
in triplicate and trie mean p-actin G,. values of 
the triplicates were plotted (error bars represent 
^« r*"j^#i«fri H«*<t/iaiiom "I h*» htntiPM- rtiffprrnrr 



between any iwo sampla moan ft was Q.9S Ten 
nanograms of tots! UNA of each sample were also 
rxauihictd f<>r fi-actin. The results again showed 
that very similar amount a of genomic 1>NA were 
present; the maximum mean |* actio C: x value 
difference wa.s 1 .0. As Figure 3 shows, the rale of 
p-actln C r change between the J 00 and 10-ng 
sample* was similar (slope values rangw between 
3,56 anJ - 3,45), ThU verifies again lhal "the 
method of sample preparation yields samples of 
identical PCR integrity (j,e~ # no sample contained 
an excessive ainuunl of a PCR Inhibitor). How. 
ever, these results indicate that each sample con 
taincd slight diffeiences in the adwol amount of 
gcnumJc DNA analysed. Determination of actual 
ucuomic i>NA concentration was accomplished 
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Figure 2 S<ji i tple preparation purity. 1 he replicsta 
samples shown In Tabic 1 wore also amplified In 
topical e using 25 ng of each DNA sample, The fig* 
ui* shows die input DNA concentration (TOO and 
25 ng) vs. C, In irw- ftrjnrp, ih*> 100 and P5 ng 
points (or *ach sample are connected by a line. 



hy plotting the mean £-actio O, value obtained 
for i»at:h 100* ng sample on a p-actin stand*™! 
i-urve (shown in J'Sg- 40). The actual genomic 
1>NA coricciitratl"" of each sum pic, was ob 
talned l>y extrapolation to thu X-axii. 

Figure: 4A shows the .measured. (l-u- f n«n* 
normalised) quaulilie.N «f factor VII J plasm id 
ONA (preTM) frcun each' of the four transient cell 
Irai infection*. Each reaction contained 100 ng of 
total sample DNA (as determined by UV speetrov 
copy)- r <ach sample was unalyzed hi tri pintle 
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Figure 3 Analysis of tidiisfectcd cdl DNA quantity 
and purity, the DNA preparations of the four 293 
cell transactions (40, 4, 0.5, and 0.1 ng of pFSTM) 
were analyzed for the 0-actln gene. 100 and 10 ng 
(determined by ultraviolet spectroscopy) of each 
sample were amplified in triplicate. For each 
amount of pF8TM trial was transacted, the fi-aaln 
C T values are plotted versus the total Input DNA 



90RH 



1>C:U amiplificatiottsc. As shown, pI*'8TM purified 
,hoic Jbc 20;< cells decreases (mean C, values in- 
cruwi*.) with decreasing amounts of pi asm (d 
,truitsfc'UCCL The mean C A values obtained for 
pKfcTM in Tigurc 4A were plotted on a standard 
curve comprised uf scilully diluted pKHTM, 
shown .in figure 4R. The quantity uJ pltti'M, h, 
found in each of the four transection* was do 
termined by extrnpoJation to the* * axlt; of tho 
fiiandard curve in l'igurc 4H. 'Hutsc uncorrected 
values, h, for pKHTM wen? nor mailed to del er- 
mine the actual amount of pl'8'lM found per 100 
ng of genomic DNA by using the equation:. 

b X 100 ng ucluial pl-*8TM copies per 

~ 100 ng of genomic DNA 

where a - actual genomic DNA in u sample and 
/;i-.pl ; HTM copies from the standard curve. The 
normalised quantity of pl'STM per 100 ng of ge- 
nomic DNA for cacti 0/ the four inmVfcctlons is 
shown in Figure 'Hicmi roult?» alien* mat the 
quantity of factor vin plasiuid a^>cJa<ed wnh 
the cens # Z\ \\r after irunsfv:e.iUin, d^.ieus^. 
wltti decreasing pla^mai u«Jt.«iUiatJou used In 
the transection. The qu«Hlty 0/ pl'BJ M ossocJ- 
atcd with 293 cells, ciftcr transfecUon vvlvh 40 ixg 
Of pia-isniid, was 35 pg pvr 100 ng genomic DNA. 
Tills results in -520 plasmid copies per cell. 



DISCUSSION 

Wo have described a new method for quantitnt- 
ing gene copy numbers using fcaMimc analysis 
of PCR amplifications. ReaMlmc PCK is compat- 
ible with cJtbtn- of the two PGR (KT-PCR) ap- 
proacho: (1) quantitative conifaetitive where an 
internal wmpcllicji' for each target sequence i» 
used for normaliJUitJon (data not shown) or (2) 
quantitative comparative PCK using *» iuiiin*iliza- 
tion gene contained within the sample (i.e., pec- 
tin) or a "housekeeping" gene for RT-PCK. If 
equal amounts of nucleic add are analyzed for 
each sample and if the amplification cf/kimcy 
before quantitative analyst o identical for each 
sample, the Internal cunhul (nurmaH^liou gene 
or competitor) should give equal signals for flli 
samples. 

The real-time PCK method offers several ad- 
vantages over the other two methods currently 
employed (t*ie ihe Introduction). l : irst, the real- 
time PCR method is performed in a doscd-tube 
system and requires no post-PCR manipulation 
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Figure 4 Quantitative anolyiii* of pFSTM in transfccted eclb. </4) Amount of 
plasmid DMA used for (he transaction plotted against the m«un C, value deter. 
min«d for pf#TM remaining ;H hr alter transection. (B,Q Standard curves of 
pfWM and fi-actin, respectively. pfftTM DNA (0) and gcnorrirc. PNA (Q were 
diluted *Arfally 1 ;i before amplification with the appropriate primer*. The f*-actiii 
standard curv© way usod to normalise the results oM to 100 rig of genomic DNA. 
(0) The amount of pF8TM present per 1 00 no, of genomic DNA, 



of sample. Therefore, I h*» potential for TCH con- 
tamination in the laboratory is reduced because 
amplified products can he. aiialywd and disposed 
of without opening the reaction tubes. Second, 
this method suppoiU the u.se of a normalisation 
gene (Lc, fi-actin) for quantitative. PCR or house- 
keeping genes for cjoantitntive RT-1'CK controls. 
Analysis Is performed in real time during the log 
phase of product accumulation. Analysis during 
ki>; phase permits many different genes (over a 
wide input target range) to be analy/ccf simulta- 
neously, without concern of reaching reaction 
plateau at different cycle*. This will make Jiuilll- 
gene analysis assays much caMei tv develop, be- 
cause individual internal compeilOu* will nut l>c 
necded for each gene under analysts. Third, 
sample throughput will inoeasc dianialiially 
with the new method because, there is no post- 
PCK procendng time. Additionally, walking In a 
^6-wcll format is highly compatible, with auto* 
tuation technology, 

The real-time PCR method is highly 'repro- 
ducible. RepMcate amplifications can be analyzed 



for ?nch sample minimizing potential error. The. 
system allows* for a very large assay dynamic 
range (approaching l,OOO,OO0-fold starting Uh- 
got). Ualiig u st at ul a rd curve for the. target oi in- 
terest, relative copy number values can be deter- 
mined for any unknown Nam pie. Fluorescent 
threshold values, C, v conrJair. linearly with rela- 
tive DNA copy numbers. Heal time quantitative 
U'M'CK methodology (Ciihson ct al., this Issue*) 
ha* aho been developed, finally, real time quai^ 
titativc PCU methodology can be used lu develop 
high-throughput screening assay* for n variety of 
applications [quantitative gene c*pjea&ion (RT- 
PCR), Rene copy aanuys (Hcrfc, I31V, etc.), gcni> 
typlng (knockout mouse analysis), and Immuno- 
PORJ. ' 

Real-time POl may also he performed using 
intercalating dyes (Higuehi ct ul, WXJ.) such as 
ciJtJditim bromide. The fluorogenic probe 
method offers a major advantage over inter- 
calating dyes- 'greater specificity (i.e., primer 
dimvrs and nonspecific PCR products are noi de- 
tuned). 
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METHODS 

Generation of <t Piasmld Containing a Partial 
cDNA far Human Factor VIII 

Total RNA w<o harvested t,ttNA*nt » ' Test, h»c, 

rrjendswood, TX) frutu w\h n*iwfe<iwi with a factor VHI 
rxjiraykm vector, pC:iSZ.tk?.iU (Katoit et M. dor* 
man ct at. 1990). A factor VIII j>artlal t-llNA verpicmv WrtJi 
^c ricirnicd by HT K IM |<i«iicAmp \&. iTlh ItN* \*CM Kit 

(pan NoOK-0179, l't Applied liiosysicm;;, 1'omvi City, <iA)J 

Using the l'C:U priuiurs Wfor *n»d Pflrcv (|»rii"«*f Sequence* 

arc shown below), Tin* ampHcon was reamplifird uMng 
modified Itffor and Wrcv primers (appended with-nVtwIU 
jind HmdIH rcslricllon site sequences «t tliv V cn<J> »mrV 
clonal Into jXiKM- 3Z (Promt^u Corp., Madison, Wl). The 
resulting clone, pWSTM, was uwtl lor transient transfectJon 
of 293 ccJIv 



Amplification of Target DNA ami Detection of 
Amplicon Factor VHI Masmid DNA 

(pWM) was <iinp1lfi»l with tlnr [«ii*»v-i> 1'Bf<#f 5 '-<;<;< 

crrnc;<;AA<.iAu:iXJA(XiiCiTC<3' and p*rev 5'-aaa<:c;j'- 

t^OCCrroCiATCUrrACJCi-^'.'Ilic rvnclluu p/oduvrd « 
up KM product. The forwiird primer w«j> denned lauv 
ognlzc tt unique M'lpieme- Ann id h i (he 5' untranslated 
rejpon of the paieiu pC132.tkZ51> pldMimt find ilmreforc 
doe* not k'vwknUi: and amplify iIk- human factor VIII 
gene, I'rimorft woro chosen witli the **sift*nrc of I he com- 
puter program Oliga l*n {Niitiwud UiuNcicnecs, lnc„ Ply. 
mouth, MN). The human p-acthi g<*ne was Amplified with 
the primer* U-tu-iin forward primer <S* TCACCCACA< rr<:T 
GCCCATOI'ACOA-.V and ^-actin reverse p#iwer .S'-CAf;. 

CCGAACCX:(rix;A!*r(;<:c^A'j'GG-3\ The reaction pro- 
duced a 295- hp rc:k product. 

Amplification reactions (SO p.!) contained a DNA 
sample, 10 x P<;it ituffcr II (»S 200 jlm dATP, dCTl\ 
dGTP, and 400 |vm dUTI», 4 mx< Mjn%, 1.2.S Unils Ampll 
7W<; r;NA polymerase, 0,5 unit AmpKrnw uracil fV-j;ly- 
wwyiuw,' <UN0), £0pt»ok*of cftch f«cloi VIII jirlmci, und 15 
P«ih»Ii* of tiacti ft ACtln piinic^. 'Huf ioat tl<M^ i»»n(olucd 
One Of lh(.' following d^CCt Inn prolw»K (HMi rtM nu ll): 

i'&j>n»ii<- A'(i'AM)Ac:crjYri , c:cu<:crr<ifrn , <:T , rr<:TCT- 

OCCTT(TAMRA)p 3' «ud p-«<.-tiu probe 5 f (TAM>ATCf.X:c- 
XCrAMKAJCCCCCATCCCATCp-.T where p indicates 
pho5phnryI/kiion nnd X indicates a linker arm nucleotide. 
Reaction M>o w«rt' MicrtiAmp Optit^t Tubes (par I rium* 
bcr NK01 0*m, Pcrkln Ulxnuf) that wore frost (at IVrW" 
r.lmcr) to |>i\-v<-ii* li^iil from /c fleeting. *)\il>c capi were 
similar m MitToAmp Cnjta i>ul specially dwiflncd to pre- 
vent Uj;lit s*.«(tcr»nj(. All ol IK:!* i/j»*inul>U'* wcro »up- 
lAw.<\ Ivy PK Applied lljwyrttni* (h>*U*r f!Uy r CA) exccpl 
the factor VJlt primers, wlilcl* weir syniluvil/c-d ul Ccnvn 
lech, Inc. (Sou t>i $i"t Tfanclscoj CA). Prohev wm- dosl(<no<t 
using the OIjj;o 4.0 .noftworc, following gnldclliifv kiifc- 

jjcmoo in tnc Model 7700 .Sequence Detector Iii.kiiiuih'iiI 
manual. Urlefly, probe T m ^mtiUt !«• *U least $PC hty t \\ct 
than riir annual Iijji ttriupt'Mlurc; u.ied during (hcrmul cy- 
rhtig; primers sho\ild nui fuim sUh\v duplexw willi iln- 
pronr. 

The theriuol i'ycllng Cunditkm.s Included 2 jiiln ftt 
50 V C; and 10 niin at 95 W C. Th«rmal cycling procerdrd with 
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rcaclioin. wore perforated i'» I ho h4orU«l 7700 Sequence IX*- 
UtMft (Pt Apphed Ulusyvtvuiv), wltirh ctmtxUn a Cnw* 
Amp IU";U SyRtvn* VJ^OO. kt ; a<:llon condiliOi^ wi-rf pro* 
RruiituieU uti .i iWor Maeinti»h /100 (Apple C>.«npiHPr # 
Santa Clara, CA) linked directly io the Model VVfiA <;i»- 
ouonev Didffclor* Aiiaty«U *»f data w»* aUo |wrf/«rmcd on 
the Mm lfUonh e<jmpviter, ( :ollnrtlon ancl HiialycU c«fiw:4rc 
wu% dcveki|wd h\ PK Applied Blo<y*lt\ms. 

Traiufection of Cells with Factor VHI Construct 

Pour T17.S flask* of 293 colls (ATCX! CAU. 1571t), n human 
fciol kidney sutipeiifiion cell line, were ^uwn lo HOAMi.con- 
llweney And transfcefed pt-'KIM. Cells were grown in tho 
rollnwlng media; S0% HAM'S ¥\2 wlthotil GUT, 50% low 
glucose JXtJhcrWa modified Ka^lc medium (UMIIM) with, 
out glycine with sodium bicarb* mate, 10% letal bovine 
scruni, 2 iiim t-^iuUminc, and 1% penicillifi-strcptomy- 
vln. The media ww di^nfjcd 30 mtn l>c^m' Hk- iransfee 
lion. pPUTM DMA ttmountt* of 40, 4, OA, and O.J ^; were 
iiOiitMi h> ^.S ml of a solution containing 0.125 m CU*a 7 ; 
and t X MWUS. Hie four mixtures were left al rtwm tern- 

|wvt<jw ftfn 1U mJn and thieti iidded Hnipwlsc to the cells. 
The fl*i>H> m.-ii' a uii.ubalcd al 37°C and 5% CO. for 24 hr ( 
washed with PUS, and r<utu*pcndcd in PUS. The h'xim 
jn-ruK-d cells were divided into (dupioU and UNA wm c>/- 
touited Immediately iminj; IhvQiAump KUhkI Kit (QUfion. 
aimyrwrtb, CA), DNA w ( is <:luted Into 200 of 30 miM 
IWs-lia ol pll 8.0. 
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ABSTRACT Wnt family members are critical to many 
developmental processes, and components of the Wnt signal- 
ing pathway have been linked to tumorigenesis in familial and 
sporadic colon carcinomas. Here we report the identification 
of two genes, WISP-1 and WISP-2, that are up-regulated in the 
mouse mammary epithelial cell line C57MG transformed by 
Wnt-1, but not by Wnt-4. Together with a third related gene, 
WISPS, these proteins define a subfamily of the connective 
tissue growth factor family. Two distinct systems demon- 
strated WISP induction to be associated with the expression of 
Wnt-1. These included (/) C57MG cells infected with a Wnt-1 
retroviral vector or expressing Wnt-l under the control of a 
tetracylihe repressible promoter, and (it) Wnt-1 transgenic 
mice. The WISP-1 gene was localized to human chromosome 
8q24.1-8q24 3. WISP*1 genomic DNA was amplified in colon 
cancer cell lines and in human colon tumors and its RNA 
overexpressed (2- to > 30-fold) in 84% of the tumors examined 
compared with patient-matched normal mucosa. WISP-3 
mapped to chromosome 6q22-6q23 and also was overex- 
pressed (4- to > 40-fold) in 63% of the colon tumors analyzed. 
In contrast, WISP-2 mapped to human chromosome 20ql2- 
20ql3 and its DNA was amplified, but RNA expression was 
reduced (2- to > 30-fold) in 79% of the tumors. These results 
suggest that the WISP genes may be downstream of Wnt-1 
signaling and that aberrant levels of WISP expression in colon 
cancer may play a role in colon tumorigenesis. 



Wnt-1 is a member of an expanding family of cysteine- rich, 
glycosylated signaling proteins that mediate diverse develop- 
mental processes such as the control of cell proliferation, 
adhesion, cell polarity, and the establishment of cell fates (1, 
2). Wnt-1 originally was identified as an oncogene activated by 
the insertion of mouse mammary tumor virus in virus-induced 
mammary adenocarcinomas (3, 4). Although Wnt-1 is not 
expressed in the normal mammary gland, expression of Wnt-1 
in transgenic mice causes mammary tumors (5). 

In mammalian cells, Wnt family members initiate signaling 
by binding to the seven-transmembrane spanning Frizzled 
receptors and recruiting the cytoplasmic protein Dishevelled 
(Dsn) to the cell membrane (1, 2, 6). Dsn then inhibits the 
kinase activity of the normally constitutively active glycogen 
synthase kinase-3j3 (GSK-30) resulting in an increase in 
|3-catenin levels. Stabilized )3-catenin interacts with the tran- 
scription factor TCF/Lefl, forming a complex that appears in 
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the nucleus and binds TCF/Lefl target DNA elements to 
activate transcription (7, 8). Other experiments suggest that 
the adenomatous polyposis coli (APC) tumor suppressor gene 
also plays an important role in Wnt signaling by regulating 
£-catenin levels (9). APC is phosphorylated by GSK-3/3, binds 
to 0-catenin, and facilitates its degradation. Mutations in 
either APC or /3-catenin have been associated with colon 
carcinomas and melanomas, suggesting these mutations con- 
tribute to the development of these types of cancer, implicating 
the Wnt pathway in tumorigenesis (1). 

Although much has been learned about the Wnt signaling 
pathway over the past several years, only a few of the tran- 
scriptionally activated downstream components activated by 
Wnt have been characterized. Those that have been described 
cannot account for all of the diverse functions attributed to 
Wnt signaling. Among the candidate Wnt target genes are 
those encoding the nodal-related 3 gene, Xnr3, a member of 
the transforming growth factor (TGF)-/3 superfamily, and the 
homeobox genes, engrailed, goosecoid, twin (Xtwn ), and siamois 
(2). A recent report also identifies c-myc as a target gene of the 
Wnt signaling pathway (10). 

To identify additional downstream genes in the Wnt signal- 
ing pathway that are relevant to the transformed cell pheno- 
type, we used a PCR-based cDNA subtraction strategy, sup- 
pression subtractive hybridization (SSH) (11), using RNA 
isolated from C57MG mouse mammary epithelial cells and 
C57MG cells stably transformed by a Wnt-1 retrovirus. Over- 
expression of Wnt-1 in this cell line is sufficient to induce a 
partially transformed phenotype, characterized by elongated 
and refractile cells that lose contact inhibition and form a 
multilayered array (12, 13). We reasoned that genes differen- 
tially expressed between these two cell lines- might contribute 
to the transformed phenotype. 

In this paper, we describe the cloning and characterization 
of two genes up-regulated in Wnt-1 transformed cells, WISP-l 
and WISP-2, and a third related gene, WISP-3. The WISP genes 
are members of the CCN family of growth factors, which 
includes connective tissue growth factor (CTGF), Cyr61, and 
nov, a family not previously linked to Wnt signaling. 

MATERIALS AND METHODS 

SSH. SSH was performed by using the PCR-Select cDNA 
. Subtraction Kit (CLONTECH). Tester double-stranded 

Abbreviations: TGF, transforming growth factor, CTGF, connective 
tissue growth factor; SSH, suppression subtractive hybridization; 
VWC, von Willebrand factor type C module. 
Data deposition: The sequences reported in this paper have been 
deposited in the Genbank database (accession nos. AF100777, 
AF100778, AF100779, AF100780, and AF100781). 
+To whom reprint requests should be addressed, e-mail: dtane@gene. 
com. 
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cDNA was synthesized from 2 u-g of poly(A) + RNA isolated 
from the C57MG/Wnt-1 cell line and driver cDNA from 2 jig 
of poly(A) + RNA from the parent C57MG cells. The sub- 
tracted cDNA library was subcloned into a pGEM-T vector for 
further analysis. 

cDNA Library Screening. Clones encoding full-length 
mouse WlSP-1 were isolated by screening a AgtlO mouse 
embryo cDNA library (CLONTECH) with a 70-bp probe from 
the original partial clone 568 sequence corresponding to amino 
acids 128-169. Clones encoding full-length human WISP-1 
were isolated by screening AgtlO lung and fetal kidney cDNA 
libraries with the same probe at low stringency. Clones en- 
coding full-length mouse and human WISP-2 were isolated by 
screening a C57MG/Wnt-1 or human fetal lung cDNA library 
with a probe corresponding to nucleotides 1463-1512. Full- 
length cDNAs encoding WISP-3 were cloned from human 
bone marrow and fetal kidney libraries. 

Expression of Human WISP RNA. PCR amplification of 
first-strand cDNA was performed with human Multiple Tissue 
cDNA panels (CLONTECH) and 300 /iM of each dNTP at 
94°C for 1 sec, 62°C for 30 sec, 72°C for 1 min, for 22-32 cycles. 
WISP and glyceraldehyde-3-phosphate dehydrogenase primer 
sequences are available on request. 

In Situ Hybridization. 33 P-labeled sense and antisense ribo- 
probes were transcribed from an 897-bp PCR product corre- 
sponding to nucleotides 601-1440 of mouse WISP-1 or a 
294-bp PCR product corresponding to nucleotides 82-375 of 
mouse WISP-2. All tissues were processed as described (40), 

Radiation Hybrid Mapping. Genomic DNA from each 
hybrid in the Stanford G3 and Genebridge4 Radiation Hybrid 
Panels (Research Genetics, Huntsville, AL) and human and 
hamster control DNAs were PCR-amplified, and the results 
were submitted to the Stanford or Massachusetts Institute of 
Technology web servers. 

Cell Lines, Tumors, and Mucosa Specimens. Tissue speci- 
mens were obtained from the Department of Pathology (Uni- 
versity of Pittsburgh) for patients undergoing colon resection 
and from the University of Leeds, United Kingdom. Genomic 
DNA was isolated (Qiagen) from the pooled blood of 10 
normal human donors, surgical specimens, and the following 
ATCC human cell lines: SW480, COLO 320DM, HT-29, 
WiDr, and SW403 (colon adenocarcinomas), SW620 (lymph 
node metastasis, colon adenocarcinoma), HCT 116 (colon 
carcinoma), SK-CO-1 (colon adenocarcinoma, ascites), and 
HM7 (a variant of ATCC colon adenocarcinoma cell line LS 
174T). DNA concentration was determined by using Hoechst 
dye 33258 intercalation f luorimetry. Total RNA was prepared 
by homogenization in 7 M GuSCN followed by centrifugation 
over CsCl cushions or prepared by using RNAzol. 

Gene Amplification and RNA Expression Analysis. Relative 
gene amplification and RNA expression of WISPs and c-myc in 
the cell lines, colorectal tumors, and normal mucosa were 
determined by quantitative PCR. Gene-specific primers and 
fluorogenic probes (sequences available on request) were 
designed and used to amplify and quantitate the genes. The 
relative gene copy number was derived by using the formula 
2<*ct) where ACt represents the difference in amplification 
cycles required to detect the WISP genes in peripheral blood 
lymphocyte DNA compared with colon tumor DNA or colon 
tumor RNA compared with normal mucosal RNA. The 
d-method was used for calculation of the SE of the gene copy 
number or RNA expression level. The M / /5/ J -specific signal was 
normalized to that of the glyceraldehyde-3-phosphate dehy- 
drogenase housekeeping gene. All TaqMan assay reagents 
were obtained from Perkin-EImer Applied Biosystems. 

RESULTS 

Isolation of WISP- 1 and WISP-2 by SSH. To identify Wnt- 
1 -inducible genes, we used the technique of SSH using the 
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mouse mammary epithelial cell line C57MG and C57MG cells 
that stably express Wnt-1 (11). Candidate differentially ex- 
pressed cDNAs (1,384 total) were sequenced. Thirty-nine 
percent of the sequences matched known genes or homo- 
logues, 32% matched expressed sequence tags, and 29% had 
no match. To confirm that the transcript was differentially 
expressed, semiquantitative reverse transcription-PCR and 
Northern analysis were performed by using mRNA from the 
C57MG and C57MG/ Wnt-1 cells. 

Two of the cDNAs, WISP-1 and WISP-2, were differentially 
expressed, being induced in the C57MG/Wnt-1 cell line, but 
not in the parent C57MG cells or C57MG cells overexpressing 
Wnt-4 (Fig. 1 A and B). Wnt-4, unlike Wnt-1, does not induce 
the morphological transformation of C57MG cells and has no 
effect on /3-catenin levels (13, 14). Expression of WISP- 1 was 
up-regulated approximately 3-fold in the C57MG/Wnt-1 cell 
line and WISP-2 by approximately 5-fold by both Northern 
analysis and reverse transcription-PCR. 

An independent, but similar, system was used to examine 
WISP expression after Wnt-1 induction. C57MG cells express- 
ing the Wnt-I gene under the control of a tetracycline- 
repressible promoter produce low amounts of Wnt-1 in the 
repressed state but show a strong induction of Wnt-1 mRNA 
and protein within 24 hr after tetracycline removal (8). The 
levels of Wnt-1 and WISP RNA isolated from these cells at 
various times after tetracycline removal were assessed by 
quantitative PCR. Strong induction of Wnt-1 mRNA was seen 
as early as 10 hr after tetracycline removal. Induction of WISP 
mRNA (2- to 6-fold) was seen at 48 and 72 hr (data not shown). 
These data support our previous observations that show that 
WISP induction is correlated with Wnt-1 expression. Because 
the induction is slow, occurring after approximately 48 hr, the 
induction of WISPs may be an indirect response to Wnt-1 
signaling. 

cDNA clones of human WISP-1 were isolated and the 
sequence compared with mouse WISP-1. The cDNA sequences 
of mouse and human WISP-1 were 1,766 and 2,830 bp in length, 
respectively, and encode proteins of 367 aa, with predicted 
relative molecular masses of 40,000 (M r 40 K). Both have 
hydrophobic N-terminal signal sequences, 38 conserved cys- 
teine residues, and four potential N-Unked glycosylation sites 
and are 84% identical (Fig. 2A ). 

Full-length cDNA clones of mouse and human WISP-2 were 
1,734 and 1,293 bp in length, respectively, and encode proteins 
of 251 and 250 aa, respectively, with predicted relative molec- 
ular masses of 27,000 (Af r 27 K) (Fig. 25). Mouse and human 
WISP-2 are 73% identical. Human WISP-2 has no potential 
N-Iinked glycosylation sites, and mouse WISP-2 has one at 
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Fig. 1. WISP-1 and WISP-2 are induced by Wnt-1, but not Wnt-4, 
expression in C57MG cells. Northern analysis of WISP-l (A) and 
WISP-2 (B) expression in C57MG, C57MG/WnM, and C57MG/ 
Wnt-4* cells. Poly(A) + RNA (2 jig) was subjected to Northern blot 
analysis and hybridized with a 70-bp mouse WISP- 1 -specific probe 
(amino acids 278-300) or a 190-bp W , 75/*-2-specific probe (nucleotides 
1438-1627) in the 3' untranslated region. Blots were rehybridized with 
human 0-actin probe. 
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Fig. 2. Encoded amino acid sequence alignment of mouse and 
human WJSP-1 (A) and mouse and human WISP-2 (B). The potential 
signal sequence, insulin-like growth factor-binding protein (IGF-BP), 
VWC, thrombospondin (TSP), and C-terminal (CT) domains are 
underlined. 

position 197. WISP-2 has 28 cysteine residues that are con- 
served among the 38 cysteines found in WISP-1. 

Identification of WISP-3. To search for related proteins, we 
screened expressed sequence tag (EST) databases with the 
WISP-1 protein sequence and identified several ESTs as 
potentially related sequences. We identified a homologous 
protein that we have called WISP-3, A full-length human 
WISP-3 cDNA of 1,371 bp was isolated corresponding to those 
ESTs that encode a 354-aa protein with a predicted molecular 
mass of 39,293. WISP-3 has two potential N-linked glycosyl- 
ation sites and 36 cysteine residues. An alignment of the three 
* human WISP proteins shows that WISP-1 and WISP-3 are the 
most similar (42% identity), whereas WISP 2 has 37% identity 
with WISP-1 and 32% identity with WISP-3 (Fig. 3A). 

WISPs Are Homologous to the CTGF Family of Proteins. 
Human WISP-l y WISP-2, and WISP-3 are novel sequences; 
however, mouse WISP-1 is the same as the recently identified 
Elml gene. Elml is expressed in low, but not high, metastatic 
mouse melanoma ceils, and suppresses the in vivo growth and 
metastatic potential of K-1735 mouse melanoma cells (15). 
Human and mouse WISP-2 are homologous to the recently 
described rat gene, rCop-1 (16). Significant homology (36- 
44%) was seen to the CCN family of growth factors. This family 
includes three members, CTGF, Cyr61, and the protoonco- 
gene nov, CTGF is a chemotactic and mitogen ic factor for 
fibroblasts that is implicated in wound healing and fibrotic 
disorders and is induced by TGF-/3 (17). Cyr61 is an extracel- 
lular matrix signaling molecule that promotes cell adhesion, 
proliferation, migration, angiogenesis, and tumor growth (18, 
19). nov (nephroblastoma overexpressed) is an immediate 
early gene associated with quiescence and found altered in 
Wilms tumors (20). The proteins of the CCN family share 
functional, but not sequence, . similarity to Wnt-1. All are 
secreted, cysteine-rich heparin binding glycoproteins that as- 
sociate with the cell surface and extracellular matrix. 

WISP proteins exhibit the modular architecture of the CCN 
family, characterized by four conserved cysteine-rich domains 
(Fig. 3B) (21). The N-terminal domain, which includes the first 
12 cysteine residues, contains a consensus sequence (GCGC- 
CXXC) conserved in most insulin- like growth factor (IGF)- 
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Fig. 3. (A) Encoded amino acid sequence alignment of human 
WISPs. The cysteine residues of WISP-1 and WISP-2 that are not 
present in WISP-3 are indicated with a dot. (B) Schematic represen- 
tation of the WISP proteins showing the domain structure and cysteine 
residues (vertical lines). The four cysteine residues in the VWC domain 
that are absent in WISP-3 are indicated with a dot. (C) Expression of 
WISP mRNA in human tissues. PCR was performed on human 
multiple-tissue cDNA panels (CLONTECH) from the indicated adult 
and fetal tissues. 

binding proteins (BP). This sequence is conserved in WISP-2 
and WISP-3, whereas WISP-1 has a glutamine in the third 
position instead of a glycine. CTGF recently has been shown 
to specifically bind IGF (22) and a truncated nov protein 
lacking the IGF-BP domain is oncogenic (23). The von Wil- 
lebrand factor type C module (VWC), also found in certain 
collagens and mucins, covers the next 10 cysteine residues, and 
is thought to participate in protein complex formation and 
oligomerization (24). The VWC domain of WISP-3 differs 
from all CCN family members described previously, in that it 
contains only six of the 10 cysteine residues (Fig. 3 A and B). 
A short variable region follows the VWC domain. The third 
module, the thrombospondin (TSP) domain is involved in 
binding to sulfated glycoconjugates and contains six cysteine 
residues and a conserved WSxCSxxCG motif first identified in 
thrombospondin (25). The C-terminal (CT) module contain- 
ing the remaining 10 cysteines is thought to be involved in 
dimerizatiori and receptor binding (26). The CT domain is 
present in all CCN family members described to date but is 
absent in WISP-2 (Fig. 3 A and B). The existence of a putative 
signal sequence and the absence of a transmembrane domain 
suggest that WISPs are secreted proteins, an observation 
supported by an analysis of their expression and secretion from 
mammalian cell and baculovirus cultures (data not shown). 

Expression of WISP mRNA in Human Tissues. Tissue- 
specific expression of human WISPs was characterized by PCR 
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analysis on adult and fetal multiple tissue cDNA panels. 
WISP-1 expression was seen in the adult heart, kidney, lung, 
pancreas, placenta, ovary, small intestine, and spleen (Fig. 3C). 
Little or no expression was detected in the brain, liver, skeletal 
muscle, colon, peripheral blood leukocytes, prostate, testis, or 
thymus. WISP-2 had a more restricted tissue expression and 
was detected in adult skeletal muscle, colon, ovary, and fetal 
lung. Predominant expression of WISPS was seen in adult 
kidney and testis and fetal kidney. Lower levels of WISPS 
expression were detected in placenta, ovary, prostate, and 
small intestine. 

In Situ Localization of WISP-1 and WISP-2. Expression of 
WISP-1 and WISP-2 was assessed by in situ hybridization in 
mammary tumors from Wnt-1 transgenic mice. Strong expres- 
sion of WISP-1 was observed in stromal fibroblasts lying within 
the fibrovascular tumor stroma (Fig. 4 A-D). However, low- 
level WISP-1 expression also was observed focally within tumor 
cells (data not shown). No expression was observed in normal 
breast. Like WISP-1, WISP-2 expression also was seen in the 
tumor stroma in breast tumors from Wnt-1 transgenic animals 
(Fig. 4 E-H). However, WISP-2 expression in the stroma was 
in spindle-shaped cells adjacent to capillary vessels, whereas 





Fig. 4. (A, C, E, and G) Representative hematoxylin/eosin-stained 
images from breast tumors in Wnt-1 transgenic mice. The correspond- 
ing dark-field images showing WISP-1 expression are shown in B and 
O. The tumor is a moderately well-differentiated adenocarcinoma 
showing evidence of adenoid cystic change. At low power {A and B) y 
expression of WISP-1 is seen in the delicate branching fibrovascular 
tumor stroma (arrowhead). At higher magnification, expression is seen 
in the stromal(s) fibroblasts (C and D), and tumor cells are negative. 
Focal expression of WISP-1, however, was observed in tumor cells in 
some areas. Images of WISP-2 expression are shown in E-H. At low 
power (E and ^.expression of WISP-2 is seen in cells lying within the 
fibrovascular tumor stroma. At higher magnification, these cells 
appeared to be adjacent to capillary vessels whereas tumor cells are 
negative (G and H). 



the predominant cell type expressing WISP-1 was the stromal 
fibroblasts. 

Chromosome Localization of the WISP Genes. The chro- 
mosomal location of the human WISP genes was determined 
by radiation hybrid mapping panels. WISP-1 is approximately 
3.48 cR from the meiotic marker AFM259xc5 [logarithm of 
odds (lod) score 16.31] on chromosome 8q24.1 to 8q24.3, in the 
same region as the human locus of the novH family member 
(27) and roughly 4 Mbs distal to c-myc (28). Preliminary fine 
mapping indicates that WISP-1 is located near D8S1712 STS. 
WISP-2 is linked to the marker SHGC-33922 (lod = 1,000) on 
chromosome 20ql2-20ql3.1. Human WISPS mapped to chro- 
mosome 6q22-6q23 and is linked to the marker AFM211ze5 
(lod = 1,000). WISPS is approximately 18 Mbs proximal to 
CTGF and 23 Mbs proximal to the human cellular oncogene 
MYB (21, 29).* 

Amplification and Aberrant Expression of WISPs in Human 
Colon Tumors. Amplification of protooncogenes is seen in 
many human tumors and has etiological and prognostic sig- 
nificance. For example, in a variety of tumor types, c-myc 
amplification has been associated with malignant progression 
and poor prognosis (30). Because WISP-1 resides in the same 
general chromosomal location (8q24) as c-myc, we asked 
whether it was a target of gene amplification, and, if so, 
whether this amplification was independent of the c-myc locus. 
Genomic DNA from human colon cancer cell lines was 
. assessed by quantitative PCR and Southern blot analysis. (Fig. 
5 A and B). Both methods detected similar degrees of WISP-1 
amplification. Most cell lines showed significant (2- to 4-fdld) 
amplification, with the HT-29 and WiDr cell lines demonstrat- 
ing an 8-fold increase. Significantly, the pattern of amplifica- 
tion observed did not correlate with that observed for c-myc, 
indicating that the c-myc gene is not part of the amplicon that 
involves the WISP-1 locus. 

We next examined whether the WISP genes were amplified 
in a panel of 25 primary human colon adenocarcinomas. The 
relative WISP gene copy number in each colon tumor DNA 
was compared with pooled normal DNA from 10 donors by 
quantitative PCR (Fig. 6). The copy number of WISP-1 and 
WISP-2 was significantly greater than one, approximately 
2-fold for WISP-1 in about 60% of the tumors and 2- to 4-fold 
for WISP-2 in 92% of the tumors (P < 0.001 for each). The 
copy number for WISPS was indistinguishable from one (P = 
0.166). In addition, the copy number of WISP-2 was signifi- 
cantly higher than that of WISP-1 (P < 0.001). 

The levels of WISP transcripts in RNA isolated from 19 
adenocarcinomas and their matched normal mucosa were 
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Fig. 5. Amplification of WISP-1 genomic DNA in colon cancer cell 
lines. (A) Amplification in cell line DNA was determined by quanti- 
tative PCR. (B) Southern blots containing genomic DNA (10 jtg) 
digested with EcoK\ (WISP-1) ox Xba\ (c-myc) were hybridized with 
a 100-bp human WISP-1 prpbe (amino acids 186-219) or a human 
c-myc probe (located at bp 1901-2000). The WISP and myc gene* are 
detected in normal human genomic DNA after a longer film exposure. 
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Fig. 6. Genomic amplification of WISP genes in human colon . 
tumors. The relative gene copy number of the WISP genes in 25 
adenocarcinomas was assayed by quantitative PCR, by comparing 
DNA from primary human tumors with pooled DNA from 10 healthy 
donors. The data are means ± SEM from one experiment done in 
triplicate. The experiment was repeated at least three times. 

assessed by quantitative PCR (Fig. 7). The level of WISP-1 
RNA present in tumor tissue varied but was significantly 
increased (2- to >25-fold) in 84% (16/19) of the human colon 
tumors examined compared with normal adjacent mucosa. 
Four of 19 tumors showed greater than 10-fold overexpression. 
In contrast, in 79% (15/19) of the tumors examined, WISP-2 
RNA expression was significantly lower in the tumor than the 
mucosa. Similar to WISP-1 , WISPS RNAwasoverexpressed in 
63% (12/19) of the colon tumors compared with the normal 
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Fig. 7. WISP RNA expression in primary human colon tumors 
relative to expression in normal mucosa from the same patient. 
Expression of WISP mRNA in 19 adenocarcinomas was assayed by 
quantitative PCR. The Dukes stage of the tumor is listed under the 
sample number. The data are means ± SEM from one experiment 
done in triplicate. The experiment was repeated at least twice. 



mucosa. The amount of overexpression of WISPS ranged from 
4- to >40-fold. 



DISCUSSION 

One approach to understanding the molecular basis of cancer 
is to identify differences in gene expression between cancer 
cells and normal cells. Strategies based on assumptions that 
steady-state mRNA levels will differ between normal and 
malignant cells have been used to clone differentially ex- 
pressed genes (31). We have used a PCR^based selection 
strategy, SSH, to identify genes selectively expressed in 
C57MG mouse mammary epithelial cells transformed by 
.Wnt-1. 

Three of the genes isolated, WISP-L WISP-2, and WISPS, 
are members of the CCN family of growth factors, which 
includes CTGF, Cyr61, and nov, a family not previously linked 
to Wnt signaling. 

Two independent experimental systems demonstrated that 
WISP induction was associated with the expression of Wnt-1. 
The first was C57MG cells infected with a Wnt-1 retroviral 
vector or C57MG cells expressing Wnt-1 under the control of 
a tetracyline-repressible promoter, and the second was in 
Wnt-1 transgenic mice, where breast tissue expresses Wnt-1, 
whereas normal breast tissue does not. No WISP RNA expres- 
sion was detected in mammary tumors induced by polyoma 
virus middle T antigen (data not shown). These data suggest 
a link between Wnt-1 and WISPs in that in these two situations, 
WISP induction was correlated with Wnt-1 expression. 

It is not clear whether the WISPs are directly or indirectly 
induced by the downstream components of the Wnt-1 signaling 
pathway (i.e., /3-catenin-TCF-l/Lefl). The increased levels of 
WISP RNA were measured in Wnt-l-transformed cells, hours 
or days after Wnt-1 transformation. Thus, WISP expression 
could result from Wnt-1 signaling directly through /3-catenin 
transcription factor regulation or alternatively through Wnt-1 
signaling turning on a transcription factor, which in turn 
regulates WISPs. 

The WISPs define an additional subfamily of the CCN family 
of growth factors. One striking difference observed in the 
protein sequence of WISP-2 is the absence of a CT domain, 
which is present in CTGF, Cyr61, nov, WISP-1, and WISP-3. 
This domain is thought to be involved in receptor binding and 
dimerization. Growth factors, such as TGF-j3, platelet-derived 
growth factor, and nerve growth factor, which contain a cystine 
knot motif exist as dimers (32). It is tempting to speculate that 
WISP-1 and WISP-3 may exist as dimers, whereas WISP-2 
exists as a monomer. If the CT domain is also important for 
receptor binding, WISP-2 may bind its receptor through a 
different region of the molecule than the other CCN family 
members. No specific receptors have been identified for CTGF 
or nov. A recent report has shown that integrin a v fo serves as 
an adhesion receptor for Cyr61 (33). 

The strong expression of WISP-1 and WISP-2 in cells lying 
within the fibrovascular tumor stroma in breast tumors from 
Wnt-1 transgenic animals is consistent with previous obser- 
vations that transcripts for the related CTGF gene are pri- 
marily expressed in the fibrous stroma of mammary tumors 
(34). Epithelial cells are thought to control the proliferation of 
connective tissue stroma in mammary tumors by a cascade of 
growth factor signals similar to that controlling connective 
- tissue formation during wound repair. It has been proposed 
that mammary tumor cells or inflammatory cells at the tumor 
interstitial interface secrete TGF-|31, which is the stimulus for 
stromal proliferation (34). TGF-pl is secreted by a large 
percentage of malignant breast tumors and may be one of the 
growth factors that, stimulates the production of CTGF and 
WISPs in the stroma. 

It was of interest that WISP-1 and WISP-2 expression was 
observed in the stromal cells that surrounded the tumor cells 
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(epithelial cells) in the Wnt-1 transgenic mouse sections of 
breast tissue. This finding suggests that paracrine signaling 
could occur in which the stromal cells could supply. WISP-1 and 
WISP-2 to regulate tumor cell growth on the WISP extracel- 
lular matrix. Stromal cell-derived factors in the extracellular 
matrix have been postulated to play a role in tumor cell 
migration and proliferation (35). The localization of WISP-1 
and WISP-2 in the stromal cells of breast tumors supports this 
paracrine model. 

An analysis of WISP-1 gene amplification and expression in 
human colon tumors showed a correlation between DNA 
amplification and overexpression, whereas overexpression of 
WISP-3 RNA was seen in the absence of DNA amplification. 
In contrast, WISP-2 DNA was amplified in the colon tumors, 
but its mRNA expression was significantly reduced in the 
majority of tumors compared with the expression in normal 
colonic mucosa from the same patient. The gene for human 
WISP'2 was localized to chromosome 20ql2-20ql3, at a region 
frequently amplified and associated with poor prognosis in 
node negative breast cancer and many colon cancers, suggest- 
ing the existence of one or more oncogenes at this locus 
(36-38). Because the center of the 20ql3 amplicon has not yet 
been identified, it is possible that the apparent amplification 
observed for WISP-2 may be caused by another gene in this 
amplicon. 

A recent manuscript on rCop-1, the rat orthologue of 
WISP-2, describes the loss of expression of this gene after cell 
transformation, suggesting it may be a negative regulator of 
growth in cell lines (16). Although the mechanism by which 
WISP-2 RNA expression is down-regulated during malignant 
transformation is unknown, the reduced expression of WISP-2 
in colon tumors and cell lines suggests that it may function as 
a tumor suppressor. These results show that the WISP genes 
are aberrantly expressed in colon cancer and suggest that their 
altered expression may confer selective growth advantage to 
the tumor. 

Members of the Wnt signaling pathway, have been impli- 
cated in the pathogenesis of colon cancer, breast cancer, and 
melanoma, including the tumor suppressor gene adenomatous 
polyposis coli and j3-catenin (39). Mutations in specific regions 
of either gene can cause the stabilization and accumulation of 
cytoplasmic p-catenin, which presumably contributes to hu- 
man carcinogenesis through the activation of target genes such 
as the WISPs. Although the mechanism by which Wnt-1 
transforms cells and induces tumorigenesis is unknown, the 
identification of WISPs as genes that may be regulated down- 
stream of Wnt-1 in C57MG cells suggests they could be 
important mediators of Wnt-1 transformation. The amplifica- 
tion and altered expression patterns of the WISPs in human 
colon tumors may indicate an important role for these genes 
in tumor development. . 
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methods. Peptides AENK or AEQK were dissolved in water, made isotonic with 
NaCl and diluted into RPMI growth medium. T-cell-proliferation assays were 
done essentially as described 20,21 . Briefly, after antigen pulsing (30u,gmr' 
TTCF) with tetrapeptides (l-2mgmr'), PBMCs or EBV-B cells were 
washed in PBS and fixed for 45 s in 0.05% glutaraldehyde. Glycine was .added 
to a final concentration of 0.1 M and the cells were washed five times in RPMI 
1640 medium containing 1% FCS before co-culture with T-cell clones in 
round-bottom 96-well microtitre plates. After 48 h, the cultures were pulsed 
with 1 jxCi of 3 H-thymidine and harvested for scintillation counting 16 h later. 
Predigestion of native TTCF was done by incubating 200 u.g TTCF with 0.25 u,g 
pig kidney Iegumain in 500 u,l 50 mM citrate buffer, pH 5.5, for I h at 37 °C. 
Glycopeptide digestions. The peptides HIDNEEDI, HIDN(N-glucosamine) 
EEDl and HIDNESDl, which are based on the TTCF sequence, and 
QQQHLFGSNVTDCSGNFCLFR(KKK), which is based on human transferrin, 
were obtained by custom synthesis. The three C-terminal lysine residues were 
added to the natural sequence to aid solubility. The transferrin glycopeptide 
QQQHLFGSNVTDCSGNFCLFR was prepared by tryptic (Promega) digestion 
of 5 mg reduced, carboxy- methylated human transferrin followed by 
concanavalin A chromatography 11 . Glyco peptides corresponding to residues 
622-642 and 421-452 were isolated by reverse-phase HPLC and identified by 
mass spectrometry and N- terminal sequencing. The iyophilized transferrin - 
derived peptides were redissolved in 50 mM sodium acetate, pH 5.5, 10 mM 
dithiothreitol, 20% methanol. Digestions were performed for 3 h at 30 °C with 
5-50 mU ml -1 pig kidney Iegumain or B-cell AEP. Products were analysed by 
HPLC or MALDI-TOF mass spectrometry using a matrix of lOmgrnl"' a- 
cyanocinnamic acid in 50% acetonitrile/0.1% TFA and a PerSeptive Biosystems 
Elite STR mass spectrometer set to linear or reflector mode. Internal standar- 
dization was obtained with a matrix ion of 568.13 mass units. 
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Fas ligand (FasL) is produced by activated T cells and natural 
killer cells and it induces apoptosis (programmed cell death) in 
target cells through the death receptor Fas/Apol/CD95 (ref. 1), 
One important role of FasL and Fas is to mediate immune- 
cytotoxic killing of cells that are potentially harmful to the 
organism, such as virus-infected or tumour cells 1 . Here we 
report the discovery of a soluble decoy receptor, termed decoy 
receptor 3 (DcR3), that binds to FasL and inhibits FasL-induced 
apoptosis. The DcR3 gene was amplified in about half of 35 
primary lung and colon tumours studied, and DcR3 messenger 
RNA was expressed in malignant tissue. Thus, certain tumours 
may escape FasL-dependent immune-cytotoxic attack by expres- 
sing a decoy receptor that blocks FasL. 

By searching expressed sequence tag (EST) databases, we identi- 
fied a set of related ESTs that showed homology to the tumour 
necrosis factor (TNF) receptor (TNFR) gene superfamily 2 . Using 
the overlapping sequence, we isolated a previously unknown full- 
length complementary DNA from human fetal lung. We named the 
protein encoded by this cDNA decoy receptor 3 (DcR3). The cDNA 
encodes a 300-amino-acid polypeptide that resembles members of 
the TNFR family (Fig. la): the amino terminus contains a leader 
sequence, which is followed by four tandem cysteine-rich domains 
(CRDs). Like one other TNFR homologue, osteoprotegerin (OPG) 3 , 
DcR3 lacks an apparent transmembrane sequence, which indicates 
that it may be a secreted, rather than a membrane-asscociated, 
molecule. We expressed a recombinant, histidine-tagged form of 
DcR3 in mammalian cells; DcR3 was secreted into the cell culture 
medium, and migrated on polyacrylamide gels as a protein of 
relative molecular mass 35,000 (data not shown). DcR3 shares 
sequence identity in particular with OPG (31%) and TNFR2 
(29%), and has relatively less homology with Fas (17%). All of 
the cysteines in the four CRDs of DcR3 and OPG are conserved; 
however, the carboxy- terminal portion of DcR3 is 101 residues 
shorter. 

We analysed expression of DcR3 mRNA in human tissues by 
northern blotting (Fig. lb). We detected a predominant 1.2-kilobase 
transcript in fetal lung, brain, and liver, and in adult spleen, colon 
and. lung. In addition, we observed relatively high DcR3 mRNA 
expression in the human colon carcinoma cell line SW480. 

To investigate potential ligand interactions of DcR3, we generated 
a recombinant, Fc-tagged DcR3 protein. We tested binding of 
DcR3-Fc to human 293 cells transfected with individual TNF- 
family ligands, which are expressed as type 2 transmembrane 
proteins (these transmembrane proteins have their N termini in 
the cytosol). DcR3-Fc showed a significant increase in binding to 
cells transfected with FasL 4 (Fig. 2a), but not to cells transfected with 
TNF 5 , Apo2L/TRAIL 6,7 , Apo3L/TWEAK*'*, or OPGL/TRANCE/ 
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RANKL 10 "' 2 (data not shown). DcR3-Fc immunoprecipitated shed 
FasL from FasL-transfected 293 cells (Fig. 2b) and purified soluble 
FasL (Fig. 2c), as did the Fc-tagged ectodomain of Fas but not 
TNFR1. Gel- filtration chromatography showed that DcR3-Fc and 
soluble FasL formed a stable complex (Fig. 2d). Equilibrium 
analysis indicated that DcR3-Fc and Fas-Fc bound to soluble 
FasL with a comparable affinity (K d — 0.8 ± 0.2 and 
U±0.1nM, respectively; Fig. 2e), and that DcR3-Fc could 
block nearly all of the binding of soluble FasL to Fas-Fc (Fig. 2e, 
inset). Thus, DcR3 competes with Fas for binding to FasL. 

To determine whether binding of DcR3 inhibits FasL activity, we 
tested the effect of DcR3-Fc on apoptosis induction by soluble 
FasL in Jurkat T leukaemia cells, which express Fas (Fig. 3a). DcR3- 
Fc and Fas-Fc blocked soluble-FasL-induced apoptosis in a 
similar dose-dependent manner, with half-maximal inhibition at 
—0.1 fjLgmr 1 . Time-course analysis showed that the inhibition did 
not merely delay cell death, but rather persisted for at least 24 hours 
(Fig. 3b). We also tested the effect of DcR3-Fc on activation- 
induced cell death (AICD) of mature T lymphocytes, a FasL- 
dependent process 1 . Consistent with previous results' 3 , activation 
of interleukin-2-stimulated CD4-positive T cells with anti-CD3 
antibody increased the level of apoptosis twofold, and Fas-Fc 
blocked this effect substantially (Fig. 3c); DcR3-Fc blocked the 



induction of apoptosis to a similar extent. Thus, DcR3 binding 
blocks apoptosis induction by FasL. 

FasL-induced apoptosis is important in elimination of virus- 
infected cells and cancer cells by natural killer cells and cytotoxic T 
lymphocytes; an alternative mechanism involves perforin and 
granzymes 1,l4 ~"\ Peripheral blood natural killer cells triggered 
marked cell death in Jurkat T leukaemia cells (Fig. 3d); DcR3-Fc 
and Fas-Fc each reduced killing of target cells from —65% to 
—30%, with half-maximal inhibition at — lpLgml" 1 ; the residual 
killing was probably mediated by the perforin/granzyme pathway. 
Thus, DcR3 binding blocks FasL-dependent natural killer cell 
activity. Higher DcR3-Fc and f as-Fc concentrations were required 
to block natural killer cell activity compared with those required to 
block soluble FasL activity, which is consistent with the greater 
potency of membrane-associated FasL compared with soluble 
FasL' 7 . 

Given the role of immune-cytotoxic cells in elimination of 
tumour cells and the fact that DcR3 can act as an inhibitor of 
FasL, we proposed that DcR3 expression might contribute to the 
ability of some tumours to escape immune-cytotoxic attack. As 
genomic amplification frequently contributes to tumorigenesis, we 
investigated whether the DcR3 gene is amplified in cancer. We 
analysed DcR3 gene-copy number by quantitative polymerase chain 
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Figure 1 Primary structure and expression of human DcR3. a, Alignment of the 
amino-acid sequences of DcR3 and of osteoprotegerin (OPG); the C-terminal 101 
residues of OPG are not shown. The putative signal cleavage site (arrow), the 
cysteine-rich domains (CRD 1 -4), and the AHinked glycosylation site (asterisk) are 
shown, b, Expression of DcR3 mRNA. Northern hybridization analysis was done 
using the DcR3 cDNA as a probe and blots of poly(A)* RNA (Clontech) from 
human fetat and adult tissues or cancer cell lines. PBL. peripheral blood 
lymphocyte. 



Figure 2 Interaction of DcR3 with FasL a, 293 cells were transfected with pRK5 
vector (top) or with pRK5 encoding full-length FasL (bottom), incubated with 
DcR3-Fc (solid line, shaded area). TNFR1-Fc (dotted line) or buffer control 
(dashed line) (the dashed and dotted lines overlap), and analysed for binding by 
FACS. Statistical analysis showed a significant difference (P < 0.001 ) between the 
binding of DcR3-Fc to ceils transfected with FasL or pRK5. PE. phycoerythrin- 
labelied cells, b. 293 cells were transfected as in a and metabolically labelled, and 
cell supernatants were immunoprecipitated with Fc-tagged TNFR1, DcR3 or Fas. 
c, Purified soluble FasL (sFasL) was immunoprecipitated with TNFR1 -Fc, DcR3- 
Fc or Fas-Fc and visualized by immunoblot with anti-FasL antibody. sFasL was 
loaded directly for comparison in the right-hand lane, d, Flag-tagged sFasL was 
incubated with DcR3-Fc or with buffer and resolved by gel filtration; column 
fractions were analysed in an assay that detects complexes containing DcR3-Fc 
and sFasL-Flag ; e, Equilibrium binding of DcR3-Fc or Fas-Fc to sFasL-Flag. 
Inset, competition of DcR3-Fc with Fas-Fc for binding to sFasL-Flag. 
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reaction (PCR) 18 in genomic DNA from 35 primary lung and colon 
tumours, relative to pooled genomic DNA from peripheral blood 
leukocytes (PBLs) of 10 healthy donors. Eight of 18 lung tumours 
and 9 of 17 colon tumours showed DcR3 gene amplification, 
ranging from 2- to 18- fold (Fig. 4a, b). To confirm this result, we 
analysed the colon tumour DNAs with three more, independent sets 
of DcR3-based PCR primers and probes; we observed nearly the 
same amplification (data not shown). 

We then analysed DcR3 mRNA expression in primary tumour 
tissue sections by in situ hybridization. We detected DcR3 expres- 
sion in 6 out of 15 lung tumours, 2 out of 2 colon tumours, 2 out of 5 
breast tumours, and i out of 1 gastric tumour (data not shown). A 
section through a squamous-cell carcinoma of the lung is shown in 
Fig. 4c. DcR3 mRNA was localized to infiltrating malignant epithe- 
lium, but was essentially absent from adjacent stroma, indicating 
tumour-specific expression. Although the individual tumour speci- 
mens that we analysed for mRNA expression and gene amplification 
were different, the in situ hybridization results are consistent with 
the finding that the DcR3 gene: is amplified frequently in tumours. 
SW480 colon carcinoma cells, which showed abundant DcR3 
mRNA expression (Fig. lb), also had marked DcR3 gene amplifica- 
tion, as shown by quantitative PCR (fourfold) and by Southern blot 
hybridization (fivefold) (data not shown). 

If DcR3 amplification in cancer is functionally relevant, then 
DcR3 should be amplified more than neighbouring genomic 
regions that are not important for tumour survival. To test this, 
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we mapped the human DcR3 gene by radiation-hybrid analysis; 
DcR3 showed linkage to marker AFM218xe7 (T160), which maps to 
chromosome position 20ql3. Next, we isolated from a bacterial 
artificial chromosome (BAC) library a human genomic clone that 
carries DcR3, and sequenced the ends of the clone s insert. We then 
determined, from the nine colon tumours that showed twofold or 
greater amplification of DcR3, the copy number of the DcR3- 
flanking sequences (reverse and forward) from the BAC, and of 
seven genomic markers that span chromosome 20 (Fig. 4d). The 
DcR3 -linked reverse marker showed an average amplification of 
roughly threefold, slightly less than the approximately fourfold 
amplification of DcR3; the other markers showed little or no 
amplification. These data indicate that DcR3 may be at the 'epi- 
centre' of a distal chromosome 20 region that is amplified in colon 
cancer, consistent with the possibility that DcR3 amplification 
promotes tumour survival 

Our results show that DcR3 binds specifically to FasL and inhibits 
FasL activity. We did not detect DcR3 binding to several other TNF- 
ligand- family members; however, this does not rule out the possi- 
bility that DcR3 interacts with other ligands, as do some other 
TNFR family members, including OPG 2 ' 19 . . 

FasL is important in regulating the immune response; however, 
little is known about how FasL function is controlled. One mechan- 
ism involves the molecule cFLIP, which modulates apoptosis signal- 
ling downstream of Fas 20 . A second mechanism involves proteolytic 
shedding of FasL from the cell surface 17 . DcR3 competes with Fas for 




10- 2 1Q- 1 ; 10° 
Inhibitor (uxj mM) 




Fas DcR3 




80 



15 20 



Time (h) 



10* 1 10° 10 1 
Inhibitor (ug mM) 



Figure 3 Inhibition of FasL activity by DcR3. a t Human Jurkat T leukaemia cells 
were incubated with Flag-tagged soluble FasL (sFasUSngml -1 ) oligomerized 
with anti-Flag antibody (0.1 u.gmr') in the presence of the proposed inhibitors 
DcR3-Fc, Fas-Fc or human IgGl arid assayed for apoptosis (mean ± s.e.m. of 
triplicates), b, Jurkat cells were incubated with sFasL-Flag plus anti-Flag antibody 
as in a. in presence of 1 u,g ml -1 DcR3-Fc (filled circles). Fas-Fc (open circles) or 
human IgGl (triangles), and apoptosis was determined at the indicated time 
points, c. Peripheral blood T cells were stimulated with PHA and interleukin-2. 
followed by control (white bars) or anti-CD3 antibody (filled bars), together with 
phosphate-buffered saline (PBS), human IgGl, Fas-Fc, or DcR3-Fc (10u.g mr 1 ). 
After 16 h, apoptosis of CQ4* cells was determined (mean ± s.e.m. of results from 
five donors), d, Peripheral blood natural killer cells were incubated with 5, Cr- 
labelled Jurkat cells in the presence of DcR3-Fc (filled circles). Fas-Fc (open 
circles) or human lgG1 (triangles), and target-celi death was determined by 
release of 61 Cr (mean ± s.d. for two donors, each in triplicate). 
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Figure 4 Genomic amplification of DcR3 in tumours, a. Lung cancers, comprising 
eight adenocarcinomas (c, d, f, g. h, j, k, r), seven squamous-cell carcinomas (a. e, 
m, n. o, p. q), one non-small-cell carcinoma (b). one small-cell carcinoma (i). and 
one bronchial adenocarcinoma (I). The data are means ± s.d. of 2 experiments 
done in duplicate, b, Colon tumours, comprising 17 adenocarcinomas. Data are 
means ± s.e.m. of five experiments done in duplicate, c. In situ hybridization 
analysis of OcR3 mRNA expression in a squamous-ceil carcinoma of the lung. A 
representative bright-field image (left) and the corresponding dark-held image 
(right) show DcR3 mRNA over infiltrating malignant epithelium (arrowheads). 
Adjacent non-malignant stroma (S), blood vessel (V) and necrotic tumour tissue 
(N) are also shown, d. Average amplification of DcR3 compared with amplifica- 
tion of neighbouring genomic regions (reverse and forward. Rev and Fwd). the 
DcR3-linked marker T160, and other chromosome-20 markers, in the nine colon 
tumours showing DcR3 amplification of twofold or more (b). Data are from two 
experiments done in duplicate. Asterisk indicates P < 0.01 for a Student's Mest 
comparing each marker with DcR3. 
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FasL binding; hence, it may represent a third mechanism of 
extracellular regulation of FasL activity. A decoy receptor that 
modulates the function of the cytokine interleukin-1 has been 
described 21 . In addition, two decoy receptors that belong to the 
TNFR family, DcRl and DcR2, regulate the FasL-related apoptosis- 
inducing molecule Apo2L" Unlike DcRl and DcR2, which are 
membrane-associated proteins, DcR3 is directly secreted into the 
extracellular space. One other secreted TNFR- family member is 
OPG*, which shares greater sequence homology with DcR3 (31%) 
than do DcRl (17%) or DcR2 (19%); OPG functions as a third 
decoy for Apo2L 19 . Thus, DcR3 and OPG define a new subset of 
TNFR- family members that function as secreted decoys to mod- 
ulate ligands that induce apoptosis. Pox viruses produce soluble 
TNFR homologues that neutralize specific TNF-family ligands, 
thereby modulating the antiviral immune response 2 . Our results 
indicate that a similar mechanism, namely, production of a soluble 
decoy receptor for FasL, may contribute to immune evasion by 
certain tumours. □ 



Methods 

Isolation of DcR3 cONA. Several overlapping ESTs in GenBank (accession 
numbers AA025672, AA025673 and W67560) and in Lifeseq™ (Incyte 
Pharmaceuticals; accession numbers 1339238, 1533571, 1533650, 1542861, 
1789372 and 2207027) showed similarity to members of the TNFR family. We 
screened human cDNA libraries by PCR with primers based on the region of 
EST consensus; fetal lung was positive for a product of the expected size. By 
hybridization to a PCR-generated probe based on the ESTs, one positive clone 
(DNA30942) was identified. When searching for potential alternatively spliced 
forms of DcR3 that might encode a transmembrane protein, we isolated 50 
more clones; the codirig"regibns of these clones were identical in size to that of 
the initial clone (data not shown). 

Fc-fusion proteins (immunoadhesins). The entire DcR3 sequence, or the 
ectodomain of Fas or TNFR1, was fused to the hinge and Fc region of human 
IgGl, expressed in insect SF9 cells or in human 293 cells, and purified as 
described 23 . 

Fluorescence-activated cell sorting (FACS) analysis. We transfected 293 
cells using calcium phosphate or Effectene (Qiagen) with pRK5 vector or pRK5 
encoding full-length human FasL 4 (2 u,g), together with pRK5 encoding CrmA 
(2fxg) to prevent cell death. After 16 h, the cells were incubated with 
biotinylated DcR3-Fc or TNFRl-Fc and then with phycoerythrin -conjugated 
streptavidiri (GibcoBRL), and were assayed by FACS. The data were analysed by 
Kolmogorov-Smirnov statistical analysis. There was some detectable staining 
of vector-transfected cells by DcR3-Fc; as these cells express little FasL (data 
not shown),, it is possible that DcR3 recognized some other factor that is 
expressed constitu lively on 293 cells. 

tmmunoprecipitation. Human 293 cells were transfected as above, and 
metabolically labelled with [^cysteine and [ 35 S]methionine (0.5 mCi; 
Amersham). After 16 h of culture in the presence of z-VAD-fmk (10u.M), 
the medium was immunoprecipitated with DcR3-Fc, Fas-Fc or TNFRl-Fc 
(5 ng), followed by protein A-Sepharose (Repligen). The precipitates were 
resolved by SDS-PAGE and visualized on a phosphorimager (Fuji BAS2000). 
Alternatively, purified. Flag-tagged soluble FasL (1 u,g) (Alexis) was incubated 
with each Fc-fusion protein (1 u.g), precipitated with protein A-Sepharose, 
resolved by SDS-PAGE and visualized by immunoblotting with rabbit anti- 
FasL antibody (Oncogene Research). 

Analysis of complex formation. Flag-tagged soluble FasL (25 y,g) was 
incubated with buffer or with DcR3-Fc (40 p-g) for 1.5 h at 24 °C. The reaction 
was loaded onto a Superdex 200 HR 10/30 column (Pharmacia) and developed 
with PBS; 0.6- ml fractions were collected. The presence of DcR3-Fc-FasL 
complex in each fraction was analysed by placing 100 \i\ aliquots into microtitre 
wells precoated with anti-human IgG (Boehringer) to capture DcR3-Fc, 
followed by detection with biotinylated anti-Flag antibody Bio M2 (Kodak) and 
streptavidin-horseradish peroxidase (Amersham). Calibration of the column 
indicated an apparent relative molecular mass of the complex of 420K (data not 
shown), which is consistent with a stoichiometry of two DcR3-Fc homodimers 
to two soluble FasL homotrimers. 

Equilibrium binding analysis. Microtitre wells were coated with anti-human 



IgG, blocked with 2% BSA in PBS. DcR3-Fc or Fas-Fc was added, followed by 
serially diluted Flag-tagged soluble FasL. Bound ligand was detected with anti- 
Flag antibody as above. In the competition assay, Fas-Fc was immobilized as 
, above, and the wells were blocked with excess IgGl before addition of Flag- 
tagged soluble FasL plus DcR3-Fc. 

T-cell AICD. CD3 + lymphocytes were isolated from peripheral blood of 
individual donors using anti-CD3 magnetic beads (Miltenyi Biotech), 
stimulated with phytohaemaggluunin (PHA; 2 \i% ml"') for 24 h, and cultured 
in the presence of interleukin-2 (100 U ml" 1 ) for 5 days. The cells were plated in 
wells coated with anti-CD3 antibody (Pharmingen) and analysed for apoptosis 
16 h later.by FACS analysis of annexin-V-binding of CD4 + cells 24 . 
Natural killer cell activity. Natural killer cells were isolated from peripheral 
blood of individual donors using anti-CD56 magnetic beads (Miltenyi 
Biotech), and incubated for 16 h with 51 Cr-loaded Jurkat cells at an effector- 
to-target ratio of 1:1 in the presence of DcR3-Fc, Fas-Fc or human IgGl. 
Target-cell death was determined by release of 5l Cr in effector-target co- 
cultures relative to release of 5, Cr by detergent lysis of equal numbers of Jurkat 
cells. 

Gene-amplification analysis. Surgical specimens were provided by J. Kern 
(lung tumours) and P. Quirke (colon tumours). Genomic DNA was extracted 
(Qiagen) and the concentration was determined using Hoechst dye 33258 
intercalation fluorometry. Amplification was determined by quantitative PCR 1 * 
using a TaqMan instrument (ABI). The method was validated by comparison of 
PCR and Southern hybridization data for the Myc and HER-2 oncogenes (data 
not shown). Gene-specific primers and fiuorogenic probes were designed on 
the basis of the sequence of DcR3 or of nearby regions identified on a BAC 
carrying the human DcR3 gene; alternatively, primers and probes were based 
on Stanford Human Genome Center marker AFM218xe7 (T160), which is 
linked to DcR3 (likelihood score = 5.4), SHGC-36268 (T159), the nearest 
available marker which maps to -500 kilobases from T 160, and five extra 
markers that span chromosome 20. The DcR3-specific primer sequences were 
5'-CTTCTTCGCGCACGCTG-3' and 5'-ATCACGCCGGCACCAG-3' and the 
fiuorogenic probe sequence was 5 ' - ( FAM - ACACG ATGCGTGCTCCAAGCAG 
AAp-(TAMARA), where FAM is 5' -fluorescein phosphoramidite. Relative 
gene-copy numbers were derived using the formula 2 ucn , where ACT is the 
difference in amplification cycles required to detect DcR3 in peripheral blood 
lymphocyte DNA compared to test DNA. 
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ABC transporters (also known as traffic ATPases) form a large 
family of proteins responsible for the translocation of a variety 
of compounds across membranes of both prokaryotes and 
eukaryotes'. The recently completed Escherichia colt genome 
sequence revealed that the largest family of paralogous £. coli 
proteins is composed of ABC transporters 2 . Many eukaryotic 
proteins of medical significance belong to this : family, such as 
the cystic fibrosis transmembrane conductance regulator (CFTR), 
the P-glycoprotein (or multidrug-resistance protein) and the 
heterodimeric transporter associated with antigen processing 
(Tapl-Tap2). Here we report the crystal structure at 1.5 A resolu- 
tion of HisP, the ATP-binding subunit of the histidine permease, 
which is an ABC transporter from Salmonella typhimurium. We 
correlate the details of this structure with the biochemical, genetic 
and biophysical properties of the wild-type and several mutant 
HisP proteins. The structure provides a basis for understanding 
properties of ABC transporters and of defective CFTR proteins. 

ABC transporters contain four structural domains: two nucleo- 
tide-binding domains (NBDs), which are highly conserved 
throughout the family, and two transmembrane domains 1 . In 
prokaryotes these domains are often separate sub units which are 
assembled into a membrane -bound complex; in eukaryotes the 
domains are generally fused into a single polypeptide chain. The 
periplasmic histidine permease of 5. typhimurium and E. co/i 1 * 3 " 8 is a 
well-characterized ABC transporter that is a good model for this 
superfamily It consists of a membrane-bound complex, HisQMP 2 , 
which comprises integral membrane subunits, HisQ and HisM, and 
two copies of HisP, the ATP-binding subunit. HisP, which has 
properties intermediate between those of integral and peripheral 
membrane proteins 9 , is accessible from both sides of the membrane, 
presumably by its interaction with HisQ and HisM 6 . The two HisP 
subunits form a dimer, as shown by their cooperativity in ATP 
hydrolysis 5 , the requirement for both subunits to be present for 
activity*, and the formation of a HisP dimer upon chemical cross- 
linking. Soluble HisP also forms a dimer*. HisP has been purified 
and characterized in an active soluble form 3 which can be recon- 
stituted into a fully active membrane-bound complex 8 . 

The overall shape of the crystal structure of the HisP monomer is 
that of an T with two thick arms (arm I and arm II); the ATP- 
binding pocket is near the end of arm I (Fig. I). A six-stranded p- 
sheet (p3 and p8.~P 12) spans both arms of the L, with a domain of a 
a- plus P- type structure (pi, P2, P4-P7, al and a2) on one side 
(within arm I) and a domain of mostly ct-hehces (a3^a9) on the 




Figure 1, Crystal structure of HisP. a, View of the dimer along an axis 
perpendicular to its two-fold axis. The top and bottom of the dimer are suggested 
to face towards the periplasmic and cytoplasmic sides, respectively (see text). 
The thickness of arm II is about 25 A. comparable to that of membrane. a-Helices 
are shown in orange and p-sheets in green, b, View along the two-fold axis of the 
HisP dimer, showing the relative displacement of the monomers not apparent in 
a. The (J-strands at the dimer interface are labelled, c, View of one monomer from 
the bottom of arm I, as shown in a. towards arm II, showing the ATP-binding 
pocket a-c. The protein and the bound ATP are" in 'ribbon' and 'ball-and-stick' 
representations, respectively. Key residues discussed in the text are indicated in 
c. These figures were prepared with MOLSCRIPT 29 . N, amino terminus; C. C 
terminus. 
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Gene amplification is a common event in the progression of 
human cancers, and amplified oncogenes have been shown to 
have diagnostic, prognostic and therapeutic relevance. A 
kinetic quantitative polymerase-chain-reaction (PCR) method, 
based on fluorescent TaqMan methodology and a new instru- 
ment (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real-time, was used to quantify 
gene amplification in tumor DNA. Reactions are character- 
ized by the point during cycling when PCR amplification is still 
in the exponential phase, rather than the amount of PCR 
product accumulated after a fixed number of cycles. None of 
the reaction components is limited during the exponential 
phase, meaning that values are highly reproducible in reac- 
tions starting with the same copy number. This greatly 
improves the precision of DNA quantification. Moreover, 
real-time PCR does not require post-PCR sample handling, 
thereby preventing potential PCR-product .carry-over con- 
tamination; it possesses a wide dynamic range of quantifica- 
tion and results in much faster and higher sample throughput. 
The real-time PCR method, was used to develop and validate 
a simple and rapid assay for the detection and quantification 
of the 3 most frequently amplified genes (myc, ccndl and 
erbB2) in breast tumors. Extra copies of myc, ccndl and erbB2 
were observed in 10, 23 and 15%, respectively, of 108 breast- 
tumor DNA; the largest observed numbers of gene copies 
were 4.6, 18.6 and 15.1, respectively. These results correlated 
well with those of Southern blotting. The use of this new 
semi-automated technique will make molecular analysis of 
human cancers simpler and more reliable, and should find 
broad applications in clinical and research settings. Int. J. 
Cancer 78:661 -666, 1998. 
© 1998 miey-Liss. Inc. 

Gene amplification plays an important role in the pathogenesis 
of various solid tumors, including breast cancer, probably because 
over-expression of the amplified target genes confers a selective 
advantage. The first technique used to detect genomic amplification 
was cytogenetic analysis. Amplification of several chromosome 
regions, visualized either as extrachromosoma! double minutes 
(dmins) or as integrated homogeneously staining regions (HSRs), 
are among the main visible cytogenetic abnormalities in breast 
tumors. Other techniques such as comparative genomic hybridiza- 
tion (CGH) (Kallioniemi et al, 1994) have also been used in broad 
searches for regions of increased DNA copy numbers in tumor 
cells, and have revealed some 20 amplified chromosome regions in 
breast tumors. Positional cloning efforts are underway to identify 
the critical gene(s) in each amplified region. To date, genes known 
to be amplified frequently in breast cancers include myc (8q24), 
ccnd\ ( 1 1 q 1 3), and erbBl ( 1 7q 1 2-q2 1 ) (for review, see Bieche and 
Lidereau, 1995). 

Amplification of the myc, ccndl, and erbBl proto-oncogenes 
should have clinical relevance in breast cancer, since independent 
studies have shown that these alterations can be used to identify 
sub-populations with a worse prognosis (Berns et al, 1992; 
Schuuring et ai, 1992; Slamon et al. 1987). Muss et al (1994) 
suggested that these gene alterations may also be useful for the 
prediction and assessment of the efficacy of adjuvant chemotherapy 
and hormone therapy. 

However, published results diverge both in terms of the fre- 
quency of these alterations and their clinical value. For instance, 
over 500 studies in 10 years have failed to resolve the controversy 



surrounding the link suggested by Slamon et al (1987) between 
erbBl amplification and disease progression. These discrepancies 
are partly due to the clinical, histological and ethnic heterogeneity 
of breast cancer, but technical considerations are also probably 
involved. 

Specific genes (DNA) were initially quantified in tumor cells by 
means of blotting procedures such as Southern and slot blotting. 
These batch techniques require large amounts of DNA (5-10 
ug/reaction) to yield reliable quantitative results. Furthermore, 
meticulous care is required at all stages of the procedures to 
generate blots of sufficient quality for reliable dosage analysis. 
Recently, PCR has proven to be a powerful tool for quantitative 
DNA analysis, especially with minimal starting quantities of tumor 
samples (small, early-stage tumors and formalin-fixed, paraffin- 
embedded tissues). 

Quantitative PCR can be performed by evaluating the amount of 
product either after a given number of cycles (end-point quantita- 
tive PCR) or after a varying number of cycles during the 
exponential phase (kinetic quantitative PCR). In the first case, an 
internal standard distinct from the target molecule is required to 
ascertain PCR efficiency. The method is relatively easy but implies 
generating, quantifying and storing an internal standard for each 
gene studied. Nevertheless, it is the most frequently applied 
method to date. 

One of the major advantages of the kinetic method is its rapidity 
in quantifying a new gene, since no internal standard is required (an 
external standard cup^e-is sufficient). Moreover, the kinetic method 
has a wide dynamic range (at least 5 orders of magnitude), giving 
an accurate value for samples differing in their copy number. 
Unfortunately, the method is cumbersome and has therefore been 
rarely used. It involves aliquot sampling of each assay mix at 
regular intervals and quantifying, for each aliquot, the amplifica- 
tion product. Interest in the kinetic method has been stimulated by a 
. novel approach using fluorescent TaqMan methodology and a new 
instrument (ABI Prism 7700 Sequence Detection System) capable 
of measuring fluorescence in real time (Gibson et al, 1996; Heid et 
al, 1996). The TaqMan reaction is based on the 5' nuclease assay 
first described by Holland et al (1991). The latter uses the 5' 
nuclease activity of Taq polymerase to cleave a specific fluorogenic 
oligonucleotide probe during the extension phase of PCR. The 
approach uses dual-labeled fluorogenic hybridization probes (Lee 
et al, 1993). One fluorescent dye, co-valently linked to the 5' end 
of the oligonucleotide, serves as a reporter [FAM (i.e., 6-carboxy- 
fluorescein)] and its emission spectrum is quenched by a second 
fluorescent dye, TAMRA (i.e., 6-carboxy-tetramethyl-rhodamine) 
attached to the 3' end. During the extension phase of the PCR 
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cycle, the fluorescent hybridization probe is hydrolyzed by the 
5'-3' nucleolytic activity of DNA polymerase. Nuclease degrada- 
tion of the probe releases the quenching of FAM fluorescence 
emission, resulting in an increase in peak fluorescence emission. 
The fluorescence signal is normalized by dividing the emission 
intensity of the reporter dye (FAM) by the emission intensity of a 
reference dye {i.e., ROX, 6-carboxy-X-rhodamine) included in 
TaqMan buffer, to obtain a ratio defined as the Rn (normalized 
reporter) for a given reaction tube. The use of a sequence detector 
enables the fluorescence spectra of all 96 wells of the thermal 
cycler to be measured continuously during PCR amplification. 

The real-time PCR method offers several advantages over other 
current quantitative PCR methods (Celi et ai, 1994): (i) the 
probe-based homogeneous assay provides a real-time method for 
detecting only specific amplification products, since specific hybri- 
dation of both the primers and the probe is necessary to generate a 
signal; (ii) the Q (threshold cycle) value used for quantification is 
measured when PCR amplification is still in the log phase of PCR 
product accumulation. This is the main reason why Q is a more 
reliable measure of the starting copy number than are end-point 
measurements, in which a slight difference in a limiting component 
can have a drastic effect on the amount of product; (Hi) use of C t 
values gives a wider dynamic range (at least 5 orders of magni- 
tude), reducing the need for serial dilution; (iv) The real-time PCR 
method is run in a closed-tube system and requires no post-PCR 
sample handling, thus avoiding potential contamination; (v) the 
system is highly automated, since the instrument continuously 
measures fluorescence in all 96 wells of the thermal cycler during 
PCR amplification and the corresponding software processes, and 
analyzes the fluorescence data; (vi) the assay is rapid, as results are 
available just one minute after thermal cycling is complete; (vii) the 
sample throughput of the method is high, since 96 reactions can be 
analyzed in 2 hr, 

Here, we applied this semi -automated procedure to determine 
the copy numbers of the 3 most frequently amplified genes in breast 
tumors (myc, ccndl and er&B2), as well as 2 genes {alb and app) 
located in a chromosome region in which no genetic changes have 
been observed in breast tumors. The results for 108 breast tumors 
were compared with previous Southern-blot data for the same 
samples. 

MATERIAL AND METHODS 
Tumor and blood samples 

Samples were obtained from 1 08 primary breast tumors removed 
surgically from patients at the Centre Rene Huguenin; none of the 
patients had undergone radiotherapy or chemotherapy. Immedi- 
ately after surgery, the tumor samples were placed in liquid 
nitrogen until extraction of high-molecular-weight DNA. Patients 
were included in this study if the tumor sample used for DNA 
preparation contained more than 60% of tumor cells (histological 
analysis). A blood sample was also taken from 18 of the same 
patients. 

DNA was extracted from tumor tissue and blood leukocytes 
according to standard methods. 

Real-time PCR 

Theoretical basis. Reactions are characterized by the point 
during cycling when amplification of the PCR product is first 
detected, rather than by the amount of PCR product accumulated 
after a fixed number of cycles. The higher the starting copy number 
of the genomic DNA target, the earlier a significant increase in 
fluorescence is observed. The parameter Q (threshold cycle) is 
defined as the fractional cycle number at which the fluorescence 
generated by cleavage of the probe passes a fixed threshold above 
baseline. The target gene copy number in unknown samples is 
quantified by measuring C t and by using a standard curve to 
determine the starting copy number. The precise amount of 
genomic DNA (based on optical density) and its quality (i.e., lack 



of extensive degradation) are both difficult to assess. We therefore 
also quantified a control gene (alb) mapping to chromosome region 
4qll-ql3, in which no genetic alterations have been found in 
breast-tumor DNA by means of CGH (Kallioniemi et ai, 1994). 

Thus, the ratio of the copy number of the target gene to the copy 
number of the alb gene normalizes the amount and quality of 
genomic DNA. The ratio defining the level of amplification is 
termed "N'\ and is determined as follows: 

copy number of target gene {app. myc, ccndl, erbBl) 

N — . 

copy number of reference gene {alb) 

Primers, probes, reference human genomic DNA and PCR 
consumables. Primers and probes were chosen with the assistance 
of the computer programs Oligo 4.0 (National Biosciences, Ply- 
mouth, MN), EuGene (Daniben Systems, Cincinnati, OH) and Primer 
Express (Perkin-Elmer Applied Biosystems, Foster City, CA). 

Primers were purchased from DNAgency (Malvern, PA) and 
probes from Perkin-Elmer Applied Biosystems. 

Nucleotide sequences for the oligonucleotide hybridization 
probes and primers are available on request. 

The TaqMan PCR Core reagent kit, Micro Amp optical tubes, 
and MicroAmp caps were from Perkin-Elmer Applied Biosystems. 

Standard-curve construction. The kinetic method requires a 
standard curve. The latter was constructed with serial dilutions of 
specific PCR products, according to Piatak et ai (1993). In 
practice, each specific PCR product was obtained by amplifying 20 
rig of a standard human genomic DNA (Boehringer, Mannheim, 
Germany) with the same primer pairs as those used later for 
real-time quantitative PCR. The 5 PCR products were purified 
using MicroSpin S-400 HR columns (Pharmacia, Uppsala, Swe- 
den) electrophorezed through an acrylamide gel and stained with 
ethidium bromide to check their quality. The PCR products were 
then quantified spectrophotometrically and pooled, and serially 
diluted 10-fold in mouse genomic DNA (Clontech, Palo Alto, CA) 
at a constant concentration of 2 ng/ul. The standard curve used for 
real-time quantitative PCR was based on serial dilutions of the pool 
of PCR products ranging from 10" 7 (10 5 copies of each gene) to 
10"'° (10 2 copies). This series of diluted PCR products was 
aliquoted and stored at -80°C until use. 

The standard curve was validated by analyzing 2 known 
quantities of calibrator human genomic DNA (20 ng and 50 ng). 

PCR amplification. Amplification mixes (50 ul) contained the 
sample DNA (around 20 ng, around 6600 copies of disomic genes), 
10X TaqMan buffer (5 ul), 200 uM dATP, dCTP, dGTP, and 400 
uM dUTP, 5 mM MgCI 2 , 1.25 units of AmpliTaq Gold, 0.5 units of 
AmpErase uracil N-glycosylase (UNG), 200 nM each primer and 
100 nM probe. The thermal cycling conditions comprised 2 min at 
50°C and 10 min at 95°C. Thermal cycling consisted of 40 cycles at 
95°C for 15 s and 65°C for 1 min. Each assay included: a standard 
curve (from 10 5 to I0 2 copies) in duplicate, a no-template control, 
20 ng and 50 ng of calibrator human genomic DNA (Boehringer) in 
triplicate, and about 20 ng of unknown genomic DNA in triplicate 
(26 samples can thus be analyzed on a 96-well microplate). All 
samples with a coefficient of variation (CV) higher than 10% were 
retested. 

All reactions were performed in the ABI Prism 7700 Sequence 
Detection System (Perkin-Elmer Applied Biosystems), which 
detects the signal from the fluorogenic probe during PCR. 

Equipment for real-time detection. The 7700 system has a 
built-in thermal cycler and a laser directed via fiber optical cables 
to each of the 96 sample wells. A charge-coupled-device (CDD) 
camera collects the emission from each sample and the data are 
analyzed automatically. The software accompanying the 7700 
system calculates Q and determines the starting copy number in the 
samples. 



GENE AMPLIFICATION BY REAL-TIME PCR 



Determination of gene amplification. Gene amplification was 
calculated as described above. Only samples with an N value 
higher than 2 were considered to be amplified. 

RESULTS 

To validate the method, real-time PCR was performed on 
genomic DNA extracted from 108 primary breast tumors, and 18 
normal leukocyte DNA samples from some of the same patients. 
The target genes were the myc, ccndl and erbB2 proto-oncogenes, 
and the p-amyloid precursor protein gene (app), which maps to a 
chromosome region (2 lq21.2) in which no genetic alterations have 
been found in breast tumors (Kallioniemi et ai, 1994). The 
reference disomic gene was the albumin gene (alb, chromosome 
4qll-ql3). 
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Validation of the standard curve and dynamic range 
of real- time PCR 

The standard curve was constructed from PCR products serially 
diluted in genomic mouse DNA at a constant concentration of 
2 ng/ul. It should be noted that the 5 primer pairs chosen to analyze 
the 5 target genes do not amplify genomic mouse DNA (data not 
shown). Figure 1 shows the real-time PCR standard curve for the 
alb gene. The dynamic range was wide (at least 4 orders of 
magnitude), with samples containing as few as 10 2 copies or as 
many as 10 5 copies. 

Copy-number ratio of the 2 reference genes (app and a\b) 

The app to alb copy-number ratio was determined in 1 8 normal 
leukocyte DNA samples and all 108 primary breast-tumor DNA 
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samples. We selected these 2 genes because they are located in 2 
chromosome regions (app, 21q21.2; alb, 4qll-ql3) in which no 
obvious genetic changes (including gains or losses) have been 
observed in breast cancers (Kallioniemi e/ ai, 1994). The ratio for 
the 18 normal leukocyte DNA samples fell between 0.7 and 1.3 
(mean 1.02 ± 0.21), and was similar for the 108 primary breast- 
tumor DNA samples (0.6 to 1.6, mean 1.06 ± 0.25), confirming 
that alb and app are appropriate reference disomic genes for 
breast-tumor DNA. The low range of the ratios also confirmed that 
the nucleotide sequences chosen for the primers and probes were 
not polymorphic, as mismatches of their primers or probes with the 
subject's DNA would have resulted in differential amplification. 

myc, ccndl and cxbB2 gene dose in normal leukocyte DNA 

To determine the cut-off point for gene amplification in breast- 
cancer tissue, 18 normal leukocyte DNA samples were tested for 
the gene dose (N), calculated as described in "Material and 
Methods". The N value of these samples ranged from 0.5 to 1.3 
(mean 0.84 ± 0.22) for myc, 0.7 to 1.6 (mean 1.06 ± 0.23) for 
ccndl and 0.6 to ! .3 (mean 0.9 1 ± 0. 1 9) for erbBl. Since N values 
for myc, ccndl and erbB2 in normal leukocyte DNA consistently 
fell between 0.5 and 1.6, values of 2 or more were considered to 
represent gene amplification in tumor DNA. 

myc, ccndl and erb52 gene dose in breast-tumor DNA 

myc. ccndl and erbB2 gene copy numbers in the 108 primary 
breast tumors are reported in Table I. Extra copies of ccndl were 
more frequent (23%, 25/108) than extra copies of erbB2 (15%, 
16/108) and myc (10%, 11/108), and ranged from 2 to 18.6 for 
ccndl, 2 to 15.1 for erbB2, and only 2 to 4.6 for the myc gene. 
Figure 2 and Table II represent tumors in which the ccndl gene was 
amplified 16-fold (T145), 6-fold (T133) and non-amplified (TI18), 
The 3 genes were never found to be co-amplified in the same tumor. 
erbB2 and ccndl were co-amplified in only 3 cases, myc and ccndl 
in 2 cases and myc and erbB2 in 1 case. This favors the hypothesis 
that gene amplifications are independent events in breast cancer. 
Interestingly, 5 tumors showed a decrease of at least 50% in the 
erbB2 copy number (N < 0.5), suggesting that they bore deletions 
of the 17q21 region (the site of erbB2). No such decrease in copy 
number was observed with the other 2 proto-oncogenes. 

. Comparison of gene dose determined by real-time quantitative 
PCR and Southern-blot analysis 

Southern-blot analysis of myc, ccndl and erbB2 amplifications 
had previously been done on the same 108 primary breast tumors. A 
perfect correlation between the results of real-time PCR and 
Southern blot was obtained for tumors with high copy numbers 
(N ^ 5). However, there were cases (I myc, 6 ccndl and 4 erbB2) 
in which real-time PCR showed gene amplification whereas 
Southern-blot did not, but these were mainly cases with low extra 
copy numbers (N from 2 to 2.9). 

DISCUSSION 

The clinical applications of gene amplification assays are 
currently limited, but would certainly increase if a simple, standard^ 
ized and rapid method were perfected. Gene amplification status 
has been studied mainly by means of Southern blotting, but this 
method is not sensitive enough to detect low-level gene amplifica- 
tion nor accurate enough to quantify the full range of amplification 
values. Southern blotting is also time-consuming, uses radioactive 



TABLE I - DISTRIBUTION OF AMPLIFICATION LEVEL (N) FOR myc 
ccndl AND crbBl GENES IN 108 HUMAN BREAST TUMORS 



. Gene 




Amplification level (N) 




<0.5 


0.5-1.9 


2-4.9 


^5 


myc 
ccndl 
erbB2 . 


0 
0 

5 (4.6%) 


97 (89.8%) 

83 (76.9%) 
87 (80.6%) 


11 (10.2%) 
17(15.7%) 
8 (7.4%) 


0 

8(7.4%) 
8 (7.4%) 
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reagents and requires relatively large amounts of high-quality 
genomic DNA, which means it cannot be used routinely in many 
laboratories. An amplification step is therefore required to deter- 
mine the copy number of a given target gene from minimal 
quantities of tumor DNA (small early-stage tumors, cytopuncture 
specimens or formalin-fixed, paraffin-embedded tissues). 

In this study, we validated a PCR method developed for the 
quantification of gene over-representation in rumors. The method, 
based on real-time analysis of PCR amplification, has several 
advantages over other PCR-based quantitative assays such as 
competitive quantitative PCR (Celi et al, 1994). First, the real-time 
PCR method is performed in a closed-tube system, avoiding the 
risk of contamination by amplified products. Re-amplification of 
carryover PCR products in subsequent experiments can also be 
prevented by using the enzyme uracil N-glycosylase (UNG) 
(Longo et al., 1990). The second advantage is the simplicity and 
rapidity of sample analysis, since no post-PCR manipulations are 
required. Our results show that the automated method is reliable. 
We found it possible to determine, in triplicate, the number of 
copies of a target gene in more than 1 00 tumors per day. Third, the 
system has a linear dynamic range of at least 4 orders of magnitude, 
meaning that samples do not have to contain equal starting amounts 
of DNA. This technique should therefore be suitable for analyzing 
formalin-fixed, paraffin-embedded tissues. Fourth, and above all, 
real-time PCR makes DNA quantification much more precise and 
reproducible, since it is based on C, values rather than end-point 
measurement of the amount of accumulated PCR product. Indeed, 
the ABI Prism 7700 Sequence Detection System enables C, to be 
calculated when PCR amplification is still in the exponential phase 
and when none of the reaction components is rate-limiting. The 
within-run CV of the C t value for calibrator human DNA (5 
replicates) was always below 5%, and the between-assay precision 
in 5 different runs was always below 10% (data not shown). In 
addition, the use of a standard curve is not absolutely necessary, 
since the copy number can be determined simply by comparing the 
Q ratio of the target gene with that of reference genes. The results 
obtained by the 2 methods (with and without a standard curve) are 
similar in our experiments (data not shown). Moreover, unlike 
competitive quantitative PCR, real-lime PCR does not require an 
internal control (the design and storage of internal controls and the 
validation of their amplification efficiency is laborious). 

The only potential disavantage of real-time PCR, like all other 
PCR-based methods and solid-matrix blotting techniques (South- 
ern blots and dot blots) is that is cannot avoid dilution artifacts 
inherent in the extraction of DNA from tumor cells contained in 
heterogeneous tissue specimens. Only FISH and immunohistbehem- 
istry can measure alterations on a cell -by-cell basis (Pauletti et ai, 
1996; Slamon et al,, 1989). However; FISH requires expensive 
equipment and trained personnel and is also time-consuming. 
Moreover, FISH does not assess gene expression and therefore 
cannot detect cases in which the gene product is over-expressed in 
the absence of gene amplification, which will be possible in the 
future by real-time quantitative RT-PCR. Immunohistochemistry is 
subject to considerable variations in the hands of different teams, 
owing to alterations of target proteins during the procedure, the 
different primary antibodies and fixation methods used and the 
criteria used to define positive staining. 

The results of this study are in agreement with those reported in 
the literature, (i) Chromosome regions 4qll-ql3 and 21q21.2 
(which bear alb and app, respectively) showed no genetic alter- 
ations in the breast-cancer samples studied here, in keeping with 
the results of CGH (Kallioniemi et ai, 1994). (//) We found that 
amplifications of these 3 oncogenes were independent events, as 
reported by other teams (Berns et ai, 1 992; Borg et ai, 1 992). (iif) 
The frequency and degree of myc amplification in our breast tumor 
DNA series were lower than those of ccndl and erbB2 amplifica- 
tion, confirming the findings of Borg et al. (1 992) and Courjal et al. 
(1997). (iv) The maxima of ccndl and erbB2 over-representation 
were 1 8-fold and 1 5- fold, also in keeping with earlier results (about 
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T118 



27.3 



4605 



26.5 



4365 



T133 



23.2 



61659 



25.2 



10092 



T145 



22.1 



125892 



25.6 



7762 



Figure 2 - ccndl and alb gene dosage by real-time PCR in 3 breast tumor samples: Tl 18 (El 2, C6, black squares), Tl 33 (G 1 1 . B4, red squares) 
and T145 (A8, C8, blue squares). Given the C, of each sample, the initial copy number is inferred from the standard curve obtained during the same 
experimenL Triplicate plots were performed for each tumor sample, but the data for only one are shown here. The results are shown in Table II. 



30-fold maximum) (Bernse/a/., 1992; Borge/aA, 1992; Courjal et 
aL, 1997). (v) The erbBl copy numbers obtained with real-time 
PCR were in good agreement with data obtained with other 
/quantitative PCR-based assays in terms of the frequency and 
degree of amplification (An et aL, 1995; Deng et aL, 1996; Valeron 



et aL, 1996), Our results also correlate well with those recently 
published by Gelmini et aL ( 1 997), who used the TaqMan system to 
measure erbBl amplification in a small series of breast tumors 
(n — 25), but with an instrument (LS-50B luminescence spectrom- 
eter, Perk in-Elmer Applied Biosystems) which only allows end- 
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TABLE II - EXAMPLES OF ccndl GENE DOSAGE RESULTS 
FROM 3 BREAST TUMORS 1 



Tumor 




ccndl 






alb 




Uccndt/alb 


Copy 
number 


Mean 


SD 


Copy 
number 


Mean 


SD 


T118 


4525 






4223 










4605 


4603 


77 


4365 


4325 


89 


1.06 




4678 






4387 








T133 


59821 






9787 










61659 


61100 


1111 


10092 


10137 


375 


6.03 




61821 






10533 








T145 


128563 






7321 










125892 


125392 


3448 


7762 


7672 


316 


16.34 




121722 






7933 









1 For each sample, 3 replicate experiments were performed and the mean 
and the standard deviation (SD) was determined. The level of ccndl gene 
amplification (Nccndl/alb) is determined by dividing the average ccndl 
copy namber value by the average alb copy number value. 



point measurement of fluorescence intensity. Here we report myc 
and ccndl gene dosage in breast cancer by means of quantitative 
PCR. (vi) We found a high degree of concordance between 
real-time quantitative PCR and Southern blot analysis in terms of 
gene amplification, especially for samples with high copy numbers 
(>5-foldj. The slightly higher frequency of gene amplification 
(especially ccndl and erbB2) observed by means of real-time 
quantitative PCR as compared with Southem-blot analysis may be 
explained by the higher sensitivity of the former method. However, 
we cannot rule out the possibility that some tumors with a few extra 



gene copies observed in real-time PCR had additional copies of an 
arm or a whole chromosome (trisomy, tetrasomy or polysomy) 
rather than true gene amplification. These 2 types of genetic 
alteration (polysomy and gene amplification) could be easily 
distinguished in the future by using an additional probe located on 
the same chromosome arm, but some distance from the target gene. 
It is noteworthy that high gene copy numbers have the greatest 
prognostic significance in breast carcinoma (Borg et al, 1992; 
Slamoneva/., 1987). 

Finally, this technique can be applied to the detection of gene 
deletion as well as gene amplification. Indeed, we found a 
decreased copy number of erb&2 (but not of the other 2 proto- 
oncogenes) in several tumors; erbBl is located in a chromosome 
region (17q21) reported to contain both deletions and amplifica- 
tions in breast cancer (Bieche and Lidereau, 1995). 

In conclusion, gene amplification in various cancers can be used 
as a marker of pre-neoplasia, also for early diagnosis of cancer, 
staging, prognostication and choice of treatment. Southern blotting 
is not sufficiently sensitive, and FISH is lengthy and complex. 
Real-time quantitative PCR overcomes both these limitations, and 
is a sensitive and accurate method of analyzing large numbers of 
samples in a short time. It should find a place in routine clinical 
gene dosage. 

ACKNOWLEDGEMENTS 

RL is a research director at the Institut National de la Sante et de 
la Recherche Medicale (FNSERM). We thank the stafTof the Centre 
Rene Huguenin for assistance in specimen collection and patient 
care. 



REFERENCES 



AN, H.X., NlEDERACHER, D.. BECXMANN, M.W.. GOHRING, U.J., SCHARL, A., 

Picard, F., Van Roeyen, C., Schnurch, H.G. and Bender, H.G., erb&2 
gene amplification detected by fluorescent differential polymerase chain 
reaction in paraffin -embedded breast carcinoma tissues. Int. J. Cancer 
(Pred. Oncol.), 64, 29 1 -297 ( 1 995). 

Berns, E.M.J.J., Klijn, J.G.M., Van Putten, W.LJ., Van Staveren, I.L., 
Portengen, H. and Foejcens, J.A., c-myc amplification is a better prognos- 
tic factor than HERHneu amplification in primary breast cancer Cancer 
Res., 52, 1107-1113(1992). 

Bieche, I. and Lidereau, R., Genetic alterations in breast cancer. Genes 
Chrom. Cancer, 14, 227-25 1 ( 1 995). 

Borg, A., Baldetorp, B., Ferno, M., Olsson, H. and Sigurdsson, H., 
c-myc amplification is an independent prognostic factor in post-men opausal 
breast cancer, int. J. Cancer, 51, 687-691 (1992). 

Celi, F.S., Cohen, M.M., Anton arakjs, S.E., Werthejmer, E., Roth, J. 
and Shuldiner, A.R,, Determination of gene dosage by a quantitative 
adaptation of the. polymerase chain reaction (gd-PCR): rapid detection of 
deletions and duplications of gene sequences. Genomics, 21, 304-310 
(1994). 

COURJAL, R, CUNY, M., SlMONY-LAFONTArNE, J., LOUASSON, G., SpEISER, P., 

Zei lunger, R., Rodriguez, C. and Theillet, C, Mapping of DNA 
amplifications at 15 chromosomal localizations in 1875 breast tumors: 
definition of phenotypic groups. Cancer Res., 57, 4360-4367 ( 1 997). 

Deng, G., Yu, M., Chen, L.C., Moore, D., Kurisu, W„ Kallioniemi, A., 
Waldman, F.M., Collins, C. and Smith, H.S., Amplifications of oncogene 
erbB-2 and chromosome 20q in breast cancer determined by differentially 
competitive polymerase chain reaction. Breast Cancer Res. Treat, 40, 
271-281 (1996). 

Gelmini; S., Oriando, C, Sestini, R., Vona, G., Pinzani, R. Ruocco, L. 
and PaZZagli, M., Quantitative polymerase chain reaction-based homoge- 
neous assay with fluorogenic probes to measure c-erB-2 oncogene amplifi- 
cation. Clin. Chem., 43, 752-758 (1997). 

Gibson, U.E.M., Heid, C.A. and Williams, P.M., A novel method for 
real-time quantitative RT-PCR. Genome Res., 6,995-1001 (1996): 

Heid, C.A., Stevens, L, Livak, KJ. and Williams, P.M., Real-time 
quantitative PCR. Genom e Res. . 6, 986-994 ( 1 996). 

Holland, P.M., Abramson, R.D., Watson, R. and Gelfand, D.H., 
Detection of specific polymerase chain reaction product by utilizing the 5' 
to 3' exonuclease activity of Thermus aquaticus DNA polymerase. Proc. 
nat. Acad. Sci. (Wash.), 88, 7276-7280 (1991). 



Kallioniemi, A., Kallioniemi, O.P., Piper, J., Tanner, M., Stokkes, T, 
Chen, L., Smith, H.S., Pinkel, D., Gray, J.W, and Waldman, F.M., 
Detection and mapping of amplified DNA sequences in breast cancer by 
comparative genomic hybridization. Proc. nat. Acad. Sci. (Wash.), 91, 
2156-2160(1994). 

LEE, L.G., Connell, C.R. and Bioch, W., Allelic discrimination by 
nick-translation PCR with -fluorogenic probe. Nucleic Acids Res.. 21, 
3761-3766(1993), 

Longo, N., Berninger, N.S. and Hartley, J.L., Use of uracil DNA 
glycosylase to control carry-over contamination in polymerase chain 
reactions. Gene, 93, 125-128 (1990). 

Muss, H.B., Thor, A.D., Berry, D.A., Kajte, T., Liu, E.T., Koerner, F., 
Cirrincione, C.T., Budman, D.R., Wood, W.C., Barcos, M. and Hender- 
son, I.C., c-erbB-2 expression and response to adjuvant therapy in. women 
with node-positive early breast cancer. New Engl. J. Med., 330, 1260-1266 
(1994). 

Pauletti, G. ( Godolphtn, W., Press, M.F. and Salmon, D.J., Detection and 
quantification of HER-2/neu gene amplification in human breast cancer 
archival material using fluorescence in situ hybridization. Oncogene, 13, 
63-72(1996). 

Piatak, M., Luk, K.C., Williams, B. and Lifson, J.D., Quantitative 
competitive polymerase chain reaction for accurate quantitation of HIV 
DNA and RN A species, Biotechniques. 14, 70-80 ( 1 993). . 

ScHUURrNG, E., Verhoeven, E., Van Tinteren, H., Peterse, J.L., Nunnik, 
B., Thunnissen, F.B.J.M., Devilee, R, Cornelisse, CJ., Van de Vijver, 
M.J., Moou W.J. and Michalides, R.J.A.M., Amplification of genes within 
the chromosome 1 1 q 1 3 region is indicative of poor prognosis in patients 
with operable breast cancer. Cancer Res., 52, 5229-5234 (1992). 

Slamon, DJ., Clark, G.M., Wong, S.G./Levin, W.S., Ullrich, A. and 
McGuire, W.L., Human breast cancer: correlation of relapse and survival 
with amplification of the HEK-Vneu oncogene. Science, 235, 177-182 
(1987). 

Slamon, D.J., Godolphin, W., Jones, L.A., Holt, j;a., Wong, S.G., Keith, 
D.E., Levin, W.J., Stuart, S.G., Udove, J., Ullrich, A. and Press, M.F., 
Studies of the HER-2/neu proto-oncogene in human breast and ovarian 
cancer. Science. 244, 707-7 12(19 89). 

Valeron, PR, Chirino, R., Fernandez, L., Torres, S., Navarro, D., 
Aguiar; J., Cabrera, J. J., Diaz-Chico, B.N. and Diaz-Chico, J.C., 
Validation of a differential PCR and an EL1SA procedure in studying 
HER-2/neu status in breast cancer, int. J. Cancer, 65, 129-133 (1996). 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Group Art Unit 1647 

CERTIFICATE OF EXPRESS MAILING 

I hereby certify that this correspondence is 
being deposited with the United States . 
Postal Service with sufficient postage as . 
first class mail in an envelope addressed to 
Commissioner of Patents, Washington 
D.C 20231 on: 



(Date) 



Commissioner of Patents 

P.O.Box 1450 

Alexandria, VA 22313-1450 

DECLARATION OF AVI ASHKENAZI, Ph J) UNDER 37 CJFJLS" 1.132 
I, Avi Ashkenazi, Ph.D. declare and say as follows: - 

1 . I am Director and Staff Scientist at the Molecular Oncology Department of 
Genentech, Inc., South San Francisco, CA 94080. 

2. I joined Genentech in 1988 as a postdoctoral fellow. Since then, I have 
investigated a variety of cellular signal transduction mechanisms, iricluding apoptosis, and have 
developed technologies to modulate such mechanisms as at means of therapeutic intervention in 
cancer and autoimmune disease. I am currently involved in the investigation of a series of ' 
secreted proteins over-expressed in tumors, with the aim to identify useful targets for the 
development of therapeutic antibodies for cancer treatment. 

3. My scientific Curriculum Vitae, including iny list of publications, is attached to 
and forms part of this Declaration (Exhibit A). 

4. Gene amplification is a process in which chromosomes undergo changes to 
contain multiple copies of certain genes that normally exist as a single copy, and is an important 
factor in the pathophysiology of cancer. Amplification of certain genes (e.g.* Myc or Her2/Neu) 



Applicant : Ashkenazi et al. 

App.No. : 09/903,925 

Filed : July 11,2001 

For : SECRETED AND 

TRANSMEMBRANE 
POLYPEPTIDES AND NUCLEIC 
ACIDS ENCODING THE SAME 

Examiner : Hamud, Fozia M 



gives cancer cells a growth or survival advantage relative to normal cells, and might also provide 
a mechanise of tumor cell resistance to chemotherapy or radiotherapy. . 

5. If gene amplification results in over-expression of the mRNA and the 
corresponding gene product, then it identifies that gene product as a promising target for cancer 
therapy, for example by the therapeutic antibody approach. Even in the absence of over- 
expression of the gene product, amplification of a cancer marker gene - as detected^ for example, 
by the reverse transcriptase TaqMan® PGR or the fluorescence in situ hybridization (FISH) 
assays -is useful in the diagnosis or classification of cancer, or in predicting or monitoring the 
efficacy of cancer therapy. An increase in gene copy number can result not only from 
intrachromosomal changes but also from chromosomal aneuploidy. It is important to understand 
that detection of gene amplification can be used for cancer diagnosis even if the determination 
includes measurement of chromosomal aneuploidy. Indeed, as long as a significant difference 
relative to normal tissue is detected, it is irrelevant if the signal originates from an increase in the 
number of gene copies per chromosome and/6r ail abnormal number of chromosomes. 

6. I understand that according to the Patent Office, absent data demonstrating that 
the increased copy number of a gene in certain types of cancer leads to increased expression of 
its product, gene amplification data are insufficient to provide substantial utility or well 
established utility for the gene product (the encoded polypeptide), or an antibody specifically 
binding the encoded polypeptide. However, even when amplification of a cancer marker gene 
does not result in significant over-expression of the corresponding gene product, this very 
absence of gene product over-expression still provides significant information for cancer 
diagnosis and treatment. Thus, if over-expression of the gene product does not parallel gene 
amplification in certain tumor types but does so in others, then parallel monitoring of gene 
amplification and gene product over-expression enables more accurate tumor classification and 
hence better determination of suitable therapy. In addition, absence of over-expression is crucial 
information for the practicing clinician. If a gene is amplified but the corresponding gene 
product is not over-expressed, the clinician accordingly will decide not to treat a patient with 
agents that target that gene product. 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information or belief are believed to be true, and further that 
these statements were, made with the knowledge that willful false statements and the like so 



made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 



By: n^ J^^^^y Date: 9 j^l Q^> 
Avi Ashkenazi, Ph.D. 



SV 45528 Ivl . 

9/12/03 3:06 PM (39780.7000) 



CURRICULUM VITAE 
AviAshkenazi 
July 2003 



Personal: 

Date of birth: 

Address: 

Phone: 

Fax: 

Email: 



29 November, 1956 

1456 Tarrytown Street, San Mateo, CA 94402 
(650) 578-9199 (home); (650) 225-1853 (office) 
(650) 225-6443 (office) 
aa@gene.com 



Education: 

1983: 
.1986: 



B.S. in Biochemistry, with honors, Hebrew University, Israel 
Ph.D. in Biochemistry, Hebrew University, Israel 



Employment: 

1983-1986: 

1985- 1986: 

1986- 1988: 

1988 -1989: 

1989- 1993: 

1994-1996: 

1996-1997: 

1997.1990: 
1999 -2002: 
2002-present: 



Teaching assistant, undergraduate level course in Biochemistry . 
Teaching assistant, graduate level course on Signal Transduction 
Postdoctoral fellow, Hormone Research Dept., UCSF, and 
Developmental Biology Dept., Genentech, Inc., with J. Ramachandran 
Postdoctoral fellow, Molecular Biology Dept., Genentech, Inc., . 
with D. Capon 

Scientist, Molecular Biology Dept., Genentech, Inc. 
Senior Scientist, Molecular Oncology Dept., Genentech, Inc. 
Senior Scientist and Interim director, Molecular Oncology Dept., 
Genentech, Inc. 

Senior Scientist and preclinical project team leader, Genentech, Inc. 

Staff Scientist in Molecular Oncology, Genentech, Inc. 

Staff Scientist and Director in Molecular Oncology, Genentech, Inc. 



Awards: 

1988: 



First prize, The Boehringer Ingelheim Award 



1 



# 



Editorial: 

Editorial Board Member: Current Biology 
Associate Editor, Clinical Cancer Research. 
Associate Editor, Cancer Biology and Therapy: 

Refereed papers: 

L Gertler, A., Ashkenazi A., and Madar, Z. Binding sites for human growth 

hormone and ovine and bovine prolactins in the mammary gland and liver of the 
lactating cow. Mol Cell Endocrinol 34, 51-57 (1984). 

2. Gertler, A., Shamay, A., Cohen, N., Ashkenazi, A., Friesen, H., Levanon, A., 
Gorecki, M,, Aviv, H , Hadari, D., and Vogel, T. Inhibition of lactogenic 
activities of ovine prolactin and human growth hormone (hGH) by a novel form of 
a modified recombinant hGH. Endocrinology 118, 720-726 (1986). 

3. Ashkenazi/A., Madar, Z., and Gertler, A. Partial purification and characterization 
of bovine mammary gland prolactin receptor. Mol Cell Endocrinol 50, 79-87 
(1987). 

4. AshkenazLA., Pines, M., and Gertler, A. Down-regulation of lactogenic 
hormone receptors in Nb2 lymphoma cells by cholera toxin. Biochemistry 
Internatl. 14, 1065-1072 (1987). 

5. Ashkenazi A., Cohen, R., and Gertler, A. Characterization of lactogen receptors 
in lactogenic hormone-dependent and independent Nb2 lymphoma cell lines. 
pEBSLett. 210, 51-55 (1987). 

6. Ashkenazi, A., Vogel, T., Barash, L, Hadari, D., Levanon, A., Gorecki, M., and 
Gertler, A. Comparative study on in vitro and in vivo modulation of lactogenic 
and somatotropic receptors by native human growth hormone and its modified 
recombinant analog. Endocrinology 121, 414-419 (1987). 

7. Peralta, E., Winslow, J., Peterson, G., Smith, D., Ashkenazi, A., Ramachandran, 
J., Schimerlik, M., and Capon, D. Primary stracture and biochemical properties 
of an M2 muscarinic receptor. Science 236, 600-605 (1987). 

8. Peralta, E. Ashkenazi A., Winslow, J., Smith, Ramachandran, J., and Capon, 
D. J. Distincnt primary structures, ligand-binding properties and tissue-specific 
expression of four human muscarinic acetylcholine receptors. EMBO J. 6, 3923- 
3929(1987). 

9. Ashkenazi, A., Winslow, J., Peralta, E., Peterson, G., Schimerlik, M., Capon, D., 
and Ramachandran, J. An M2 muscarinic receptor subtype coupled to both 
adenylyl cyclase and phosphoinositide turnover. Science 238, 672-675 (1987). 



2 



10. Pines. M. Ashkenazi. A., Cohen-Chapnik, N., Binder, L., and Gertler, A. 
Inhibition of the proliferation of Nb2 lymphoma cells by femtomolar 
concentrations of cholera toxin and partial reversal of the effect by 1 2-o- 
tetradecanoyl-phorbol-13-acetate.y. Cell. Biochem. 37, 119-129 (1988). 

11. Peralta, E. Ashkenazi. A., Winslow, J. Ramachandran, J., and Capon, D. 
Differential regulation of PI hydrolysis and adenylyl cyclase by muscarinic 
receptor subtypes. Nature 334; 434-437 (1988). 

12. Ashkenazi.. A. Peralta. E., Winslow, J., Ramachandran, J., and Capon, D. 
Functionally distinct G proteins couple different receptors to PI hydrolysis in the 
same cell. Cell 56, 487-493 (1989). 

13 . Ashkenazi. A.. Ramachandran, J., and Capon, D. Acetylcholine analogue 
stimulates DNA synthesis in brain-derived cells via specific muscarinic 
acetylcholine receptor subtypes. Nature 340, 146-150 (1989). 

14. Lammare. P.. Ashkenazi. A.. Fleurv. S.. Smith, P., Sekaly, R., and Capon, D. 
The MHC-binding and gpl20-binding domains of CD4 are distinct and separable. 
Science 245, 743-745 (1989). 

15. Ashkenazi.. A.. Presta, L., Marsters, S., Camerato, T., Rosenthal, K., Fendly, B ; , 
and Capon, D. Mapping the CD4 binding site for human irnmunodefflciency 
virus type 1 by alanine-scanning. mutagenesis. Proc. Natl. Acad. Sci. USA. 87, 
7150-7154(1990). 

16. Chamow, S., Peers, P., Bym, R., Mulkerrin, M., Harris, R., Wang, W., Bjorkman, 
P., Capon, P., and Ashkenazi. A. Enzymatic cleavage of a CD4 immunoadhesin 
generates crystallizable, biologically active Fd-like fragments. Biochemistry 29, 
9885-9891 (1990). 

17. Ashkenazi. A.. Smith. P.. Marsters. S., Riddle, L., Gregory, T., Ho, P., and 
Capon, D. Resistance of primary isolates of human immunodefficiency virus type 
1 to soluble CP4 is independent of CP4-rgp 120 binding affinity. Proc. Natl. 
Acad. Sci. USA. 88, 7056-7060(1991). 

18. Ashkenazi. A.. Marsters, S., Capon, D., Chamow, S., Figari., I., Pennica, D.; 
Goeddel., P., Palladino, M., and Smith, P. Protection against endotoxic shock by 
a tumor necrosis factor receptor immunoadhesin. Proc. Natl. Acad. Sci. USA. 88, 
10535-10539(1991). 

19. Moore. J.. McKeating. J.. Huang. Y.. Ashkenazi A ., and Ho, D. Virions of 
primary HTV-1 isolates resistant to sCD4 neutralization differ in sCP4 affinity and 
glycoprotein gpl20 retention from sCD4-sensitive isolates. J. Virol 66, 235-243 
(1992). 



3 



20. Jin, H., Oksenberg, D., Ashkenazi, A., Peroutka, S., Duncan, A., RozmaheL, R., 
Yang, Y., Mengod, G., Palacios, J., and OTDowd, B. Characterization of the 
human 5-hydroxytryptamineiB receptor. J. Biol. Chenu 261 \ 5735-5738 (1992). 

21. Marsters, A., Frutkin, A., Simpson, N., Fendly, B. and Ashkenazi, A. 
Identification of cysteine-rich domains of the type 1 tumor necrosis receptor 
involved in ligand binding. J. Biol Chenu 267, 5747-5750 (1992). 

22. Chamow, S., Kogan, T., Peers, D., Hastings, R., Byrn, R., and Ashkenazi, A. 
Conjugation of sCD4 without loss of biological activity via a novel carbohydrate- 
directed cross-linking reagent. J. Biol Chenu 267, 15916-15922 (1992). 

23. Oksenberg, D., Marsters, A., OTDowd, B., Jin, H., Havlik, S., Peroutka, S., and 
Ashkenazi, A. A single amino-acid difference confers major pharmacologic 
variation between human and rodent 5-HTlB receptors. Nature 360, 161-163 

(1992) . 

24. Haak-Frendscho, M., Marsters, S., Chamow, S., Peers, D., Simpson, N., and 
Ashkenazi, A. Inhibition of interferon y by an interferon y receptor 
immunoadhesin. Immunology 79, 594-599 (1993). 

25. Penica, D., Lam, V., Weber, R., Kohr, W., Basa, L., Spellman, M., Ashkenazi, 
Shire, S., and Goeddel, D. Biochemical characterization of the extracellular 
domain of the 75-kd tumor necrosis factor receptor. Biochemistry 32,3131-3138. 

(1993) . 

26. Barfod, L., Zheng, Y., Kuang, W., Hart, M., Evans^ T., Cerione, R., and 
Ashkenazi, A. Cloning and expression of a human CDC42 GTPase Activating 
Protein reveals a functional SH3-binding domain. Biol Chem. 268, 26059- 
26062 (1993). 

27. Chamow, S., Zhang, D., Tan, X., Mhtre, S., Marsters, S., Peers, D., Byrn, R., 
Ashkenazi. A.« and Yunghans, R. A humanized bispecific immunoadhesin- 
ahtibody that retargets CD3+ effectors to kill HTV-l-infected cells. J. Immunol 
153,4268-4280(1994). 

28. Means, R., Krahtz, S., Luna, J., Marsters, S., and Ashkenazi, A. Inhibition of 
murine erythroid colony formation in vitro by iterferon y and correction by 
interferon y receptor immunoadhesin. Blood 83, 911-915(1994). 

29. Haak-Frendscho, M., Marsters, S., Mordenti, J., Gillet, N., Chen, S., 
andAshkenazh A. Inhibition of TNF by a TNF receptor immunoadhesin: 
comparison with an anti-TNF mAb. J. Immunol. 152, 1347-1353 (1994). 



4 




30. Chamow, S., Kogan, T., Venuti, M., Gadek, T., Peers, D., Mordenti, J., Shak, S., 
and Ashkenazi, A: Modification of CD4 immunoadhesin with monomethoxy- 
PEG aldehyde via reductive alkilation. Bioconj. Cherru 5, 133-140 (1994). 
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Biotechniques 29, 506-512 (2000). 
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2. Ashkenazi. A ., Peralta, E., Winslow, J., Ramachandran, J., and Capon, D. 
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3. Chamow, S., Duliege, A., Ammann, A., Kahn, J., Allen, D., Eichberg, J., Byrn, 
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4. Ashkenazi. A ., Capon, and D. Ward, R. Immunoadhesins. Int. Rev. Immunol 10, 
217-225 (1993). 

5. Ashkenazu A ., and Peralta, E. Muscarinic Receptors. In Handbook of Receptors 
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and Differentiation 10, 66-75 (2003). 
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1. Resistance of primary HIV isolates to CD4 is independent of CD4-gpl20 binding 
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2. Use of immuno-hybrids to extend the half-life of receptors. BBC conference on 
Biopharmaceutical Halflife Extension. New Orleans, LA, June 1992. 

3 . Results with TNF receptor Immunoadhesins for the Treatment of Sepsis. IBC 
conference on Endotoxemia arid Sepsis. Philadelphia, PA, June 1992. 
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Antibody Engineering. San Diego, C A, December 1993. 

5. Tumor necrosis factor receptor: a potential therapeutic for human septic shock. 
American Society for Microbiology Meeting, Atlanta, GA, May 1 993 . 

6. Protective efficiacy of TNF receptor immunoadhesin vs anti-TNF monoclonal 
antibody in a rat model for endotoxic shock. 5th International Congress on TNF. 
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7. Iriterferonry signals via a multisubunit receptor complex that contains two types of 
polypeptide chain. American Association of hnmunologists Conference. San 
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8. hnmunoadhesins: Principles and Applications. Gordon Research Conference on 
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cells. Cambridge Symposium on TNF and Related Cytokines in Treatment of 
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10. Induction of apoptosis by Apo2 Ligand. American Society for Biochemistry and 
Molecular Biology, Symposium on Growth Factors and Cytokine Receptors. New 
Orleans, LA, June, 1996. 

1 L Apo2 ligand, an extracellular trigger of apoptosis, 2nd Clontech Symposium, 
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18. Control of Apo2L signaling by death and decoy receiptors. American Association 
for the Advancement of Science, Philadelphia, PA, February 1998. 

19. Apo2 ligand and its receptors. American Society of Immunologists. San 
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20. Death receptors and ligands. 7th International TNF Congress. Cape Cod, MA, 
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21. Apo2L as a potential therapeutic for cancer. UCLA School of Medicine. LA, 
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22. Apo2L as a potential therapeutic for cancer. Gordon Research Conference on 
Cancer Chemotherapy. New London, NH, July 1998^ 

23; Control of apoptosis by Apo2L. Endocrine Society Conference, Stevenson, WA, 
August 1998. 

24. Control of apoptosis by Apo2L. International Cytokine Society Conference, 
Jerusalem, Israel, October 1998. 
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25. Apoptosis control by death and decoy receptors. American Association for 
Cancer Research Conference, Whistler, BC, Canada, March 1999. 

26. Apoptosis control by death and decoy receptors. American Society for 
Biochemistry and Molecular Biology Conference, San Francisco, CA, May 1999. 

27. Apoptosis control by death and decoy receptors. Gordon Research Conference on 
Apoptosis, New London, NH> June 1999. 

28. Apoptosis control by death and decoy receptors. Arthritis Foundation Research 
Conference, Alexandria GA, Aug 1999. 

29. Safety and anti-tumor activity of recombinant soluble Apo2L/TRAIL. Cold 
Spring Harbor laboratory Symposium on Programmed Cell Death. . Cold Spring 
Harbor, NY, September 1999. , 

30. The Apo2L/TRAEL system: therapeutic potential. American Association for 
Cancer Research, Lake Tahoe, NV, Feb 2000. 

31. Apoptosis and cancer therapy. Stanford University School of Medicine, Stanford, 
CA, Mar 2000. 

32. Apoptosis and cancer therapy. University of Pennsylvania School of Medicine, 
Philadelphia, PA, Apr 2000. 

33 . Apoptosis signaling by Apo2L/TRAIL. International Congress on TNF. 
Trondheim, Norway, May 2000. 

34. The Apo2I7TRAIL system: therapeutic potential. Cap-CURE summit meeting. 
Santa Monica, C A* June 2000. 

35. The Apo2I7TRAIL system: therapeutic potential. MD Anderson Cancer Center. 
Houston, TX, June 2000. 

36. Apoptosis signaling by Apo21/TRAIL. The Protein Society, 14 th Symposium. 
San Diego, CA, August 2000. 

37. Anti-tumor activity of Apo2L/TRAIL. AAPS annual meeting. Indianapolis, IN 
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38. Apoptosis signaling and anti-cancer potential of Apo2I/TRAIL. Cancer Research 
Institute, UC San Francisco, CA, September 2000. 

39. Apoptosis signaling by Apo2L/TRAIL. Kenote address, TNF family 
Minisymposium, NfflL Bethesda, MD, September 2000. 

40. Death receptors: signaling and modulation. Keystone symposium on the 
Molecular basis of cancer. Taos, NM, Jan 2001 . 

41. Preclinical studies of Apo2L/TRAIL in cancer. Symposium on Targeted therapies 
in the treatment of lung cancer. Aspen, CO, Jan 2001. 
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42. Apoptosis signaling by Apx>2L/TRAIL. Wiezmann Institute of Science, Rehovot, 
Israel, March 2001. 

43. Apo2L/TRAIL: Apoptosis signaling and potential for cancer therapy. Weizmann 
Institute of Science, Rehovot, Israel, March 2001. 

44. Targeting death receptors in cancer with Apo2L/TRAIL. Cell Death and Disease 
conference, Nortii Falmouth, MA, Jun 2001. ' 

45. Targeting death receptors in cancer with Apo2L/TRAIL. Biotechnology 
Organization conference, San Diego, C A, Jun 2001. 

46. Apo2L/TRAEL signaling and apoptosis resistance mechanisms. Gordon Research 
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47. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Cleveland Clinic 
Foundation, Cleveland, OH, Oct 2001. 

48. Apoptosis signaling by death receptors: overview. International Society for 
Interferon and Cytokine Research conference, Cleveland, OH* Oct 2001 . 

49. * Apoptosis signaling by death receptors. American Society of Nephrology 

Conference. San Francisco, CA, Oct 200L 

50. Targeting death receptors in cancer. Apoptosis: commercial opportunities. San 
Diego, CA, Apr 2002. 

51. Apo2L/TRAIL signaling and apoptosis resistance mechanisms. Kiirimel Cancer 
Research Center, Johns Hopkins University, Baltimore MD. May 2002. 

52. Apoptosis control by Apo2L/TRAIL. (Keynote Address) University of Alabama 
Cancer Center Retreat, Birmingham, Ab. October 2002. 

53. Apoptosis signaling by Apo2I/TRAIL. (Session co-chair) TNF international, 
conference. San Diego, CA. October 2002. 

54. Apoptosis signaling by Apb2I/TRAIL. Swiss Institute for Cancer Research 
(ISREC). Lausanne, Swizerland. Jari 2003. 

55. Apoptosis induction with Apo2L/TRAIL. Conference on New Targets and 
Innovative Strategies in Cancer Treatment. Monte Carlo. February 2003. 

56. Apoptosis signaling by Apo2L/TRAIL. Heimelin Brain Tumor Center 
Symposium on Apoptosis. Detroit, MI. April 2003. 

57. Targeting apoptosis through death receptors. Sixth Annual Conference on 
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2 I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part ofthe Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins", When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in human tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used technique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor cells at 

— significantly-higher levels^han^in eorrespondmg-noimalJiuman-Gells^^o date^ we- 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 
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that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 
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approximately 200 gene transcripts that are present in human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 
expressed from these differentially expressed gene transcripts and have used these 
antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 




expressed from that mRNA in that cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
mat increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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Paul Polakis, Ph.D. 
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SECOND DECLARATION OF PAUL POLAKIS, Ph.D. 

I, Paul Polakis, Ph.D., declare and say as follows; 

I am currently employed by Genentech 7 Inc. where my job title is Staff . 
Scientist. 

Since joining Genentech in 1999, one of my primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research 
project with a primary focus on identifying tumor cell markers rhat find use 
as targets for both the diagnosis and treatment of cancer in human*. 

As I stated in my previous Declaration dated May 7, 2004 (attached as 
Exhibit A), my laboratory has been employing a variety of techniques, 
including microarray analysis, to identify genes which are differentially 
expressed in human tumor tissue relative to normal human tissue. The 
primary purpose of this research is to identify proteins that are abundantly 
expressed on certain human tumor tissue(s) and that are either (i) not 
expressed, or (ii) expressed at detectably lower levels, on normal tissue(s)i 

In the course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in human tumor tissue 
at significantly higher levels than in normal human tissue. To date, we 
have successfully generated antibodies that bind to 31 of the tumor antigen 
proteins expressed from these differentially expressed gene transcripts and 
have used these antibodies to quantitatively determine the level of 
production of these tumor antigen proteins in both human tumor tissue and 
normal tissue. We have then quantitatively compared the levels of mRNA 
and protein in both the tumor and normal tissues analyzed. The results of 
these analyses are attached herewith as Exhibit B. In Exhibit B, *V means 
that the mRNA or protein was detectably overexpressed in the tumor tissue 
relative to normal tissue and means that no detectable overexpression 
was observed in the tumor tissue relative to normal tissue. 

As shown in Exhibit B, of the 3 1 genes identified as being detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the mRNA level , 28 of them (Le., greater than 90%) are also detectably 
overexpressed in human tumor tissue as compared to normal human tissue 
at the protein level . As such, in the cases where we have been able to 
quantitatively measure both (i) mRNA and (ii) protein levels in both (i) 
tumor tissue and (ii) normal tissue, we have observed that in the vast 
majority of cases, there is a very strong correlation between increases in 
mRNA expression and increases in the level of protein encoded by that 
mRNA. 




6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4-5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion thai for human genes, an increased level of mRNA in a tumor 
tissue relative to a normal tissue more often than not correlates to a similar 
increase in abundance of the encoded protein in the tumor tissue relative ro 
rhe normal tissue. In fact, it remains a generally accepted working 
assumption in molecular biology that increased mRNA levels are more 
often than not predictive of elevated levels of the encoded protein. In fact, 
an entire industry focusing on the research and development of therapeutic 
antibodies to treat a variety of human diseases, such as cancer, operates on 
this working assumption. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be 
true, and further that these statements were made wirh the knowledge that 
willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful statements may jeopardize the validity of the 
application or any patent issued thereon. 
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EXHIBIT A 

DECLARATION OF PAUL POLAKIS, Ph.D. 
I, Paul Polakis, Ph.D., declare and say as follows: 

1. I was awarded a Ph.D. by the Department of Biochemistry of the Michigan 
State University in 1984. My scientific Curriculum Vitae is attached to and forms 
part of this Declaration (Exhibit A). 

2. I am currently employed by Genentech, Inc. where my job title is Staff 
Scientist. Since joining Genentech in 1999, one of ray primary responsibilities has 
been leading Genentech's Tumor Antigen Project, which is a large research project 
with a primary focus on identifying tumor cell markers that find use as targets for 
both the diagnosis and treatment of cancer in humans. 

3. As part of the Tumor Antigen Project, my laboratory has been analyzing 
differential expression of various genes in tumor cells relative to normal cells. 
The purpose of this research is to identify proteins that are abundantly expressed 
on certain tumor cells and that are either (i) not expressed, or (ii) expressed at 
lower levels, on corresponding normal cells. We call such differentially expressed 
proteins "tumor antigen proteins". When such a tumor antigen protein is 
identified, one can produce an antibody that recognizes and binds to that protein. 
Such an antibody finds use in the diagnosis of human cancer and may ultimately 
serve as an effective therapeutic in the treatment of human cancer. 

4. In the course of the research conducted by Genentech's Tumor Antigen 
Project, we have employed a variety of scientific techniques for detecting and 
studying differential gene expression in huiriab tumor cells relative to normal cells, 
at genomic DNA, mRNA and protein levels. An important example of one such 
technique is the well known and widely used teclinique of microarray analysis 
which has proven to be extremely useful for the identification of mRNA molecules 
that are differentially expressed in one tissue or cell type relative to another. In the 
course of our research using microarray analysis, we have identified 
approximately 200 gene transcripts that are present in-human tumor cells at 
significantly higher levels than in corresponding normal human cells. To date, we 
have generated antibodies that bind to about 30 of the tumor antigen proteins 

expressed trom these differentially express^^ene^ansmpts-and-have-used-these 

antibodies to quantitatively determine the level of production of these tumor 
antigen proteins in both human cancer cells and corresponding normal cells. We 
have then compared the levels of mRNA and protein in both the tumor and normal 
cells analyzed. 

5. From the mRNA and protein expression analyses described in paragraph 4 
above, we have observed that there is a strong correlation between changes in the 
level of mRNA present in any particular cell type and the level of protein 
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expressed from that mRNA in Uiat cell type. In approximately 80% of our 
observations we have found that increases in the level of a particular mRNA 
correlates with changes in the level of protein expressed from that mRNA when 
human tumor cells are compared with their corresponding normal cells. 

6. Based upon my own experience accumulated in more than 20 years of 
research, including the data discussed in paragraphs 4 and 5 above and my 
knowledge of the relevant scientific literature, it is my considered scientific 
opinion that for human genes, an increased level of mRNA in a tumor cell relative 
to a normal cell typically correlates to a similar increase in abundance of the 
encoded protein in the tumor cell relative to the normal cell. In fact, it remains a 
central dogma in molecular biology that increased mRNA levels are predictive of 
corresponding increased levels of the encoded protein. While there have been 
published reports of genes for which such a correlation does not exist, it is my 
opinion that such reports are exceptions to the commonly understood general rule 
that increased mRNA levels are predictive of corresponding increased levels of the 
encoded protein. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information or belief are believed to be true, 
and farther that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful 
statements may jeopardize the validity of the application or any patent issued 
thereon. 
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ABSTRACT 

Genetic changes underlie tumor progression and may lead to cancer- 
specHk expression of criticaJ genes. Over 1100 pobUcationi have de- 
scribed the use of comparative genomic hybridization (CGH) to analyre 
the pattern of copy number alterations in cancer, but very few of the genes 
affected are known. Here, we performed high-resolution CGH analysis on 
cDNA microarrays in breast cancer and directly compared copy number ^ 
and mRNA expression levels of 13,824 genes to quantiUte the Impact of 
genomic changes on gene expression. We Identified and mapped the 
boundaries of 24 independent ampttcons, ranging in size from 0.2 to 12 
Mb. Throughout the genome, both high- and low-level copy number 
change* had a subrtantial impact on gene expression, with 44% of the 
highly amplified genes showing overexpression and 105% of the highly 
overexpressed genes being amplified. Statistical analysis with random 
permutation tests identified 270 genes whose expression level* across 14 
samples were systematically attributable to gene amplification. These 
included roost previously described amplified genes in breast cancer and 
many novel targets for genomic alterations, including the HOXB7 gene, 
the presence of which in a novel amplicon at l7q2U was validated in 
10.2% of primary breast cancers and associated with poor patient prog- 
nosis. In conclusion, CGH on cDNA microarrays revealed hundreds of 
novel genes whose overexpression is attributable to gene amplification. 
These genes may provide insights to the clonal evolution and progression 
of breast cancer and highlight promising therapeutic targets. 

INTRODUCTION 

Gene expression patterns revealed by cDNA microarrays have 
facilitated classification of cancers into biologically distinct catego- 
ries, some of which may explain the clinical behavior of the tumors 
(1-6). Despite this progress in diagnostic classification, the molecular 
mechanisms underlying gene expression patterns in cancer have re- 
mained elusive, and the utility of gene expression profiling in the 
identification of specific therapeutic targets remains limited. 

Accumulation of genetic defects is thought to underlie the clonal 
evolution of cancer. Identification of the genes that mediate the effects 
of genetic changes may be important by highlighting transcripts that 
are actively involved in tumor progression. Such transcripts and their 
encoded proteins would be ideal targets for anticancer therapies, as 
demonstrated by the clinical success of new therapies against ampli- 
fied oncogenes, such as ERBB2 and EGFR (7, 8), in breast cancer and 
other solid tumors. Besides amplifications of known oncogenes, over 
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Fig. J. Impact of gene copy number on global gene expression level*. A. percentage of 
over- and underexpressed genes [Y axis) according to copy number ratios (T axis). 
Threshold values used for over- and undcTCxpression were >2JB4 (global upper 7% or 
the cDN A ratios) and <0.4826 (global tower 7% of the expression ratios). B. percentage 
of amplified and deleted genes according to expression ratios. Threshold value* tor 
amplification end deletion were > 1.5 and <C7. 



20 recurrent regions of DNA amplification have been mapped in 
breast cancer by CGH 5 (9, 10). However, these arnplicons are often 
large and poorly defined, and their impact on gene expression remains 
unknown. 

We hypothesized that genome-wide identification of those gene 
expression changes that are attributable to underlying gene copy 
number alterations would highlight transcripts that are actively in- 
volved in the causation or maintenance of the malignant phenotype. 
To identify such transcripts, we applied a combination of cDNA and 
CGH microarrays to: (o) determine the global impact that gene copy 
number variation plays in breast cancer development and progression; 
and (b) identify and characterize those genes whose mRNA expres- 



3 The abbreviations used are: CGH, comparative genomic bybriduarioo; FISH, nuo- 
rcsccncc in situ hybridization; RT-PCR, reverse transcxiption-PCR. 



6240 



GENE EXPRESSION PATTERNS IN BREAST CANCER 




^Genc^cc^n!-^ 
/(«) the co^genom. from .p *omere » X, ^wnj ^^ A *^^^ t^^XtSU ptoo- » . tac*» of d» p-iUo. 

.nd gr«. toe, . ratio of U. B-C genome-*** copy number udyu » • MCMIvCOHm cDNA of 10 i«aeem ek»a is itaown. Rtd hortonlal toe, the 

of ibecDNA denes .long *e Kama, genome. In B. «*» ^TJLT^^T^Z'^r^^T^Mgh, Z <k* imtate fa upper 2%. .ad <to* mi do* 

copy number mtio of 1 .0. In C, individual d*aponU*<« labeled * J°>" J^-f~""" g ItoSSST !mK .nd Tlr gr«* A*,. *e «*1 5% of tb= «pn«ion trio. 

indicated whb a dashed tine. 



20 2132 



sion is most significantly associated with amplification of the corre- 
sponding genomic template. 

MATERIALS AND METHODS 

Breast Cancer Cell line*. Fourteen breast cancer cell lines (BT-20, BT- 
474, HCC1428, Hs578t, MCF7, MDA-361, MDA-436, MDA-453, MDA-468, 
SKBR-3, T-47D, UACC812, ZR-75-1, and ZR-75-30) were obtained from the 
American Type Culture Collection (Manassas, VA). Cells were grown under 
recommended culture conditions. Genomic DNA and raRNA were isolated 
using standard protocols. ' 

Copy Number and Expression Analyses by cDNA Mlcroarravs. The 
preparation and printing of the 13,824 cDNA clones on glass sKdes were 
performed as described (1 1-13). Of these clones, 244 represented uncharac- 
tcrized expressed sequence tags, and the remainder corresponded to known 
genes. CGH experiments on cDNA microarrays were done as described (14, 
15). Briefly, 20 pg of genomic DNA from breast cancer cell lines and normal 
human WBCs were digested for 14-18 h with Ahl and tool (Life Technol- 
ogies, Inc., Rockvffle, MD) and purified by phenol/chloroform extraction. Six 
fig of digested cell line DNAs were labeled with Cy3-dUTP (Amersham 
Pharmacia) and normal DNA with Cy5-dUTP (Amersham Pharmacia) using 
the Bioprime Labeling kit (Life Technologies, Inc.). Hybridization (14, 15) and 
posthybridization washes (13) were done as described. For the expression 
analyses, a standard reference (Universal Human Reference RNA; Stratagene, 
La Jolla, CA) was used in all experiments. Forty M8 of reference RNA were 
labeled with Cy3-dUTP and 3.5 *g of test mRNA with Cy5-dUTP, and the 
labeled cDNAs were hybridized on microarrays as described (13, 15). For both 
mtcroarray analyses, a laser confocal scanner (Agilent Technologies, Palo 
Alto CA) was used to measure the fluorescence intensities at the target 
locations using the DEARRAY software (16). After background subtraction, 
average intensities at each clone in the test hybridization were divided by the 
average intensity of the corresponding clone in the control hybridization. For 
the copy number analysis, the ratios were normalized on the basis of the 
distribution of ratios of all targets on the array and for the expression analysis 
on the basis of 88 housekeeping genes, which were spotted four omes onto the 
array. Low quality measurements (/.*?.. copy number data with mean reference 
intensity <100 fluorescent units, and expression data with both test and 
reference intensity' < 1 00 fluorescent units and/or with spot size <50 units) 
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were excluded from the analysis and were treated as missing values. The 
distributions of fluorescence ratios were used to define outpoints for increased/ 
decreased copy number. Genes with CGH ratio >1.43 (representing the upper 
5% of the CGH ratios across all experiments) were considered to be amplified, 
and genes with ratio <0.73 (representing the lower 5%) were considered to be 

deleted. ^ 

Statistical Analysis of CGH and cDNA Mteroarray Data. To evaluate 
the influence of copy number alterations on gene expression, we applied the 
following statistical approach. CGH and cDNA calibrated intensity ratios were 
log-transformed and normalized using median centering of the values in each 
cell line. Furthermore, cDNA ratios for each gene across all 14 cell lines were 
median centered. For each gene, the CGH data were represented by a vector 
that was labeled I for amplification (ratio, >1.43) and 0 for no amplification. 
Amplification was correlated with gene expression using the signaMo-noise 
statistics (1). We calculated a weight, w r for each gene as follows: 



where m„. <r xl and o*, denote the means and SDs for the expression 
levels for amplified and nonamptified cell lines, respectively. To assess the 
statistical significance of each weight, we performed 10,000 random permu- 
tations of the label vector. The probability that a gene had a larger or equal 
weight by random permutation than the original weight was denoted by ca. A 
low a (<0.05) indicates a strong association between gene expression and 
amplification. 

Genomic Localization of cDNA Clones and Ampttcon Mapping. Each 
cDNA clone on the microarray was assigned to a Unigenc cluster using the 
Unigene Build 141 • A database of genomic sequence alignment mformation 
for mRNA sequences was created from the August 2001 freeze of the Uni- 
versity of California Santa Cruz's GoldenPath database. 7 The chromosome and 
bp positions for each cDNA clone were then retrieved by relating these data 
sets. Amplicons were defined as a CGH copy number ratio >2.0 in at least two 
adjacent clones in two or more cell lines or a CGH ratio >2.0 in at least three 
adjacent clones in a single cell line. The amplicon start and end positions were 



•Internet address: bttp-y/n»earch.iihgrU^ 
7 Internet address: www.gcnome.ucsc.edu. 
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CGH were validated, with 1421, 17ql2-q21.2, 17q22-q23, 20ql3.1, 
and 2Qql3.2 regions being most commonly amplified. Furthermore, 
the boundaries of these amplicons were precisely delineated. In ad- 
dition,, novel amplicons were identified at 9pl3 (38.65-39.25 Mb), 
and 17q213 (52.47-55.80 Mb). 

Direct Identification of Putative Amplification Target Genes. 
The cDNA/CGH microarray technique enables the direct correla- 
tion of copy number and expression data on a gene-by-gene basis 
throughout the genome. We directly annotated high-resolution 
CGH plots with gene expression data using color coding. Fig. 2C 
shows that most of the amplified genes in the MCF-7 breast cancer 
cell line at lpl3, 17q22-q23, and 20ql3 were highly overex- 
pressed. A view of chromosome 7 in the MDA-468 cell line 
implicates EGFR as the most highly overexpTessed and amplified 
gene at 7pl l-p!2 (Fig. 3j4). In BT-474, the two known amplicons 
at 17ql2 and 17q22-q23 contained numerous highly overex- 
pressed genes (Fig. 35). In addition, several genes, including the 
homeobox genes HOXB2 and HOXB7, were highly amplified in a 
previously undescribed independent amplicon at 17q21.3. HOXB7 
was systematically amplified (as validated by FISH, Fig. 3ft inset) 
as well as overexpressed (as verified by RT-PCR, data not shown) 
in BT-474, UACC8I2, and ZR-75-30 cells. Furthermore, this novel 



extended to include neighboring nonamplified clones (ratio, <1.5). The am- 
plicon size determination was partially dependent on local clone density. 

FISH. Dual-color interphase FISH to breast cancer cell lines was done as 
described (17). Bacterial artificial chromosome clone RP11-361K8 was la- 
beled with SpectrumOrange (Vysis, Downers Grove, ILX and Spectrum- 
Orange-labeled probe for EGFR was obtained from Vysis. SpectrumGrcen- 
tabeled chromosome 7 and 17 centromere probes (Vysis) were used as a 
reference. A tissue microarray containing 612 formalin-fixed, paraffin-embed- 
ded primary breast cancers (17) was applied in FISH analyses as described 
( 1 8). The use of these specimens was approved by the Ethics Committee of the 
University of Basel and by the NIH. Specimens containing a 2-fold or higher 
increase in ihe number of test probe signals, as compared with corresponding 
centromere signals, in at least 10% of the tumor cells were considered to be 
amplified. Survival analysis was performed using the Kaplan-Meier method 
and the log-rank test. 

RT-PCR. The HOXB7 expression level was determined relative to 
GAPDH. Reverse transcription and PCR amplification were performed using 
Access RT-PCR System (Promega Corp., Madison, WI) with 10 ng of mRNA 
as a template. HOXB7 primers were 5 '-GAGC AG AGGGACTCGG ACTT-3 ' 
and 5 '-GCGTCAGGT AGCG ATTGTAG-3 

RESULTS 

Global Effect of Copy Number on Gene Expression. 13,824 
arrayed cDNA clones were applied for analysis of gene expression 
and gene copy number (CGH microarrays) in 14 breast cancer cell 
lines. The results illustrate a considerable influence of copy number 
on gene expression patterns. Up to 44% of the highly amplified 
transcripts (CGH ratio, >2.5) were overexpressed (Le.. belonged to 
the global upper 7% of expression ratios), compared with only 6% for 
genes with normal copy number levels (Fig. 1 A). Conversely, 10.5% 
of the transcripts with high-level expression (cDNA ratio, >10) 
showed increased copy number (Fig. IB). Low-level copy number 
increases and decreases were also associated with similar, although 
Jess dramatic, outcomes on gene expression (Fig. 1). 

Identification of Distinct Breast Cancer Amplicons. Base-pair 
locations obtained for 1 1 ,994 cDNAs (86.8%) were used to plot copy 
number changes as a function of genomic position (Fig. 2, Supple- 
ment Fig. A). The average spacing of clones throughout the genome 
was 267 kb. This high-resohition mapping identified 24 independent 
breast cancer amplicons, spanning from 0.2 to 12 Mb of DNA (Table 
I), Several amplification sites detected previously by chromosomal 
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Fig. 3. Annotation of gene expression data on CGH microarray profiles. A. genes in the 
7p» l-p!2 amplicon in the MDA-468 cell line arc highly expressed (red dots) and include 
the EGFR oncogene. S, several genes in the I7ql2, I7q2l.3, and 17q23 amplicons in me 
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centromere probe (green) to MDA-468 W and J/CLW-specific probe [red) and chro- 
mosome 17 c cre io m crc {green) to BT-474 cells {B). 
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Fig. 4. List or 30 genet with a statistically 
significant correlation (o value <0.Q5) between 
gene copy number and gene expression. Name, 
chromosomal location, and the a value tor each 
gene are indicated. The genes have been ordered 
according to their position in the genome. The color 
maps on the right illustrate the copy number and 
expression ratio patterns in the 14 cell tines. The 
key to the color code Is shown at the bottom of the 
graph. Gray squares, missing values. The complete 
list of 270 genet is shown in sopplcmental Fig. B. 
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amplification was validated to be present in 10.2% of 363 primary 
breast cancers by FISH to a tissue microarray and was associated 
with poor prognosis of the patients (P = 0.001). 

Statistical Identification and Characterization of 270 Highly 
Expressed Genes in Amplicons. Statistical comparison of expres- 
sion levels of all genes as a function of gene amplification identified 
270 genes whose expression was significantly influenced by copy 
number across ail 14 cell lines (Fig. 4, Supplemental Fig. B). Accord- 
ing to the gene ontology data,* 91 of the 270 genes represented 
hypothetical proteins or genes with no functional annotation, whereas 
179 had associated functional information available. Of these, 151 
(84%) are implicated in apoptosis, cell proliferation, signal transduc- 
tion, and transcription, whereas 28 (16%) had functional annotations 
that could not be directly linked with cancer. 



DISCUSSION 

The importance of recurrent gene and chromosome copy number 
changes in the development and progression of solid tumors has been 
characterized in >1000 publications applying CGH 9 (9, 10), as well 
as in a large number of other molecular cytogenetic, cytogenetic, and 
molecular genetic studies. The effects of these somatic genetic 
changes on gene expression levels have remained largely unknown, 
although a few studies have explored gene expression changes occur- 
ring in specific amplicons (15, 19-21). Here, we applied genome- 
wide cDNA microarrays to identify transcripts whose expression 
changes were attributable to underlying gene copy number alterations 
in breast cancer. 

The overall impact of copy number on gene expression patterns was 
substantial with the most dramatic effects seen in the case of high- 



Intcmct address: http*//www.gerteontologv.org/. 



* Internet address: http://www.Dcbijilm.nih.gov/emret 
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level copy number increase. Low-level copy number gains and losses 
also had a significant influence on expression levels of genes in the 
regions affected, but these effects were more subtle on a gene-by-gene 
basis than those of high-level amplifications. However, the impact of 
low-level gains on the deregulation of gene expression patterns in 
cancer may be equally important if not more important than that of 
high-level amplifications. Aneuploidy and low-level gains and losses 
of chromosomal arms represent the most common types of generic 
alterations in breast and other cancers and, therefore, have an influ- 
ence on many , genes. Our results in breast cancer extend the recent 
studies on the impact of aneuploidy on global gene expression pat- 
terns in yeast cells, acute myeloid leukemia, and a prostate cancer 
model system (22-24). 

The CGH microarray analysis identified 24 independent breast 
cancer amplicons. We defined the precise boundaries for many am- 
plicons detected previously by chromosomal CGH (9, 10, 25, 26) and 
also discovered novel amplicons that had not been detected previ- 
ously, presumably because of their small size (only 1-2 Mb) or close 
proximity to other larger amplicons. One of these novel amplicons 
involved the homeobox gene region at 17q2l.3 and led to the over- 
expression of the HOXB7 and HOXB2 genes. The homeodornain 
transcription factors are known to be key regulators of embryonic 
development and have been occasionally reported to undergo aberrant 
expression in cancer (27, 28). HOXB7 transfection induced cell pro- 
liferation in melanoma, breast, and ovarian cancer cells and increased 
tumorigenicity and angiogencsis in breast cancer (29-32). The pres- 
ent results imply that gene amplification may be a prominent mech- 
anism for oyerexpressing HOXB7 in breast cancer and suggest mat 
HOXB7 contributes to tumor progression and confers an aggressive 
disease phenotype in breast cancer. This view is supported by our 
finding of amplification of HOXB7 in 10% of 363 primary breast 
cancers, as well as an association of amplification with poor prognosis 
of the patients. 

We carried out a systematic search to identify genes whose 
expression levels across all 14 cell lines were attributable to 
amplification status. Statistical analysis revealed 270 such genes 
(representing -2% of all genes on the array), including not only 
previously described amplified genes, such as HER-2, AfTC, 
ECFRy ribosomal protein s6 kinase, and AIB3, but also numerous 
novel genes such as NRAS-related gene (lpl3), syndecan-2 (8q22), 
and bone morphogenic protein (20ql3.1), whose activation by 
amplification may similarly promote breast cancer progression. 
Most of the 270 genes have not been implicated previously in 
breast cancer development and suggest novel pathogenetic mech- 
anisms. Although we would not expect all of them to be causally 
involved, it is intriguing that 84% of the genes with associated 
functional information were implicated in apoptosis, cell prolifer- 
ation, signal transduction, transcription, or other cellular processes 
that could directly imply a possible role in cancer progression. 
Therefore, a detailed characterization of these genes may provide 
biological insights to breast cancer progression and might lead to 
the development of novel therapeutic strategies. 

In summary, we. demonstrate application of cDNA microarrays 
to the analysis of both copy number and expression levels of over 
12,000 transcripts throughout the breast cancer genome, roughly 
once every 267 kb. This analysis provided: (a) evidence of a 
prominent global influence of copy number changes on gene 
expression levels; (b) a high-resolution map of 24 independent 
amplicons in breast cancer, and (c) identification of a set of 270 
genes, the overexpression of which was statistically attributable to 
gene amplification. Characterization of a novel amplicon at 
17q21-3 implicated amplification and overexpression of the 
HOXB7 gene in breast cancer, including a clinical association 



between HOXB7 amplification and poor patient prognosis. Overall, 
our results illustrate how the identification of genes activated by 
gene amplification provides a powerful approach to highlight 
genes with an important role in cancer as well as to prioritize and 
validate putative targets for therapy development. 
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Microarray analysis reveals a major direct role of 
DNA copy number alteration in the transcriptional 
program of human breast tumors 
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Contributed by Patrick a Brown, August 6, 2002 
Genomic DNA copy number alterations are key 9**** events in 
the development and progression of human cancers. Here we 
report a genome-wide mteroarrey comparative genomic hybrid- 
ization (array CGH) analysis of DNA copy number variation m 
a series of primary human breast tumors. We have profited DNA 
copy number alteration across 6,691 mapped human genes, in 44 
predominantly advanced, primary breast tumors and 10 breast 
cancer cell lines. While the overall patterns of DNA amplification 
and deletion corroborate previous cytogenetic studies, the high- 
resolution (gene-by-gene) mapping of amplicon boundaries and 
the quantitative analysis of amplicon shape provide significant 
improvement in the localisation of candidate oncogenes. Parallel 
microarray measurements of mRNA levels reveal the remarkable 
degree to which variation In gene copy number contributes to 
variation in gene expression In tumor cells. Specifically, we find 
that 62% of highly amplified genes show moderately or highly 
elevated expression, that DNA copy number influences gene ex- 
pression across a wide range of DNA copy number alterations 
(deletion, low-, mid- and high-level amplification), that on average, 
a 2-fold change in DNA copy number Is associated with a corre- 
sponding 1.5-fold change in mRNA levels, and that overall, at least 
12% of all the variation in gene expression among the breast 
tumors is directly attributable to underlying variation In gene copy 
number. These findings provide evidence that widespread DNA 
copy number alteration can lead directly to global deregulation of 
gene expression, which may contribute to the development or 
progression of cancer. 

Conventional cytogenetic techniques, including comparative 
genomic hybridization (CGH) (1), have led to the identifi- 
cation of a number of recurrent regions of DNA copy number 
alteration in breast cancer cell lines and tumors (2-4). While 
some of these regions contain known or candidate oncogenes 
(e.g. FGFR1 (8pll), MYC (8q24), CCKD1 (llql3), ERBB2 
(17ql2), and ZNF217 (20ql3)] and tumor suppressor genes 
[RBI (13ql4) and TP53 (17pl3)] f the relevant gene(s) within 
other regions (e.g. gain of lq. 8q22, and 17q22-24 t and loss of 
8p) remain to be identified. A high-resolution genome-wide 
map, delineating the boundaries of DNA copy number alter- 
ations in tumors, should facilitate the localization and identifi- 
cation of oncogenes and tumor suppressor genes in breast 
cancer. In this study, we have created such a map, using 
array-based CGH (5-7) to profile DNA copy number alteration 
in a series of breast cancer cell lines and primary tumors. 

An unresolved question is the extent to which the widespread 
DNA copy number changes that we and others have identified 
in breast tumors alter expression of genes within involved 
regions. Because we had measured mRNA levels in parallel in 
the same samples (8), using the same DNA nucroarrays. we had 
an opportunity to explore on a genomic scale the relationship 
between DNA copy number changes and gene expression. From 
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this analysis, we have identified a significant impact of wide- 
spread DNA copy number alteration on the tnuiscriptional 
programs of breast tumors. 

Material and Wetfwds 
Tumors 4^ aiUries. Primary breast tu^ 
large (>3 cm), intermediate-grade, infiltrating ductal carcino- 
mas, with more than 50* being lymph node positive. The 
fraction of tumor ceils within specimens averaged at least 50%. 
Details of individual tumors have been published (8, 9), and 
are summarized in Table 1. which is published as supporting 
information on the PNAS web site, www.pnasxrg. Breast cancer 
cell lines were obtained from the American Type Culture 
Collection. Genomic DNA was isolated either using Qiagen 
genomic DNA columns, or by phenol/dik>roform extraction 
followed by etbanol precipitation. 

DNA Labeling and Microarray Hyt>rid1utlefit. Genomic DNA label- 
ing and hybridizations were performed essentially as described 
in Pollack et al (7), with slight mwiifications. Two micrograms 
of DNA was labeled in a total volume of 50 microliters and the 
volumes of all reagents were adjusted accorchn^. 4 Tert* DNA 
(from tumors and cell lines) was fluorescentiy labeled (CyS) and 
hybridized to a human cDNA microarray containing 6,691 
different mapped human genet (Le^ UniGene dusters)- The 
"reference" (labeled with Cy3) for each hybridization was nor- 
mal female leukocyte DNA from a single donor. The fabrication 
of cDNA niicroarrays and the labeling and hybridization of 
mRNA samples have been described (8). 

Data Analysis and Map Positions. Hybridized arrays were scanned 
on a GenePix scanner (Axon Instruments, Foster City, CA), and 
f hjorescence ratios (test/reference) calculated using scanalyze 
software (available at http://rana.lbLgov). Fluorescence ratios 
were normalized for each array by setting the average log 
fluorescence ratio for all array elements equal to 0. Measure- 
ments with f tumescence intensities more than 20% above back- 
ground were considered reliable. DNA copy number profiles 
that deviated significantly from background ratios measured in 
normal genomic DNA control hybridizations were interpreted as 
evidence of real DNA copy number alteration (see Estimating 
Significance of Altered Fluorescence Ratios in the supporting 
information). When indicated, DNA copy number profiles are 
displayed as a moving average (symmetric 5-nearest neighbors). 
Map positions for arrayed human cDNAs were assigned by 
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identifying the starting position of the best and longest match of 
any DN A sequence represented in the corresponding UmGene 
cluster (10) against the "Golden Path" genome assembly 
(http://genonie^icsc,edu/; Oct 7, 2000 Frecxe). For UmGene 
dusters represented by multiple arrayed elements, mean fluo- 
rescence ratios (for all elements representing the same UmGene 
duster) are reported. For mRNA measurements, fluorescence 
ratios are "mean-centered" (L*. reported relative to the mean 
ratio across the 44 tumor samples). The data set described here 
can be accessed in its entirety in the supporting information. 

Results 

We performed CGH on 44 predominantly locally advanced, 
primary breast tumors and 10 breast cancer cell lines, using 
cDNA microarrays containing 6,691 different mapped human 
aenes (Fig. la; also see Materials and Methods for details of 
microarray hybridizations). To take full advantage of the un- 
proved spatial resolution of array CGH, we ordered (fluores- 
cence ratios for) the 6,691 cDNAs according to the "Golden 
Path" (http://genome.ucsc.edu/) genome assembry of the draft 
human genome sequences (11). In so doing, arrayed cDNAs not 
only themselves represent genes of potential interest (e.g^ 
candidate oncogenes within amplicons), but also provide precise 
genetic landmarks for chromosomal regions of amplification and 
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deletion. Parallel analysis of DNA from cell Hues containing 
d ifferent numbers of X chromosomes (Fig. U>), as we did before 
m demonstrated the sensitivity of our method to detect smgte- 
copy loss (45, XO), and 13- (47.XXX). * jW»BJ or 
254 old (49.XXXXX) gains (also see Fig. 5, which is published 
as supporting information on thePNAS web site). Fluorescence 
ratios were linearly proportional to copy number ratiovwnich 
were slightly underestimated, in agreement with previous ob- 
servations (7). Numerous DNA copy number alterations were 
evident in both the breast cancer cell lines and primary tumors 
(Fig. la), detected in the tumors despite the presence of euploid 
non-tumor cell types; the magnitudes of the observed changes 
were generally lower in the tumor samples. DNA copy-number 
alterations were found m every cancer ceil line and tumor, and 
on every human chromosome in at least one sample. Recurrent 
regions of DNA copy number gain and loss were readOy iden- 
tifiable. For example, gains within Iq, 8q, 17q, and 20q were 
observed in a high proportion of breast cancer ceD tines/tumors 
(90%/69%, 100%/47%,100%/60%, and 90%/44%, 
to), as were losses within lp, 3p, 8p, and 13q (80%/24%» 
80%/22%, 80%/22%, and 70%/18%, respectfvery), consistent 
with published cytogenetic studies (refe. 2-4; a complete listing 
of gains/losses is provided in Tables 2 and 3, which are published 
as supporting information on the FNAS web site). The total 
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number of genomic alterations (gains and losses) was found to 
be significantly higher in breast tumors mat were high grade (P « 
0.008), consistent with published CGH data (3), estrogen recep- 
tor negative {P - 0.04), and harboring TP53 mutations (P « 
0.0006) (see Tabie 4,- which is published as supporting informa- 
tion on the PN AS web site). 

The improved spatial resolution of our array CGH analysts is 
illustrated for chromosome 8, which displayed extensive DNA 
copy number alteration in our series. A detailed view of the 
variation in the copy number of 241 genes mapping to chromo- 
some 8 revealed multiple regions of recurrent amplification; 
each of these potentially harbors a different known or previously 
uncharacterized oncogene (Fig. 2a). The complexity of amplicon 
structure is most easily appreciated in the breast cancer cell line 
SKBR3. Although a conventional CGH analysis of 8qin SKBR3 
identified only two distinct regions of amplification (12), we 
observed three distinct regions of high-level amplification (la- 
beled.1-3 in Fig. 2b). For each of these regions we can define the 



boundaries of the interval recurrently amplified in the tumors we 
examined; in each case, known or plausible candidate oncogenes 
can be identified (a description of these regions, as weU as the 
recurrently amplified regions on chromosomes 17 and 20, can be 
found in Figs. 6 and 7, which are published as supporting 
information on the PNAS web site). ■ J(M 

F6r a subset of breast cancer cell lines and tumors (4 and 37, 
respectively); and a subset of arrayed genes &095), mRNA 
levels were quantitatively measured in parallel by using cDNA 
mkroarrays (8). The parallel assessment of mRNA levels is 
useful in the interpretation of DNA copy number changes. For 
example, the highly amplified genes that are also highly ex- 
pressed are the strongest candidate oncogenes within an ampli- 
con. Perhaps more significantly, our parallel analysis of DNA 
copy number changes and mRNA levels provides us the oppor- 
tunity to assess the global impact of widespread DNA copy 
number alteration on gene expression in tumor ceils. 

A strong influence of DNA copy number on gene expression 
is evident in an examination of the pseudocolor representations 



Pollack er*/. 



PNAS | October 1.2002 | voL 99 | *»-20 I 1*965 




031 



>*i 

17ptar 



t 

170* 



ft* 3. Conconianoi between Dl^cw 

» muftreted for breert «n«r cefl dim tumen. Brwrt «ncer eel! lines .ml tumor, ere ^^mm^M 

end mRNA teveb were determined, ere ordered by position along tht chromosome; lejected ow* ere Indicated In color-coded text »ee ng. 2 legenaj. 
Ruoresceiwerettattest/njferert^ 



of DNA copy number and mRNA levels for genes on chromo- 
some 17 (Fig. 3). Tbe overall patterns of gene amplification and 
elevated gene expression are quite concordant; Le„ a significant 
fraction of highly amplified genes appear to be correspondingly 
highly expressed. The concordance between high-level amplifi- 
cation and increased gene expression is not restricted to chro- 
mosome 17. Genome-wide, of 117 high-level DNA amplifica- 
tions (fluorescence ratios >4, and representing 91 different 
genes), 62% (representing 54 different genes; see Table 5, which 
is published as supporting information on the PNAS web site) 
are found associated with at least moderately elevated mRNA 
levels (mean-centered fluorescence ratios >2), and 42% (rep- 
resenting 36 different genes) are found associated with compa- 
rably highly elevated mRNA levels (mean-centered fluorescence 
ratios >4). 

To determine the extent to which DNA deletion and lower- 
level amplification (in addition to high-level amplification) are 
also associated with corresponding alterations in mRNA levels, 
we performed three separate analyses on the complete data set 
(4 cell lines and 37 tumors, across 6,095 genes). First, we 
determined the average mRNA levels for each of five classes 
of genes, representing DNA deletion, no change, and low-, 
medium-, and high-level amplification (Rg. 4a). For both the 



breast cancer cell lines and tumors, average mRNA levels 
tracked with DNA copy number across all five classes, in a 
statistically significant fashion (P values for pairwise Students 
I tests comparing adjacent classes: cell lines, 4 x lfT* 9 , 1 x 10~~, 
5 x 10-'. Wo-<tumors, 1 x 10-<\ 1 x 10-™. 5 x 10-', 
1 x 10-*). A linear regression of the average log(DNA copy 
number), for each class, against average log(mRNA level) 
demonstrated that on average, a 2-fold change in DNA copy 
number was accompanied by 1.4- and 1.5-fold changes in mRNA - 
level for the breast cancer cell lines and turnors, respectively (Fig. 
4a, regression line not shown). Second, we characterized the 
distribution of the 6,095 correlations between DNA copy num- 
ber and mRNA level, each across the 37 tumor samples (Fig. 46). 
The distribution of correlations forms a normal-shaped curve, 
but with the peak markedly shifted in the positive direction from 
zero. This shift is stttistkaHy significant, as evidenced in a plot 
of observed vs. expected correlations (Fig. 4c), and reflects a 
pervasive global influence of DNA copy number alterations on 
gene expression. Notably, the highest ccrolations between DNA 
copy number and mRNA level (die right tail of the distribution 
in Fig. 4d) comprise both amplified and deleted genes (data not 
shown). Third, we used a linear regression model to estimate the 
fraction of all variation measured in mRNA levels among the 37 
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tumors that could be attributed to underlying variation in DNA 
copy number. From this analysis, we estimate that, overall, about 
7% of all of the observed variation In mRNA levels can be 
explained directly by variation in copy number of the altered 
genes (Fig. 4J). We can reduce the effects of experimental 
measurement error on this estimate by using only that fraction 
of the data most reliably measured (fluorescence intensity/ 
background >3); using that data, our estimate of the percent 
variation in mRNA levels directly attributed to variation in gene 
copy number increases to 12% (Fig. 4<Q. This still undoubtedly 
represents a significant underestimate, as the observed variation 
in global gene expression is affected not only by true variation in 
the expression programs of the tumor cells themselves, but also 
by the variable presence of non-tumor cell types within clinical 
samples. 

Discussion 

This genome-wide, array CGH analysis of DNA copy number 
alteration in a scries of human breast tumors demonstrates the 
usefulness of defining ampEcon boundaries at high resolution 
(gene-by-geneX and quantitatively measuring arapltcon shape, to 
assist in locating and identifying candidate oncogenes. By ana- 
lyzing mRNA levels in parallel, we have also discovered that 
changes in DNA copy number have a large, pervasive, direct 
effect on global gene expression patterns in both breast cancer 



cell lines and tumors. Although the DNA microarrays used in our 
analysis may display a bias toward characterized and/or highly 
expressed genes, because we are examining such a large fraction 
of the genome (approximately 20% of all human genes), and 
because, as detailed above, we are likely umlerestiinatmg the 
contribution of DNA copy number changes to altered gene 
expression, we believe our findings are likely to be generahzable 
(but would nevertheless still be remarkable if only applicable to 
this set of —6.100 genes). 

In budding yeast, aneuploidy has been shown to result in 
chromosome-wide gene expression biases (13). Two recent 
studies have begun to examine the global relationship between 
DNA copy number and gene exp r e s sion in cancer cells. In 
agreement with our findings, Phillips et aL (14) have shown that 
with the acquisition of tumorigenicity in an taunortaKzed pros- 
tate epithelial cell line, new chromosomal gains and losses 
resulted in a statistically significant respective increase and 
decrease in the average expression level of involved genes. In 
contrast, Platzer et aL (15) recently reported that in metastatic 
colon tumors only -4% of genes within amplified regions were 
found more highly (>2-fb)d) expressed, when compared with 
normal colonic epithelium. This report differs substantially from 
our finding that 62% of highly amplified genes in breast cancer 
exhibit at least 2-fold increased expression. These contrasting 
findings may reflect methodological differences between the 
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studies. For example, the study of Platzer et al (15) may have 
systematically under-measured gene expression changes. In this 
regard it is remarkable that only 14 transcripts of many thousand 
residing within unampKfsed chromosomal regions were found to 
exhibit at least 4-fold altered expression in metastatic colon 
cancer. Additionally, their reliance on lower-resolution chromo- 
somal OGH may have resulted in poorly delimiting the bound- 
aries of high-complexity ampltcons, effectively overcallmg re- 
gions with amplification. Alternatively, the contrasting findings 
for amplified genes may represent real biological differences 
between breast and metastatic colon tumors; resolution of this 
issue will require further studies. 

Our finding that widespread DNA copy number alteration has 
a large, pervasive and direct effect on global gene expression 
patterns in breast cancer has several important implications. 
First, this finding supports a high degree of copy number- 
dependent gene expression in tumors. Second, it suggests that 
most genes are not subject to specific autoregulation or dosage 
compensation. Third, this finding cautions that elevated expres- 
sion of an amplified gene cannot alone be considered strong 
independent evidence of a candidate oncogene's role in tumor- 
igenesis. In our study, fully 62% of highly amplified genes 
demonstrated moderately or highly elevated expression. This 
highlights the importance of high-resolution mapping of ampli- 
con boundaries and shape [to identify the "driving" gene(s) 
within ampltcons (16)], on a large number of samples, in addition 
to functional studies. Fourth, this finding suggests that analyzing 
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the genomic distribution of expressed genes, even within existing 
microarray gene expression data sets, may permit the inference 
of DNA copy number aberration, particularly aneuplotdy (where 
gene expression can be averaged across huge chromosomal 
regions; see Fig. 3 and supporting mforraatton). Fifth, this 
finding implies that a substantial portion of the phenotypic 
uniqueness (and by extension, the heterogeneity in clinical 
behavior) among patients* tumors may be traceable to underly- 
ing variation in DNA copy number. Shah, this finding supports 
a possible role for widespread DNA copy number alteration in 
tumorigenesis (17, 18), beyond the amplification of specific 
oncogenes and deletion of specific tumor suppressor genes. 
Widespread DNA copy number alteration, and the concomitant 
widespread imbalance in gene expression, might disrupt critical 
stochioroetric relationships in cell metabolism and physiology 
(eg., proteosame, mitotic spindle), possibly promoting further 
chromosomal instability and directly attributing to tumor 
development or progression. Finally, our findings suggest the 
possibility of cancer therapies that exploit specific or global 
imbalances in gene expression & cancer. 
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Each year, over 1 82,000 women in the United States are 
diagnosed with breast cancer, and approximately 45,000 die 
of the disease. 1 Incidence appears to be increasing in the 
United States at a rate of roughly 2% per year. The reasons 
for the increase are unclear, but non-genetic risk factors appear 
to play a large role. 2 

Five-year survival rates range from approximately 65%- 
85%, depending on demographic group, with a significant 
percentage of women experiencing recurrence of their cancer 
within 10 years of diagnosis. One of the factors most predic- 
tive for recurrence once a diagnosis of breast cancer has been 
made is the number of axillary lymph nodes to which tumor 
has metastasized. Most node-positive women are given adju- 
vant therapy, which increases their survival. However, 20%- 
30% of patients without axillary node involvement also 
develop recurrent disease, and the difficulty lies in how to iden- 
tify this high-risk subset of patients. These patients could 
benefit from increased surveillance, early intervention, and 
treatment. 

Prognostic markers currently used in breast cancer recur- 
rence prediction include tumor size, histological grade, steroid 
hormone receptor status, DNA ploidy, proliferative index, and 
cathepsin D status. Expression of growth factor receptors and 
over-expression of the HER-2/neu oncogene have also been 
identified as having value regarding treatment regimen and 
prognosis. 

HER-2/neu (also known as c-erbB2) is an oncogene that 
encodes a transmembrane glycoprotein that is homologous 
to, but distinct from, the epidermal growth factor receptor. 
Numerous studies have indicated that high levels of expres- 
sion of this protein are associated with rapid tumor growth, 
certain forms of therapy resistance, and shorter disease- free 
survival. The gene has been shown to be amplified and/or 
overexpressed in 10%-30% of invasive breast cancers and in 
40%-60% of intraductal breast carcinoma. 3 

There are two distinct FDA-approved methods by which 
HER-2/neu status can be evaluated: immunohistochemistry 
(IHC, HercepTest™) and FISH (fluorescent in situ hybridiza- 
tion, Path Vys ion™ Kit). Both methods can be performed on 
archived and current specimens. The first method allows visual 
assessment of the amount of HER-2/neu protein present on 
the cell membrane. The latter method allows direct quantifi- 
cation of the level of gene amplification present in the tumor, 
enabling differentiation between low- versus high-amplifica- 
ticfti. At least one study has demonstrated a difference in 



recurrence risk in women younger than 40 years of age for 
low- versus high-amplified tumors (54.5% compared to 
85.7%); this is compared to a recurrence rate of 16.7% for 
patients with no HER-2/neu gene amplification. 4 HER-2/neu 
status may be particularly important to establish in women with 
small (25 1 cm) tumor size. 

The choice of methodology for determination of HER-2/ 
neu status depends in part on the clinical setting. FDA approval 
for the Vysis FISH test was granted based on clinical trials 
involving 1549 node-positive patients. Patients received one 
of three different treatments consisting of different doses of 
cyclophosphamide, Adriamycin, and 5-fiuorouracil (CAF). 
The study showed that patients with amplified HER-2/neu 
benefited from treatment with higher doses of adriamycin- 
based therapy, while those with normal HER-2/neu levels did 
not. The study therefore identified a sub-set of women, who 
because they did not benefit from more aggressive treatment, 
did not need to be exposed to the associated side effects. In 
addition, other evidence indicates that HER-2/neu amplifica- 
tion in node-negative patients can be used as an independent 
prognostic indicator for early recurrence, recurrent disease at 
any time and disease-related death. 5 Demonstration of HER- 
2/neu gene amplification by FISH has also been shown to be 
of value in predicting response to chemotherapy in stage- 2 
breast cancer patients. 

Selection of patients for Herceptin 0 (Trastuzumab) mono- 
clonal antibody therapy, however, is based upon demonstra- 
tion of HER-2/neu protein overexpression using HercepTest™. 
Studies using Herceptin 0 in patients with metastatic breast 
cancer show an increase in time to disease progression, 
increased response rate to chemotherapeutic agents and a small 
increase in overall survival rate. The FISH assays have not yet 
been approved for this purpose, and studies looking at response 
to Herceptin 0 in patients with or without gene amplification 
status determined by FISH are in progress. 

In general, FISH and IHC results correlate well. However, 
subsets of tumors are found which show discordant results; 
i.e., protein overexpression without gene amplification or lack 
of protein overexpression with gene amplification. The clini- 
cal significance of such results is unclear. Based on the above 
considerations, HER-2/neu testing at SHMC/PAML will uti- 
lize immunohistochemistry (HercepTest 0 ) as a screen, fol- 
lowed by FISH in IHC-negative cases. Alternatively, either 
method may be ordered individually depending on the clini- 
cal setting or clinician preference. 
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PT code information 



HER-2/neu via IHC 

88342 (including interpretive report) 

HER-2/neu via FISH 

8827 1 x2 Molecular cytogenetics, DNA probe, each 
88274 Molecular cytogenetics, interphase in situ hybrid- 
ization, analyze 25-99 cells 
8829 1 Cytogenetics and molecular cytogenetics, interpre- 
tation and report 



Procedural Information 

Immunohistochemistry is performed using the FDA-approved 
DAKO antibody kit, Herceptest 0 . The DAKO kit contains 
reagents required to complete a two-step immunohisto- 
chemical staining procedure for.routinery processed, paraffin- 
embedded specimens. Following incubation with the primary 
rabbit antibody to human HER-2/neu protein, the kit employs 
a ready-to-use dextran-based visualization reagent. This re- 
agent consists of both secondary goat anti-rabbit antibody 
molecules with horseradish peroxidase molecules linked to a 
common dextran polymer backbone, thus eliminating the need 
for sequential application of link antibody and peroxidase 
conjugated antibody. Enzymatic conversion of the subse- 
quently added chromogen results in formation of visible 
reaction product at the antigen site. The specimen is then coun- 
terstained; a pathologist using light-microscopy interprets 
results. 

FISH analysis at SHMC/PAML is performed using the 
FDA-approved PathVysion™ HER-2/neu DNA probe kit, pro- 
duced by Vysis, Inc. Formalin fixed, paraffin-embedded breast 
tissue is processed using routine histological methods, and then 
slides are treated to allow hybridization of DNA probes to the 
nuclei present in the tissue section. The Pathvysion™ kit con- 
tains two direct-labeled DNA probes, one specific for the 
alphoid repetitive DNA (CEP 1 7, spectrum orange) present at 
the chromosome 17 centromere and the second for the HER- 
2/neu oncogene located at 1 7q 1 1 .2- 1 2 (spectrum green). Enu- 
meration of the probes allows a ratio of the number of copies 
of chromosome 17 to the number of copies of HER-2/neu to 
be obtained; this enables quantification of low versus high 
amplification levels, and allows an estimate of the percentage 
of cells with HER-2/neu gene amplification. The clinically 
relevant distinction is whether the gene amplification is due 
to increased gene copy number on the two chromosome 1 7 
homologues normally present or an increase in the number of 
chromosome 17s in the cells. In the majority of cases, ratio 
equivalents less than 2.0 are indicative of a normal/negative 
result, ratios of 2.1 and over indicate that amplification is 
present and to what degree. Interpretation of this data will be 
performed and reported from the Vy sis-certified Cytogenet- 
ics laboratory at SHMC. 
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ABSTRACT The consistent cytogenetic translocation of 
chronic myelogenous leukemia (the Philadelphia chromosome, 
Ph l ) has been observed In cells of multiple hematopoietic 
lineages. This translocation creates a chimeric gene composed 
of breakpoint*cluster»reglon (bcr) sequences from chromosome 
22 fused to a portion of the abl oncogene on chromosome 9. The 
resulting gene product (P210 c ** bl ) resembles the transforming 
protein of the Abelson murine leukemia virus in its structure 
and tyrosine kinase activity. P210 c * u is expressed in Ex- 
positive cell lines of myeloid lineage and in clinical specimens 
with, myeloid predominance. We show here that Epsteln-Barr 
virus-transfonned B -lymphocyte lines that retain Ph l can 
express PHO*"** 1 , The level of expression In these B-cell lines is 
generally lower and more variable than that observed for 
myeloid lines. Protein expression is not related to amplification 
of the abl gene but to variation In the level of bcr-abl mRNA 
produced from a single Ph 1 template. 



Chronic myelogenous leukemia (CML) is a disease of the 
pluripotent stem cell (1). In greater than 95% of patients, the 
leukemic cells contain the cytogenetic marker known as the 
Philadelphia chromosome, or Ph 1 (2). This reciprocal 
translocation event between the long arms of chromosomes 
9 and 22 has been used as a disease-specific marker for 
diagnosis and evaluation of therapy. Multiple hematopoietic 
lineages, including myeloid and B -lymphoid, contain Ph 1 in 
early or chronic phase, as well as in the more acute accel- 
erated and blast crisis phases of the disease. 

One molecular consequence of Ph 1 is the translocation of 
the chromosomal arm containing the c-abl gene oh chromo- 
some 9 into the middle of the breakpoint-cluster region (bcr) 
gene on chromosome 22 (3-6).' Although the precise 
translocation breakpoints are variable, an RNA-splicing 
mechanism generates a very similar 8-kilobase (kb) mRN A in 
each case (5-9). The hybrid bcr.-abl message encodes a 
structurally altered form of the abl oncogene product, called 
P210 c " aW (10-13), with an ammo-terminal segment derived 
from a portion of the exons of bcr on chromosome 22 and a 
carboxyl-tenninal segment derived from a major portion of 
the exons of the c-abl gene on chromosome 9. The chimeric 
structure of bcr-abl and the resulting V2lQ c M is similar to the 
structure of the Abelson murine leukemia virus gag-abl 
genome and resulting P160 v * w transforming gene product. 
Both proteins have very similar tyrosine kinase activities (10, 
11, 14) which can be distinguished by their relative stability 
to denaturing detergents and by their ATP requirements from 
the recently described tyrosine kinase activity of. the c-abl 
gene product (15). 
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In concert with structural modification of the amino- 
terminal portion of the abl gene, increased level of expression 
has been implicated in activation of c-abl oncogenic poten- 
tial. Myeloid and erythroid cell lines and clinical samples 
derived from acute-phase CML patients contain about 10- 
fold higher levels of the 8-kb bcr-abl mRNA and WW** than 
the c-abl mRNA forms (6 and 7 kb) and P145 c '* bl gene product 
(5, 8, 9, li), The higher level of expression of the chimeric 
bcrrabl message in acute-phase cells is not likely to be solely 
due to the presence of the bcr promoter sequences at the 5' 
end of the gene, since the normal 4.5-kb and 6.7-kb bcr- 
encoded mRNA species are expressed at an even lower level 
than the normal c-abl messages (5,6). 

We have analyzed a series of Epstein-Barr virus-immor- 
talized B-lymphoid cell lines derived from CML patients (16). 
With such in vitro clonal cell lines, we can evaluate whether 
the presence of t»h l always results in synthesis of the chimeric 
bcr-abl message and protein, and whether the quantitative 
expression varies for cells of B-lymphoid lineage as com- 
pared to previously examined myeloid cell lines. Our results 
show that cell lines that retain Ph* do express bcr-abl message 
and protein, but that the level is generally lower and more 
variable than previously seen for myeloid cell lines. The 
demonstration that the Ph 1 chromosomal template can vary 
in its level of expression of P210 c **^ suggests that secondary 
mechanisms, beyond the translocation itself, contribute to 
the regulation of the bcr-qbl gene in different cell types or 
subclones that derive from the affected stem cell. 

MATERIALS AND METHODS 

Cells and Cell Labelings. Epstein-Barr virus-transformed 
B-lymphoid cell lines were established from peripheral blood 
samples of chronic- and acute-phase CML patients as report* 
ed (16). The cell lines are designated according to patient 
number, karyotype, and lineage. For example, SK- 
CML7Bt(9,22)-33 refers to CML patient 7, B-lymphoid cell 
line, 9;22 translocation (Ph 1 ), cell line 33; and SK-CML7BN- 
2 refers to B-cell line 2 with a normal karyotype derived from, 
the same patient. Repeat karyotype analysis was performed 
to verify the retention of Ph 1 just prior to analysis for abl 
protein and RNA. Cells were maintained in RPMI 1640 
medium with 20% fetal bovine serum. We have not observed 
any consistent pattern of In vitro growth rate that correlates 
to the stage of disease at the time of transformation with 
Epstein-Barr virus. Cells (1.5 x 10 7 ) were washed twice with 
Dulbecco's modified Eagle's medium lacking phosphate and 



Abbreviations: bcr, breakpoint-cluster region * c CML, chronic 
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supplemented with 5% dialyzed fetal bovine serum. Cells 
were then resuspended in 2 ml of the minimal medium. 
Labeling was started with the addition of [ 32 P]orthophos- 
phate (1 mCi/ml; ICN; 1 Ci = 37 GBq) and continued at 37°C 
for 3-4 hr. 

Immunoprecipitation and Immunoblotting,, Immunoprecip- 
itations were carried out as described (10). Cells (1.5 x JO 7 ) 
were washed with phosphate-buffered saline and extracted 
with 3-5 ml of phosphate lysis buffer (1% Triton X-100/0.1 
NaDodSCU/0.5% deoxycholate/10 mM Na 2 HPO«, pH 7.5/ 
100 mM NaCl) with 5 mM EDTA and 5 mM phenylmethyl- 
sulfonyl fluoride. Extracts were clarified by centrifugation 
and precipitated with normal or rabbit anti-abl sera (anti- 
pEX-2 or anti-pEX-5) (17). The precipitated proteins were 
electrophoresed in a NaDodS0 4 /8% polyacrylamide gel. 
32 P-labelcd proteins were detected by autoradiography. 
Alternatively, abl proteins were detected by immunoblotting. 
Extracts from unlabeled cells were clarified, and proteins 
were concentrated by immunoprecipitation with rabbit anti- 
sera against aW-encoded proteins [anti-pEX-2 and anti-pEX- 
5 combined (17)] and then fractionated in 8% acrylamide gels. 
The proteins were transferred from the gel to nitrocellulose 
filters, using protease-facilitated transfer (18). The abl- 
encoded proteins were detected using murine monoclonal 
antibodies as a probe and peroxidase-conjugated goat anti- 
mouse second stage antibody (Bio-Rad) for development. 
Rabbit antisera and mouse monoclonal antibodies to abl 
proteins were prepared using bacterially expressed regions of 
the v-abl protein as immunogens (17, 19). Anti-pEX-2 anti- 
bodies react with the internal tyrosine kinase domain and 
antt-pEX-5 antibodies react with the carboxyl-terminal seg- 
ment of the abl proteins. 

RNA Analysis. RNA was extracted from 10 s cells by the 
NaDodSO^/urea/prienol method (20). Polyadenylylated 
RNA was purified by oligo(dT) affinity chromatography. 
Samples were electrophoresed in a 1% agarose/formalde- 
hyde gel and transferred to nitrocellulose, abl RNA species 
were detected by hybridization with a nick-translated v-abl 
fragment probe (21). 

DNA Analysis. DNA was prepared from 5 x 10 7 cells of 
each cell line and processed for Southern blots with a v-aW 
probe as described (21). 

RESULTS 

Variable Levels of Are Detected in Ph'-Positive Cell 

Lines. Ph 1 -positive and Ph 1 -negative, Epstein-Barr vims- 
transformed B -lymphocyte cell lines derived from the same 
patient were examined for HIO 0 "*" synthesis by immuno- 
precipitation of t 35 P]orthophosphate-labeled cell extracts 
with anti-abl sera (Fig. 1). The normal c-abl protein P145 c ** w 
was detected at a similar level in multiple Ph l -positive and 
Ph l -negative cell lines. P210°' abl was only detected in the 
Ph 1 -positive cell lines because the bcr-abl chimeric gene 
which encodes P210 c_abI resides on the Ph 1 (4, 5, 11, 13). The 
level of P210 cabl was about 4- to 5-fold higher than the level 
of P145<~ M in the SK-CML7Bt-33 cell line (Fig. 1A, +). The 
Ptf-positive erythroid-progenitor cell line K562 (C) showed 
a level of niO c ' M about 10-fold higher than P145 c -* w . 
However, the level of P210 c "* w was about one-fifth that of 
p 145 oab! in ^ phi-positive SK-CML16BM ceil line (Fig. \B> 
+). Comparison of different autoradiographic exposures 
roughly indicated that the level of P210 c ** bl varies over a 
20-fold range between these Ph l -positive B-cell lines. Anal- 
ysis of four additional Ph l -positive B-cell lines demonstrated 
that the level of P210 caW fell into two general classes; some 
cell lines had a level of P210 c - abl similar to SK-CML7Bt-33 
and others had the low level similar to SK-CML16BM (Table 
1). This differs from previous studies with Ph^positive 
myeloid cell lines and patient samples derived from acute- 
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Fio. 1. Detection of variable levels of P210°^ w in Ph l -positive 
B-cell lines. Production of PMS 6 ** 14 and P21u~ bl in Epstein-Barr 
virus-itransformed B-cell lines derived from a blast-crisis (A) and a 
chronic-phase (£) CML patient was examined by metabolic labeling 
with [ 3J P]orthophosphate and immunoprecipitation. Ph*-negative 
(-) and Ph l -positive (+) cell lines derived from each patient were 
analyzed. The Ph^negative cell line in A,- is SK-CMJ-7BN-2 and in 
B t - is SK-CML16BN-1. The Ph l -positive cell line in A,+ is 
SK-CML7Bt-33 and in is SK-CML16Bt-l. The K562 cell line, a 
Ph'-positive erythroid progenitor cell line spontaneously derived 
from a blast-crisis patient (33), is represented in C. Cells (1.5 x 10 7 ) 
were metabolically labeled with 2 mCi of [ 32 P]orthpphosphate for 3-4 
hr and then were extracted and clarified by centrifugation. Samples 
were immunpprecipitated with control normal serum (lanes 1), 
anti-pEX-2 (lanes 2), or anu-pEX-5 flanes 3) and analyzed by 
NaDodS0 4 /8% PAGE followed by autoradiography with an inten- 
sifying screen (3 days for A and C, 10 days for B). 

phase CML patients, in which P210 c ' aW was detected at a 
10-fold higher level than ¥145*** (refs. 10 and 11; Table 1). 
There was no large difference in level of chimeric mRNA and 
P210 c -* bl expressed in four myeloid/erythroid-lineage Ph 1 - 
positive cell lines (K562, EM2, EM3, CML22, and BV173; 
refs. 9 and 11), despite a 4- to 5-fold amplification of 
aW-related sequences in the K562 cell line. 

Detection of different levels of P210 c * abl in Fig. 1 could be 
due to decreased phosphorylation of P210 c ^ bl ( a lower level 
of P21(r >abl synthesis, or altered stability of the protein. To 
help distinguish among these possibilities, the steady-state 
level of P210 c " aW in the cell lines was assayed by immuno- 
blotting. The results show that SK-CML7Btr33 (Fig. 2A, +) 
had a higher level of P210 <>abl than P145, similar to the results 
with metabolic labeling (Fig. 1). We did not detect P210 c "* bl 
by immunoblotting with 2 x 10 7 cells of line SK-CML8Bt-3 
(Fig. 2fi, +). Reconstruction experiments using dilutions of 
cell extracts showed that we could detect about 5-10% the 
level of P210 fr,lbl expressed in the K562 cell line (data not 
shown). We infer that the steady-state level of P210 c * abl in 
SK-CML8Bt-3 is lower than the level in SK-CML7Bt-33 by 
a factor of at least 10. The level of P210 c " abl detected in these 
assays correlated with the amount of P210 c " abl tyrosine kinase 
activity that could be detected in vitre (data not shown). 

Different Levels of P210^ bl Are Reflected in the Amount of 
Stable bcr-abl mRNA. To identify the basis for detection of 
variable levels of P210 cabl , we examined the production of 
the abl RNA. RNA blot hybridization analysis using a v-abl 
probe (Fig. 3) showed that the normal 6- and 7-kb c-aW 
mRNAs were present at a similar level in Ph 1 -positive and 
-negative cell lines derived from different patients. However, 
the 8-kb mRNA that encodes P210 c ' abl was detected at a 
10-fold higher level in SK-CML7Bt-33 (Fig. 3A, +) than in 
SK-CML16BM (£, +), which correlated with the relative- 
level of P210 c ' abl detected in each cell line. Analysis of 
additional cell lines demonstrated that the level of 8-kb RNA 
directly correlated with the level of P210 c ' abl (Table 1). The 
variation in level of 8-kb RNA detected in these cell lines was 
not due to loss or gain of Ph 1 , because cytogenetic analysis 
confirmed the presence of Ph 1 in these cell lines (ref. 16 and 
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Table 1. Relative levels of bcr-abl expression in Epsteinr-Barr 
virus-immortalized B-cell line s and myeloid CML lines ' . . 

8-kb 



Cell line* 

VVU UUV 


CML phase* 


Ph 1 * 


P210* 


mRNAV 


SK-CML7BN-2 


BC 








SK-CML8BN-10 


Chronic 








SK-CML8BN-12 


Chronic . 








SK-CML16BN-1 


Chronic 








SK-CML35BN-1 


Chronic 








SK-CML7B5-33 


BC 


+ • 


+ ++ 


■ ++ + 


SK-CML21BM 


Acc 


+ 


+ + + 


+ + + 


SK-CML21Bt-6 


Acc 


+ 


+++ 


+++. 


SK-CMLBBt-3 


Chronic 


+ 


+ 


± " 


SK-CML16Bt-l 


Chronic 


+ 




+ 


SK-CML35Bt-2 


Chronic 


+ 


■ + 


'+ ' 


K562 


BC 


+ ■ 


+++++ 


■ ■+++++ 


BVX73 


BC 


. + . 


+++++ 


++++ + 


BM2 


BC 


+ 


++++ + 


+++++ 



•Cell lines derived from CML patients by transformation with 
Epstein-Barr virus as described (16). Names of cell lines indicate 
patient number and Ph l status: SK-CML7Bt indicates a cell line 
derived from patient 7 that carries the 9;22 Ph 1 translocation; K 
indicates a normal karyotype. Myeloid-erythroid cell lines (K562, 
EM2, and BV173) are described in previous publications (9, 11, 22, 
33). 

^Status of patient at the time cell line was derived. BC, blast crisis; 
Acc, accelerated phase. 

♦Presence (+) or absence (-) of Ph 1 as demonstrated by karyotype 
or Southern blot analysis. 

9P210 C ** W detected as described in legend to Pig. 1. B-cell lines 
derived from blast-crisis and accelerated-phase patients had levels 
of P210 3- to 5-fold higher (+++) than levels of P145. Chronic- 
phase-derived cell lines had 1710 levels lower than or just equivalent 
(+) to the level of P145. Myeloid and erythroid lines had levels of 
P210 5- to 10-fold higher than P145 (+ + + + +). 

tEight-kilobase bcr-abl mRNA detected as described in legend to 
Fig. 2. Symbols: ±, borderline detectable; .+++++, level of 8-kb 
mRNA 5- to 10-fold higher than that of the 6- and 7-kb c-cW mRNA 
species; + + + , level of 8-kb mRNA 3- to 5-fold higher than that of 
the 6- and 7-kb species; +, a level approximately equivalent to that 
of the 6- and 7-kb messages. 

data not shown). There was no difference in the copy number 
of flM-related sequences as judged by Southern blot analysis 
(Fig. 4). Only the K562 cell line control showed an amplifi- 
cation of 'abl sequences, as previously reported (22, 23). 
These combined data suggest that differential bcr*abl mRNA 
expression from a single gene template is responsible for the 
variable levels of P210 c * €bI detected. This could be mediated 




— P210 



— P145 



Fio. 2. Analysis of steady-state abl protein levels by immuno- 
blotting. Cell extracts prepared from 2 x 10 7 cells of lines SK- 
CML7BN-2 (A,-), SK-CML7Bt-33 (A.+), SK-CML8BN-10 (£,-), 
and SK-CML8Bt-3 (B,+) were concentrated by immunoprecip- 
itation with anti-pEX-2 plus anti-pEX-5. Samples were then electro- 
phoresed in a NaDodS0 4 /8% polyacrylamide gel and transferred to 
nitrocellulose, using protease-facilitated transfer (18). abl proteins 
were detected using a mixture of two monoclonal antibodies directed 
against the pEX-2 and pEX-5 gfef-protein fragments produced in 
bacteria (19) as a probe and a peroxidase-conjugated goat anti-mouse 
second-stage antibody (Bio-Rad) for development. 
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Fig. 3. Comparison of abl RNA levels in Ph l -positive and 
-negative B-cell lines. The levels of the normal 6- and 7-kb c-abl 
RN As and the 8-kb bcr-abl RNA were analyzed by blot hybridization 
using a v-abl probe. RNA was extracted from Ph^negative lines 
SK-CML7BN-2 (A,-) and SK-CML16BN-1 (B,-), from Ph l -pos- 
itive lines SK-CML6Bt-33 (A.+) and SK-CML16Bt-3 (B,+) ( and 
from line K562 (C,+) by the NaDodS6 4 /urea/phenol method (20). 
Polyadenylylated RNA was purified by oligo(dT) affinity chroma- 
tography, and IS fig of each sample was electrophoresed in a 1% 
agarose /formaldehyde gel and then transferred to nitrocellulose. The 
blotted RNAs were hybridized with a nickttranslated v-aW fragment 
probe (21) and then autoradiographed for 4 days. . 



by factors influencing the transcription rate of the bcr-abl 
gene or the stability of the mRNA. 

DISCUSSION 

Several lines of evidence suggest that formation of Ph 1 is not 
* the primary event that affects the stem cell in CML. Patients 
have been identified that present with the clinical picture of 
CML but only later develop Ph 1 (1). This observation, 
coupled with studies of G6PD (glucose-6-phosphate dehy- 
drogenase)-heterozygous females with CML that demon- 
strate stem-cell clonality by isozyme analysis among cell 
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Fio. 4. Southern blot analysis of abl sequences in Ph'-positive 
and -negative B-cell lines. High molecular weight DNA (15 /*g) was 
digested with restriction endonuclease Bam HI, separated in a 0.8% 
agarose gel, and then transferred to nitrocellulose. The blotted DNA 
fragments were hybridized with a nick- translated, 2.4-kb Bgl II \-abl 
fragment (1.5 x 10* cpm/^g; ref. 21) and exposed for 4 days. (A) 
Autoradiogram of abb specific fragments in cell lines HL-60 (lane 1), 
EM2 (lane 2). K562 (lane 3), SK-CML7Bt-33 (lane 4), SK-CML8Bt-3 
Cane 5), SK-CML16BM (lane 6), SK-CML21BI-6 Cane 7). SK- 
CML35BI-2 (iane 8), SK-CML7BN-2 Cane 9), SK-CML8BN-2 Cane 
10), and SK-CML35BN-1 (lane 11). (B) Ethidium bromide staining of 
agarose gel prior to transfer to nitrocellulose , showing the level of 
variation in amount of DNA loaded per lane. 
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populations that lack the Ph 1 marker, supports a secondary 
or complementary role for Ph 1 in the progression of the 
disease (24, 25). This chromosome marker is found in 
chronic, accelerated, and blast-crisis phases of the disease. It 
is likely that Ph 1 confers some growth advantage, since cells 
with the marker chromosome eventually predominate the 
marrow and peripheral blood even in chronic phase. During 
the phase of blast crisis, many patients develop additional 
chromosome abnormalities, including duplication of Ph 1 , a 
variety of trisomies, and complex translocations (26). This 
is suggestive evidence for Ph 1 being a necessary but not 
sufficient genetic change for the full evolution of the 
disease. 

The realization that one molecular result of Ph 1 is the 
generation of a chimeric bcr-abl protein with functional 
characteristics and structure analogous to the gag-abl trans- 
forming protein of the Abelson murine leukemia virus 
strengthens the argument for an important role of Ph 1 in the 
pathogenesis of CML. Although the Abelson vims is gener- 
ally considered a rapidly transforming retrovirus, its effects 
can range from overcoming growth factor requirements, to 
cellular lethality, to induction of highly oncogenic tumors in 
a number of hematopoietic cell lineages (27, 28). Even in the 
transformation of murine cell targets, there are several lines 
of evidence that suggest that the growth-promoting activity of 
the v-abl gene product is complemented by further cellular 
changes in the production of the malignant-cell phenotype 
(29-31). 

The regulation of bcr-abl gene expression is complex 
because the 5' end of the gene is derived from the non : cW 
sequences, bcr, normally found on chromosome 22 (6). The 
level of stable message for the normal bcr gene and the 
normal abl gene are both much lower than the level of the 
bcr-abl message and protein from cell lines and clinical 
specimens derived from myeloid blast-crisis patients (5, 6, 
11). Therefore, the high level of bcr-abl expression cannot 
simply be attributed to the regulatory sequences associated 
with bcr. Possibly, creation of the chimeric gene disrupts the 
normal regulatory sequences and results in a higher level of 
expression. Variation in bcr-abl expression may result from 
secondary changes in the structure of the chimeric gene or 
function of fra/u-acting factors that occur during evolution of 
the disease. Our analysis of P210 c -* bI and the 8-kb mRN A in 
Epstein-Barr virus-transformed Ph^positive B-cell lines 
demonstrates that stable message and protein levels from the 
bcr-abl gene can vary over a wide range. This variation does 
not result from a change in the number of bcr-abl templates 
secondary to gene amplification but more likely from changes 
in either transcription rate or mRNA stability. We suspect 
this range of bcr-abl expression is not limited to lymphoid 
cells. Analysis of peripheral blood leukocytes derived from 
an unusual CML patient who has been in chronic phase with 
myeloid predominance for 16 years showed a level of 
P210 c ' aW one-fifth that of P145 cabf , as detected by metabolic 
labeling with [ 32 FJorthophosphate and tmmunoprecipitation 
(S.C., O.N.W., and P. Greenberg, unpublished observa- 
tions). Lower levels of expression of the chimeric mRNA 
have been demonstrated in clinical samples from chronic- 
phase CML patients compared to acute-phase CML patients 
(9). Others have reported chronic-phase patients with vari- 
able but,* in some cases, relatively high levels of the bcr-abl 
mRNA (32). The sampling variation and the heterogenous 
mixture of cell types in clinical samples complicate such 
analyses. Further work is needed to evaluate whether there 
is a defined change in P210 c,lbl expression during the pro- 
gression of CML. It is interesting to note that among the 
limited sample of Prepositive B-cell lines we have examined 
(Table 1), we have seen higher levels of P210 c '* bl in those 
derived from patic v. * at more advanced stages of the disease. 
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It will be important to search for cell-type-specific mecha- 
nisms that might regulate expression of bcr-abl from Ph 1 . 
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Review 

Paul a. Haynes Proteome analysis: Biological assay or data archive? 

Steven P. Gygi 

Daniel Figeys j n t ^ s rev j ew we examine the current state of proteome analysis. There are 

Ruedi Aebersolo thiet ma | n j ssues discussed: why it is necessary to study proteomes; how pro- 

teomes can be analyzed with current technology; and how proteome analysis 
Department of Molecular can be used lQ en h ance biological research. We conclude that proteome anal- 

Biotechnology, University of ysis is an essenl j a i t00 i j n the understanding of regulated biological systems. 

Washington, Seattle, WA, USA Current technology, while still mostly limited to the more abundant proteins, 

enables the use of proteome analysis both to establish databases of proteins 
present, and to perform biological assays involving measurement of multiple 
variables. We believe that the utility of proteome analysis in future biological 
research will continue to be enhanced by further improvements in analytical 
technology. 
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1 Introduction 

A proteome has been defined as the protein complement 
expressed by the genome of an organism, or, in multicel- 
lular organisms, as the protein complement expressed by a 
tissue or differentiated cell [1], In the most common im- 
plementation of proteome analysis the proteins extracted 
from the cell or tissue analyzed are separated by high 

Correspondence: Professor Ruedi Aebersold, Department of Molecular 
Biotechnology, University of Washington, Box 357730, Seattle, WA, 
98195, USA (Tel: +206-685-4235; Fax: +206-685-6392; E-mail: ruedi 
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Abbreviations: CID, collision-induced dissociation; MS/MS, tandem 
mass spectrometry; SAGE, serial analysis of gene expression 

Keywords: Proteome / TVo-dimensional polyacrylamide gel electro- 
phoresis / Tandem mass spectrometry 



2 Rationale for proteome analysis 

The dramatic growth in both the number of genome 
projects and the speed with which genome sequences 
are being determined has generated huge amounts of 
sequence information, for some species even complete 
genomic sequences ([15—17]). The description of the 
state of a biological system by the quantitative measure- 
ment of system components has long been a primary 
objective in molecular biology. With recent technical 
advances including the development of differential dis- 
play-PCR [18], cDNA microarray and DNA chip techno- 
logy [19, 20] and serial analysis of gene expression 
(SAGE) [21, 22], it is now feasible to establish global and 
quantitative mRNA expression maps of cells and tissues, 
in which the sequence of all the genes is known, at a 
speed and sensitivity which is not matched by current 



© WILEY-VCH Verlag GmbH, 69451 Weinheim, 1998 



0173-0835/98/1 1 1 1-1862 $17.50+ 50/0 



Electrophoresis 1998. 19, 1862-1871 

protein analysis technology. Given the long-standing 
paradigm in biology that DNA synthesizes RNA which 
synthesizes protein, and the ability to rapidly establish 
global, quantitative mRNA expression maps, the ques- 
tions which arise are why technically complex proteome ? 
projects should be undertaken and what specific types of ? 
information could be expected from proteome projects 1 
which cannot be obtained from genomic and transcript I 
profiling projects. We see three main reasons for pro- I 
teome analysis to become an essential component in the f 
comprehensive analysis of biological systems, (i) Protein 1 
expression levels are not predictable from the mRNA 
expression levels, (ii) proteins are dynamically modified 
and processed in ways which are not necessarily 
apparent from the gene sequence, and (iii) proteomes 
are dynamic and reflect the state of a biological system 
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2.1 Correlation between mRNA and protein expression 
levels 

Interpretations of quantitative mRNA expression profiles 
frequently implicitly or explicitly assume that for specific 
genes the transcript levels are indicative of the levels of 
protein expression. As part of an ongoing study in our 
laboratory, we have determined the correlation of expres- 
sion at the mRNA and protein levels for a population of 
selected genes in the yeast Saccharomyces cerevisiae 
growing at mid-log phase (S. P. Gygi et ai, submitted for 
publication). mRNA expression levels were calculated 
from published SAGE frequency tables [22] Protein 
expression levels were quantified by metabolic radiola- 
behng of the yeast proteins, liquid scintillation counting 
ot the protein spots separated by high resolution 2-DE 
and mass spectrometry identification of the protein(s) 
migrating to each spot. The selected 80 samples consti- 
tute a relatively homogeneous group with respect to pre- 
dicted half-life and expression level of the protein pro- 
ducts. Thus far, we have found a general trend but no 
strong correlation between protein and transcript levels 
(tig. l). For some genes studied equivalent mRNA trans- 
cript levels translated into protein abundances which 
varied by more than 50-fold. Similarly, equivalent steady- 
state protein expression levels were maintained by trans- 
cnpt levels varying by as much as 40-fold (S P Gygi 
et ai, submitted). These results suggests that even for a 
population of genes predicted to be relatively homoge- 
neous with respect to protein half-life and gene expres- 
sion, the protein levels cannot be accurately predicted 
trom the level of the corresponding mRNA transcript. 

2.2 Proteins are dynamically modified and processed 

In the mature, biologically active form many proteins are 
post-translationally modified by glycosylate, phosphor- 
ylation prenylation, acyiation, ubiquitination or one or 
more of many other modifications [23] and many pro- 
teins are only functional if specifically associated or com- 
ptexed with other molecules, including DNA RNA pro- 
teins and organic and inorganic cofactors. Frequently 
modifications are dynamic and reversible and may alter 
tne precise three-dimensional structure and the state of 
activity of a protein. Collectively, the state of modifica- 
tion of the proteins which constitute a biological system 
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are important indicators for the state of the system The 
type of protein modification and the sites modified at a 
specific cellular state can usually not be determined 
trom the gene sequence alone. 

2.3 Proteomes are dynamic and reflect the state of a 
biological system 

A single genome can give rise to many qualitatively and 
quantitatively different proteomes. Specific stages of the 
cell cycle and states of differentiation, responses to 
growth and nutrient conditions, temperature and stress 
and pathological conditions represent cellular states 
which are characterized by significantly different pro- 
teomes. The proteome, in principle, also reflects events 
that are under translation^ and post-translational con- 
trol. It is therefore expected that proteomics will be able 
to provide the most precise and detailed molecular des- 
cription of the state of a cell or tissue, provided that the 
external conditions defining the state are carefully deter- 
mined. In answer to the question of whether the study 
of proteomes is necessary for the analysis of biomolec- 
ular systems, it is evident that the analysis of mature pro- 

S.f r UCtS *? ?" S J S eSSential 35 there are numerous 
levels of control of protein synthesis, degradation 
processing and modification, which are only apparent by 
direct protein analysis. y 



3 Description and assessment of current proteome 
analysis technology 

3.1 Technical requirements of proteome technology 

In biological systems the level of expression as well as 
the states of modification, processing and macro-molec- 
ular association of proteins are controlled and modu- 
lated depending on the state of the system. Comprehen- 
sive analysis of the identity, quantity and state of modifi- 
cation of proteins therefore requires the detection and 
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quantitation of the proteins which constitute the system, 
and analysis of differentially processed forms. There are 
a number of inherent difficulties in protein analysis 
which complicate these tasks. First, proteins cannot be 
amplified. It is possible to produce large amounts of a 
particular protein by over-expression in specific cell sys- 
tems. However, since many proteins are dynamically 
post-translationally modified, they cannot be easily am- 
plified in the form in which they finally function in the 
biological system. It is frequently difficult to purify from 
the native source sufficient amounts of a protein for 
analysis. From a technological point of view this trans- 
lates into the need for high sensitivity analytical tech- 
niques. Second, many proteins are modified and pro- 
cessed post-translationally. Therefore, in addition to the 
protein identity, the structural basis for differentially 
modified isoforms also needs to be determined. The dis- 
tribution of a constant amount of protein over several 
differentially modified isoforms further reduces the 
amount of each species available for analysis. The com- 
plexity and dynamics of post-translational protein edit- 
ing thus significantly complicates proteome studies. 
Third, proteins vary dramatically with respect to their 
solubility in commonly used solvents. There are few, if 
any, solvent conditions in which all proteins are soluble 
and which are also compatible with protein analysis. This 
makes the development of protein purification methods 
particularly difficult since both protein purification and 
solubility have to be achieved under the same condi- 
tions. Detergents, in particular sodium dodecyl sulfate 
(SDS), are frequently added to aqueous solvents to 
maintain protein solubility. The compatibility with SDS 
is a big advantage of SDS polyacrylamide gel electro- 
phoresis (SDS-PAGE) over other protein separation 
techniques. Thus, SDS-PAGE and two-dimensional gel 
electrophoresis, which also uses SDS and other deter- 
gents, are the most general and preferred methods for 
the purification of small amounts of proteins, provided 
that activity does not necessarily need to be maintained. 
Lastly, the number of proteins in a given cell system is 
typically in the thousands. Any attempt to identify and 
categorize all of these must use methods which are as 
rapid as possible to allow completion of the project 
within a reasonable time frame. Therefore, a successful, 
general proteomics technology requires high sensitivity, 
high throughput, the ability to differentiate differentially 
modified proteins, and the ability to quantitatively dis- 
play and analyze all the proteins present in a sample. 

3.2 2-D electrophoresis - mass spectrometry: a common 
implementation of proteome analysis 

The most common currently used implementation of 
proteome analysis technology is based on the separation 
of proteins by two-dimensional (IEF/SDS-PAGE) gel 
electrophoresis and their subsequent identification and 
analysis by mass spectrometry (MS) or tandem mass 
spectrometry (MS/MS). In 2-DE, proteins are first separ- 
ated by isoelectric focusing (IEF) and then by SDS- 
PAGE, in the second, perpendicular dimension. Separ- 
ated proteins are visualized at high sensitivity by staining 
or autoradiography, producing two-dimensional arrays of 
proteins. 2-DE gels are, at present, the most commonly 
used means of global display of proteins in complex 



samples. The separation of thousands of proteins has 
been achieved in a single gel [24, 25] and differentially 
modified proteins are frequently separated. Due to the 
compatibility of 2-DE with high concentrations of deter- 
gents, protein denaturants and other additives promoting 
protein solubility, the technique is widely used. 

The second step of this type of proteome analysis is the 
identification and analysis of separated proteins. Individ- 
ual proteins from polyacrylamide gels have traditionally 
been identified using /V-terminal sequencing [26, 27], 
internal peptide sequencing [28, 29], immunoblotting or 
comigration with known proteins [30]. The recent dra- 
matic growth of large-scale genomic and expressed 
sequence tag (EST) sequence databases has resulted in a 
fundamental change in the way proteins are identified by 
their amino acid sequence. Rather than by the traditional 
methods described above, protein sequences are now fre- 
quently determined by correlating mass spectral or 
tandem mass spectral data of peptides derived from pro- 
teins, with the information contained in sequence data- 
bases [31—33]. 

There are a number of alternative approaches to pro- 
teome analysis currently under development. There is 
considerable interest in developing a proteome analysis 
stragegy which bypasses 2-DE altogether, because it is 
considered a relatively slow and tedious process, and 
because of perceived difficulties in extracting proteins 
from the gel matrix for analysis. However, 2-DE as a 
starting point for proteome analysis has many advan- 
tages compared to other techniques available today. The 
most Significant strengths of the 2-DE-MS approach 
include the relatively uniform behavior of proteins in 
gels, the ability to quantify spots and the high resolution 
and simultaneous display of hundreds to thousands of 
proteins within a reasonable time frame. 

A schematic diagram of a typical procedure of the identi- 
fication of gel-separated proteins is shown in Fig. 2. Pro- 
tein spots detected in the gel are enzymatically or chemi- 
cally fragmented and the peptide fragments are isolated 
for analysis, as already indicated, most frequently by MS 
or MS/MS. There are numerous protocols for the gener- 
ation of peptide fragments from gel-separated proteins. 
They can be grouped into two categories, digestion in 
the gel slice [28, 34] or digestion after electrotransfer out 
of the gel onto a suitable membrane ([29, 35—37] and 
reviewed in [38]). In most instances either technique is 
applicable and yields good results. The analysis of MS or 
MS/MS data is ah important step in the whole process 
because MS instruments can generate an enormous 
amount of information which cannot easily be managed 
manually. Recently, a number of groups have developed 
software systems dedicated to the use of peptide MS 
and MS/MS spectra for the identification of proteins. 
Proteins are identified by correlating the information 
contained in the MS spectra of protein digests or 
MS/MS spectra of individual peptides with data con- 
tained in DNA or protein sequence databases. 

The systems we are currently using in our laboratory are 
based on the separation of the peptides contained in pro- 
tein digests by narrow bore or capillary liquid chromatog- 
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Figure 2. Schematic diagram of a procedure for identification of gel- 
separated proteins. Peptides can either be separated by a technique 
such as LC or CE, or infused as a mixture and sorted in the MS. Data- 
base searching can either be performed on peptide masses from an 
MS spectrum, peptide fragment masses from CID spectra of peptides, 
or a combination of both. 



raphy [39, 40] or capillary electrophoresis [41], the anal- 
ysis of the separated peptides by electrospray ioniza- 
tion (ESI) MS/MS, and the correlation of the generated 
peptide spectra with sequence databases using the 
SEQUEST program developed at the University of Wash- 
ington [32, 33]. The system automatically performs the 
following operations: a particular peptide ion character- 
ized by its mass-to-charge ratio is selected in the MS out 
of all the peptide ions present in the system at a parti- 
cular time; the selected peptide ion is collided in a colli- 
sion cell with argon (collision-induced dissociation, 
CID) and the masses of the resulting fragment ions are 
determined in the second sector of the tandem MS; this 
experimentally determined CID spectrum is then corre- 
lated with the CID spectra predicted from all the pep- 
tides in a sequence database which have essentially the 
same mass as the peptide selected for CID; this correla- 
tion matches the isolated peptide with a sequence seg- 
ment in a database and thus identifies the protein from 
which the peptide was derived. There are a number of 
alternative programs which use peptide CID spectra for 
protein identification, but we use the SEQUEST system 
because it is currently the most highly automated pro- 
gram and has proven to be successful, versatile and 
robust. 

3,3 Protein identification by LC-MS/MS, capillary 
LC-MS/MS and CE-MS/MS 

It has been demonstrated repeatedly that MS has a very 
high intrinsic sensitivity. For the routine analysis of gel- 
separated proteins at high sensitivity, the most signif- 
icant challenge is the handling of small amounts of 
sample. The crux of the problem is the extraction and 
transferal of peptide mixtures generated by the digestion 
of low nanogram amounts of protein, from gels into the 
MS/MS system without significant loss of sample or 
introduction of unwanted contaminants. We employ 
three different systems for introducing gel-purified sam- 
ples into an MS, depending on the level of sensitivity 



required. As an approximate guideline, for samples con- 
taining tens of picomoles of peptides, LC-MS/MS is 
most appropriate; for samples containing low picomole 
amounts to high femtomole amounts we use capillary 
LC-MS/MS; and for samples containing femtomoles or 
less, CE-MS/MS is the method of choice. 

3.3.1 LC-MS/MS 

The coupling of an MS to an HPLC system using a 
0.5 mm diameter or bigger reverse phase (RP) column 
has been described in detail [42). This system has several 
advantages if a large number of samples are to be ana- 
lyzed and all are available in sufficient quantity. The 
LC-MS and database searching program can be run in a 
fully automated mode using an autosampler, thus maxi- 
mizing sample throughput and minimizing the need for 
operator interference. The relatively large column is 
tolerant of high levels of impurities from either gel prep- 
aration or sample matrix. Lastly, if configured with a 
flow-splitter and micro-sprayer [40], analyses can be per- 
formed on a small fraction of the sample (less than 5%) 
while the remainder of the sample is recovered in very 
pure solvents. This latter feature is particularly useful 
when an orthogonal technique is also used to analyze 
peptide fractions, such as scintillation of an introduced 
radiolabel, and this data can be correlated with peptides 
identified by CID spectra. 

3.3.2 Capillary LC-MS 

An increase of sensitivity of approximately tenfold can be 
achieved by using a capillary LC system with a 100 urn ID 
column rather than a 0.5 mm ID column as referred to 
above. Since very low flow rates are required for such 
columns, most reports have used a precolumn flow split- 
ting system for producing solvent gradients. We have 
recently desribed the design and construction of a novel 
gradient mixing system which enables the formation 
of reproducible gradients at very low flow rates (low 
nL/min) without the need for flow splitting (A. Ducret 
et al. y submitted for publication). Using this capillary 
LC-MS/MS system we were able to identify gel-separat- 
ed proteins if low picomole to high femtomole amounts 
were loaded onto the gel [40]. This system is as yet not 
automated and, like all capillary LC systems, is prone to 
blockage of the columns by microparticulates when ana- 
lyzing gel-separated proteins. 

3.3.3 CE-MS/MS 

The highest level of sensitivity for analyzing gel-sep- 
arated proteins can be achieved by using capillary elec- 
trophoresis - mass spectrometry (CE-MS). We have de- 
scribed in the past a solid-phase extraction capillary elec- 
trophoresis (SPE-CE) system which was used with triple 
quadrupole and ion trap ESI-MS/MS systems for the 
identification of proteins at the low femtomole to sub- 
femtomole sensitivity level [43, 44]. While this system is 
highly sensitive, its operation is labor-intensive and its 
operation has not been automated. In order to devise an 
analytical system with both the sensitivity of a CE and 
the level of automation of LC, we have constructed 
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Figure J. Schematic illustration of a 
microfabricated analytical system for CE, 
consisting of a micromachined device, 
coated capillary electroosmotic pump, 
and microelectrospray interface. The 
dimensions of the channels and reservoir 
are as indicated in the text. The channels 
on the device were graphically enhanced 
to make them more visible. Reproduced 
from (45), with permission. 



microfabricated devices for the introduction of samples 
into ESI-MS for high-sensitivity peptide analysis. 

The basic device is a piece of glass into which channels 
of 10—30 \xm in depth and 50—70 \im in diameter are 
etched by using photolithography/etching techniques 
similar to the ones used in the semiconductor industry. 
(A simple device is shown in Fig, 3), The channels are 
connected to an external high voltage power supply [45]. 
Samples are manipulated on the device and off the 
device to the MS by applying different potentials to the 
reservoirs. This creates a solvent flow by electroosmotic 
pumping which can be redirected by changing the posi- 
tion of the electrode. Therefore, without the need for 
valves or gates and without any external pumping, the 
flow can be redirected by simply switching the position 
of the electrodes on the device. The direction and rate of 
the flow can be modulated by the size and the polarity 
of the electric field applied and also by the charge state 
of the surface. 

The type of data generated by the system is illustrated in 
Fig. 4, which shows the mass spectrum of a peptide sample 
representing the tryptic digest of carbonic anhydrase at 
290 fmol/uL. Each numbered peak indicates a peptide suc- 
cessfully identified as being derived from carbonic an- 



hydrase. Some of the unassigned signals may be chemical 
or peptide contaminants. The MS is programmed to auto- 
matically select each peak and subject the peptide to CID. 
The resulting CID spectra are then used to identify the 
protein by correlation with sequence databases. Therefore, 
this system allows us to concurrently apply a number of 
protein digests onto the device, to sequentially mobilize 
the samples, to automatically generate CID spectra of 
selected peptide ions and to search sequence databases 
for protein identification. These steps are performed auto- 
matically without the need for user input and proteins can 
be identified at very low femtomole level sensitivity at a 
rate of approximately one protein per 15 min. 

3.4 Assessment of 2-DE-MS proteome technology 

Using a combination of the analytical techniques de- 
scribed above we have identified the 80 protein spots 
indicated in Fig. 5. The protein pattern was generated by 
separating a total of 40 microgram of protein contained 
in a total cell lysate of the yeast strain YPH499 by high 
resolution 2-DE and silver staining of the separated pro- 
teins. To estimate how far this type of proteome analysis 
can penetrate towards the identification of low abun- 
dance proteins, we have calculated the codon bias of the 
genes encoding the respective proteins. Codon bias is a 




m/z 



Figure 4. MS spectrum of a tryptic digest 
of carbonic anhydrase using the microfa- 
bricated system shown in Fig. 3. 290 
fmol/uL of carbonic anhydrase tryptic 
digest was infused into a Finnigan LCQ 
ion trap MS. Each peak was selected for 
CID, and those which were identified as 
containing peptides derived from car- 
bonic anhydrase are numbered. Repro- 
duced from [45], with permission. 
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calculated measure of the degree of redundancy of trip- 
let DNA codons used to produce each amino acid in a 
particular gene sequence. It has been shown to be a 
useful indicator of the level of the protein product of a 
particular gene sequence present in a cell [46]. The gen- 
eral rule which applies is that the higher the value of the 
codon bias calculated for a gene, the more abundant the 
protein product of that gene becomes. The calculated 
codon bias values corresponding to the proteins identi- 
fied in Fig. 5 are shown in Fig. 6b. Nearly all of the pro- 
teins identified (> 95%) have codon bias values of > 0.2, 
indicating they are highly abundant in cells. In contrast,' 
codon bias values calculated for the entire yeast genome 
(Fig. 6a) show that the majority of proteins present in 
the proteome have a codon bias of < 0.2 and are thus of 
low abundance. 

This finding is of considerable importance in our assess- 
ment of the current status of proteome analysis technol- 
ogy. It is clear that even using highly sensitive analytical 
techniques, we are only able to visualize and identify the 



more abundant proteins. Since many important regula- 
tory proteins are present only at low abundance, these 
would not be amenable to analysis using such tech- 
niques. This situation would be exacerbated in the anal- 
ysis of proteomes containing many more proteins than 
the approximately 6000 gene products present in yeast 
cells [16]. In the analysis of, for example, the proteome 
of any human cells, there are potentially 50000-100000 
gene products [47]. Inherent limitations on the amount 
of protein that can be loaded on 2-DE, and the number 
of components that can be resolved, indicate that only 
the most highly abundant fraction of the many gene 
products could be successfully analyzed. One approach 
that has been employed to circumvent these limitations 
is the use of very narrow range immobilized pH gradient 
strips for the first-dimension separation of 2-DE [48]. 
Since only those proteins which focus within the narrow 
range will enter the second dimension of separation, a 
much higher sample loading within the desired range is 
possible. This, in turn, can lead to the visualization and 
identification of less abundant proteins. 
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Figure 6. Calculated codon bias values for yeast proteins. (A) Distribu- 
tion of calculated values for the entire yeast proteome. (B) Distribu- 
tion of calculated values for the subset of 80 identified proteins also 
shown in Figs. 1 and 5. Further details of experimental procedures are 
included in S, P. Gygi et al (submitted). 



4 Utility of proteome analysis for biological 
research 

For the success of proteomics as a mainstream approach 
to the analysis of biological systems it is essential to 
define how proteome analysis and biological research 
projects intersect. Without a clear plan for the implemen- 
tation of proteome-type approaches into biological re- 
search projects the full impact of the technology can not 
be realized. The literature indicates that proteome anal- 
ysis is used both as a database/data archive, and as a bio- 
logical assay or biological research tool. 

4.1 The proteome as a database 

The use of proteomics as a database or data archive 
essentially entails an attempt to identify all the proteins 
in a cell or species and to annotate each protein with the 
known biological information that is relevant for each 
protein. The level of annotation can, of course, be exten- 
sive. The most common implementation of this idea is 
the separation of proteins by high resolution 2-DE, the 
identification of each detected protein spot and the 
annotation of the protein spots in a 2-DE gel database 
format. This approach is complicated by the fact that it is 
difficult to precisely define a proteome and to decide 
which proteome should be represented in the database. 
In contrast to the genome of a species, which is essen- 
tially static, the proteome is highly dynamic. Processes 
such as differentiation, cell activation and disease can all 
significantly change the proteome of a species. This is 
illustrated in Fig. 7 r The figure shows two high-resolu- 



tion 2-DE maps of proteins isolated from rat serum. 
Fig. 7A is from the serum of normal rats, while Fig. 7B 
is from the serum of rats in acute-phase serum after 
prior treatment with an inflammation-causing agent [49]. 
It is obvious that the protein patterns are significantly 
different in several areas, raising the question of exactly 
which proteome is being described. 

Therefore, a comprehensive proteome database of a spe- 
cies or cell type needs to contain all of the parameters 
which describe the state and the type of the cells from 
which the proteins were extracted as well as the software 
tools to search the database with queries which reflect 
the dynamics of biological systems. A comprehensive 
proteome database should be capable of quantitatively 
describing the fate of each protein if specific systems 
and pathways are activated in the cell. Specifically, the 
quantity, the degree of modification, the subcellular loca- 
tion and the nature of molecules specifically interacting 
with a protein as well as the rate of change of these 
variables should be described. Using these admittedly 
stringent criteria, there is currently no cornlete proteome 
database. A number of such databases are, however, in 
the process of being constructed. The most advanced 
among them, in our opinion, are the yeast protein data- 
base YPD [50] (accessible at http://www.ypd.com) and 
the human 2D-PAGE databases of the Danish Centre 
for Human Genome Research [12] (accessible at http:// 
biobase.dk/cgi-bin/celis). While neither can be con- 
sidered complete as not all of the potential gene pro- 
ducts are identified, both contain extensive annotation 
of supplemental information for many of the spots 
which are positively identified in reference samples. 



4.2 The proteome as a biological assay 

The use of proteome analysis as a biological assay or 
research tool represents an alternative approach to inte- 
grating biology with proteomics. To investigate the state 
of a system, samples are subjected to a specific proceess 
that allows the quantitative or qualitative measurement 
of some of the variables which describe the system. In 
typical biochemical assays one variable (e.g., enzyme 
activity) of a single component (e.g., a particular en- 
zyme) is measured. Using proteomics as an assay, mul- 
tiple variables (e.g., expression level, rate of synthesis, 
phosphorylation state, etc.) are measured concurrently 
on many (ideally all) of the proteins in a sample. The 
use of proteomics as an assay is a less far-reaching prop- 
osition than the construction of a comprehensive pro- 
teome database. It does, however, represent a pragmatic 
approach which can be adapted to investigate specific 
systems and pathways, as long as the interpretation of 
the results takes into account that with current technol- 
ogy not all of the variables which describe the system 
can be observed (see Section 3.4). 

A common implementation of proteome analysis as a 
biological assay is when a 2-DE protein pattern gener- 
ated from the analysis of an experimental sample is 
compared to an array of reference patterns representing 
different states of the system under investigation. The 
state of the experimental system at the time the sample 
was generated is therefore determined by the quantita- 
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tive comparative analysis of hundreds to a few thousand 
proteins. Comparative analysis of the 2-DE patterns fur- 
thermore highlights quantitative and qualitative differ* 
ences in the protein profiles which correlate with the 
state of the system. For this type of analysis it is not 
essential that all the proteins are identified or even visu- 



alized, although the results become more informative as 
more proteins are compared- It is obvious, however, that 
the possibility to identify any protein deemed character- 
istic for a particular state dramatically enhances this 
approach by opening up new avenues for experimenta- 
tion. 
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Figure 7. High resolution 2-DE map or proteins isolated from rat serum with or without prior exposure to an inflam- 
mation-causing agent. (A) normal rat serum, (B) acute-phase serum from rats which had previously been exposed to 
an inflammation-causing agent. The first dimension of separation is an IPG from pH 4-10, and the second dimen- 
sion is a 7.5-17.5OT gradient SDS-PAGE gel. Proteins were visualized by staining with amido black. Further details 
of experimental procedures are included in (14, 49]. 
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Proteome analysis as a biological assay has been success- 
fully used in the field of toxicology, to characterize 
disease states or to study differential activation of cells. 
The approach is limited, of course, by the fact that only 
the visible protein spots are included in the assay, and it 
is well known that a substantial but far from complete 
fraction of cellular proteins are detected if a total cell 
lysate is separated by 2-DE. Proteins may not be 
detected in 2-DE gels because they are not abundant 
enough to be visualized by the detection method used, 
because they do not migrate within the boundaries (size, 
pi) resolved by the gel, because they are not soluble 
under the conditions used, or for other reasons. 

A different way to use proteome analysis as a biological 
assay to define the state of a biological system is to take 
advantage of the wealth of information contained in 
2-DE protein patterns. 2-DE is referred to as two-dimen- 
sional because of the electrophoretic mobility and the 
isoelectric points which define the position of each pro- 
tein in a 2-DE pattern. In addition to the two dimen- 
sions used to generate the protein patterns, a number of 
additional data dimensions are contained in the protein 
patterns. Some of these dimensions such as protein 
expression level, phosphorylation state, subcellular loca- 
tion, association with other proteins, rate of synthesis or 
degradation indicate the activity state of a protein or a 
biological system. Comparative analysis of 2-DE protein 
patterns representing difFerent states is therefore ideally 
suited for the detection, identification and analysis of 
suitable markers. Once again it must be emphasized that 
in this type of experiment only a fraction of the cellular 
proteins is analyzed. Since many regulatory proteins are 
of low abundance, this limitation is a concern, particu- 
larly in cases in which regulatory pathways are being 
investigated. 

5 Concluding remarks 

In this report we have addressed three main issues 
related to proteome analysis. First, we have discussed 
the rationale for studying proteomes. Second, we have 
assessed the technical feasibility of analyzing proteomes 
and described current proteome technology, and third, 
we have analyzed the utility of proteome analysis for bio- 
logical research. It is apparent that proteome analysis is 
an essential tool in the analysis of biological systems. 
The multi-level control of protein synthesis and degrada- 
tion in cells means that only the direct analysis of 
mature protein products can reveal their correct identi- 
ties, their relevant state of modification and/or associa- 
tion and their amounts. Recently developed methods 
have enabled the identification of proteins at ever- 
increasing sensitivity levels and at a high level of auto- 
mation of the analytical processes. A number of tech- 
nical challenges, however, remain. While it is currently 
possible to identify essentially any protein spots that can 
be visualized by common staining methods, it is ap- 
parent that without prior enrichment only a relatively 
small and highly selected population of long-lived, 
highly expressed proteins is observed. There are many 
more proteins in a given cell which are not visualized by 
such methods. Frequently it is the low abundance pro- 
teins that execute key regulatory functions. 



We have outlined the two principal ways proteome anal- 
ysis is currently being used to intersect with biological 
research projects: the proteome as a database or data 
archive and proteome analysis as a biological assay. Both 
approaches have in common that at present they are con- 
ceptually and technically limited. Current proteome data- 
bases typically are limited to one cell type and one state 
of a cell and therefore do not account for the dynamics 
of biological systems. The use of proteome analysis as a 
biological assay can provide a wealth of information, but 
it is limited to the proteins detected and is therefore not 
truly proteome-wide. These limitations in proteomics are 
to a large extent a reflection of the fact that proteins in 
their fully processed form cannot easily be amplified and 
are therefore difficult to isolate in amounts sufficient for 
analysis or experimentation. The fact that to date no 
complete proteome has been described further attests to 
these difficulties. With continued rapid progress in pro- 
tein analysis technology, however, we anticipate that the 
goal of complete proteome analysis will eventually 
become attainable. 
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High-throughput technologies, such as proteomic screening and DNA micro-arrays, produce vast 
amounts of data requiring comprehensive analytical methods to decipher the biologically relevant 
results. One approach would be to manually search the biomedical literature; however, this would be 
an arduous task. We developed an automated literature-mining tool, termed MedGene, which 
comprehensively summarizes and estimates the relative strengths of all human gene-disease 
relationships in Medline. Using MedGene, we analyzed a novel micro-array expression data's* 
comparing breast cancer and normal breast tissue in the context of existing knowledge. We found no 
correlation between the strength of the literature association and the magnitude of the difference in 
expression level when considering changes as high as 5-fold; however, a significant correlation was 
observed (r - 0.41; p = 0.05) among genes showing an expression difference of 10-fold or more 
Interestingly, this only held true for estrogen receptor (ER) positive tumors, not ER negative. MedGene 
identified a set of relatively understudied, yet highly expressed genes in ER negative tumors worthy of 
further examination. y 

Keywords: bioinformatics • micro-array . text mining . gene-disease association . breast cancer 



Introduction 

At its current pace, the accumulation of biomedical literature 
outpaces the ability of most researchers and clinicians to stay 
abreast of their own immediate fields, let alone cover a broader 
range of topics. For example, to follow a single disease, e.g.. 
breast cancer, a researcher would have had to scan 130 different 
journals and read 27 papers per day in 1999. 1 This problem is 
accentuated with high-throughput technologies such as DNA 
micro-arrays and proteomics. which require the analysis of 
large datasets involving thousands of genes, many of which are 
unfamiliar to a particular researcher. In any microarray experi- 
ment, thousands of genes may demonstrate statistically sig- 
nificant expression changes, but only a fraction of these may 
be relevant to the study. The ability to interpret these datasets 
would be enhanced if they could be compared to a compre- 
hensive summary of what is known about all genes. Thus, there 
is a need to summarize existing knowledge in a format that 
allows for the rapid analysis of associations between genes and 
diseases or other specific biological concepts. 

One solution to this problem is to compile structured digital 
resources, such as the Breast Cancer Gene Database 1 and the 
Tumor Gene Database. 2 However, as these resources are hand- 
curated, the labor-intensive review process becomes a rate- 
limiting step in the growth of the database. As a result, these 
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databases have a limited scale and the genes are not selected 
in a systematic fashion. 

An alternative approach is automated text mining; a method 
which involves automated information extraction by searching 
documents for text strings and analyzing their frequency and 
context. This approach has been used successfully in several 
instances for biological applications. In most cases, it has been 
applied to extract information about the relationships or 
interactions that proteins or genes have with one another, in 
the literature or by functional annotation. 5 " 7 Thus far. few 
publication have applied text-mining to examine the global 
relationships between genes and diseases. Perez-Iratxeta et al. 
automatically examined the GO (Gene Ontology) annotation 
of genes and their predicted chromosomal locations in order 
to identify genes linked to inherited disorders. 8 

To obtain a more global understanding of disease develop- 
ment, it would be valuable to Incorporate information regarding 
all possible gene-disease relationships, including biochemical, 
physiological, pharmacological, epidemiological, as well as 
genetic. This information would enable comprehensive com- 
parisons between large experimental datasets and existing 
knowledge in the literature. This would accomplish two things. 
First, it would serve to validate experiments by demonstrating 
that known responses occur as predicted. Second, it would 
rapidly highlight which genes are corroborated by the literature 
and which genes are novel in a given context We have utilized 
a computational approach to literature mining to produce a 
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comprehensive set of gene-disease relationships. In addition, 
we have developed a novel approach to assess the strength of 
each association based on the frequency of citation and co- 
citation. We applied this tool to help interpret the data from a 
large micro-array gene expression experiment comparing 
normal and cancerous breast tissue. 

Methods 

MedGene Database. MedGene is a relational database, stor- 
ing disease and gene information from NCBI, text mining re- 
sults, statistical scores, and hyperlinks to the primary lit- 
erature. MedGene has a web-based user interface for users to 
query the database (http://hipseq.med.harvard.edu/MedGene/). 

Text Mining Algorithms. MeSH files were downloaded from 
the MeSH web site at NLM (Nation Library of Medicine) (http:// 
www.nlm.nih.gov/mesh/meshhome.html) and human disease 
categories were selected. LocusLink files were downloaded from 
the LocusLink web site at NCBI (http://www.ncbi.nih.gov/ 
LocusLink/). Official/preferred gene symbol, official/preferred 
gene name, and gene alternative symbols and names, all 
relevant annotations and URLs for each LocusLink record, were 
collected. Gene search terms were used for literature searching 
and included all qualified gene names, gene symbols, and gene 
family terms. Primary gene keys, predominanUy qualified gene 
family terms and gene official/preferred symbols, were used 
to index Medline records. If the official/preferred gene symbols 
did not meet the standards to be an index, then qualified gene 
official/preferred names were used. A local copy of Medline 
records (up to July. 2002) was pre-selected. 

A JAVA module examined the MeSH terms and then indexed 
each Medline record with the appropriate disease terms. A 
separate JAVA module was used to examine the titles and 
abstracts for gene search terms and then to index the gene- 
related Medline records with the relevant primary gene key(s). 

Statistical Methods. For every gene and disease pair, we 
counted records that were indexed for both gene and disease 
(double positive hits), for disease only (disease single hits), for 
gene only (gene single hits), and for neither gene nor disease 
(double negative hits) to generate a 2 x 2 contingency table. 
On the basis of the contingency table-framework, we applied 
different statistical methods to estimate the strength of gene- 
disease relationships and evaluated the results. These methods 
included chi-square analysis. Fisher's exact probabilities, rela- 
tive risk of gene, and relative risk of disease 16 (http:// 
hipseq.med.harvard.edu/MedGene/). In addition, we computed 
the "product of frequency", which is the product of the 
proportion of disease/gene double hits to disease single hits 
and the proportion of disease/gene double hits to gene single 
hits. To obtain a normal distribution, we transformed all the 
statistical scores using the natural logarithm. We selected the 
log of the product of frequency (LPF) to validate MedGene and 
to use for the analysis with the micro-array data. Spearman 
rank-correlation coefficients were used to assess the linear 
relationship between LPF and micro-array fold change in 
expression level. 

Global Analysis. Diseases with at least 50 related genes were 
selected for clustering analysis, and the LPF scores were 
normalized with total score for each disease. Hierarchical 
clustering was done with the "Cluster" software and the 
clustering result was visualized using TreeViewer" (http:// 
rana.lbl.gov/EisenSoftware.htm). 
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Breast Tissue Micro-Arrays. Eighty-nine breast cancer 
samples (79% ER-positive) and 7 normal breast tissue samples 
were selected from the Harvard Breast SPORE frozen tissue 
repository and were representative of the spectrum of histo- 
logical types, grades, and hormone receptor immuno-pheno- 
types of breast cancer. Biotinylated cRNA, generated from the 
total RNA extracted from the bulk tumor, was hybridized to 
Affymetrix U95A oligo-nucleotide micro-arrays. These micro- 
arrays consist of 12 400 probes, which represent approximately 
9000 genes. Raw expression values were obtained using GENE- 
CHIP software from Affymetrix. and then further analyzed using 
the DNA-Chip Analyzer (dChip) custom software. 

Results 

Automated Indexing of Medline Records by Disease and 
Gene. To study the gene-disease associations in the literature, 
we first compiled complete lists for human diseases and human 
genes. To index all Medline records that were relevant to 
human diseases, the Medical Subject Heading (MeSH) index 
of Medline records was utilized. MeSH is a controlled medical 
vocabulary from the National Library of Medicine and consists 
of a set of terms or subject headings that are arranged in both 
an alphabetic and an hierarchical structure. Medline records 
are reviewed manually and MeSH terms are added to each with 
software assistance. 910 Twenty-three human disease category 
headings along with all of their child terms (see the Supporting 
Information. Supplemental Table 1, or visit http://hipseq. 
med.harvard.edu/MedGene/publication/s_Table l.hrml) were 
selected from the 2002 MeSH index creating a list of 4033 
human diseases. 

No index comparable to the MeSH index exists for genes, 
and thus, it was necessary to apply a string search algorithm 
for gene names or symbols found in Medline text. A complete 
list of genes, gene names, gene symbols, and frequendy used 
synonyms were Collected from the LocusLink database at 
NCBI."-" which contains 53 259 independent records keyed 
by an official gene symbol or name (June 18 th , 2002). For the 
purposes of this study, no distinction was made between genes 
and their gene products. Authors often use the same name for 
both, differentiating the two only by the use of italics, if at all. 
For the intended use of this study, this lack of distinction is 
unlikely to have a large effect and may in fact be beneficial. 

Initial attempts to search the literature using these lists 
revealed several sources of false positives and false negatives 
(Table 1). False positives primarily arose when the searched 
term had other meanings, whereas false negatives arose from 
syntax discrepancies necessitating the development of filters 
to reduce these errors. The syntax issues were readily handled 
by including alternate syntax forms in the search terms. The 
false positive cases, caused by duplicative and unrelated 
meanings for the terms, were more difficult to manage. Where 
possible, case sensitive string mapping reduced inappropriate 
citations. In many cases, however, this was not sufficient and 
the terms had to be eliminated entirely, thereby reducing the 
false positive rate but unavoidably under-representing some 
genes. 

For the purposes of data tracking, a primary gene key was 
selected to represent all synonyms that correspond to each 
gene. Medline records were indexed with a primary gene key 
when any synonym for that key was found in the title or 
abstract Case-insensitive string mapping was used for all 
searches except as noted above. No additional weight was 
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Table 1. Systematic Sources of False Positives and False Negatives in Unfiltered Data* 



source of error 



eiTor type 



example 



gene symbol/name false positive 

is not unique 



gene symbol is false positive 

unrelated abbreviation 

gene symbol/name false positive 

has language meaning 

nonstandard syntax false negative 

unofficial gene name/symbol false negative 

nonspecified gene name false negative 



MAG— myelin 

associated glycoprotein 
MAG— malignancy-associated 

protein 

PA— pallid homoiogue (mouse), 

pallidin (also abbrev. for Pennsylvania) 

VMS-Wiskott-Aldrich Syndrome 
(also the word "was") 

BAG-1 instead of BAG 1 

P53 instead of TP53 

estrogen receptor instead of 
Estrogen receptor 1 



research articles 



filter solution 



eliminate this term 



eliminate this term 

case-sensitive string search 

add dash term 

add all gene nicknames 

gdd family stem term 



> In preliminary studies Medline was searched for co-occurrence of genes and diseases and the resulting output was evaluated to identify error sources that 
were amenable to global filters. Each error source is categorized by the type of error it causes: false positives are suggested re tak^Kt^SS^S and 
false negatives are real relationships that are underrepresented. The filter solutions used are Indicated. Note that in some casXKShZ ^einnioduces 
error. In general, error rates maximized sensitivity, even at the expense of specificity If needed. soiuuon risen introduces 



added for multiple occurrences of a term or the co-occurrence 
of multiple synonyms for the same gene key. 

Medline records were searched with alt qualified gene 
identifiers, such as the official/preferred gene symbol, the 
official/preferred gene name, all gene nicknames and all syntax 
variants. In situations where there are several members of a 
gene family or splice variants, some authors prefer to use a 
shortened gene family name, e.g., estrogen receptor instead of 
estrogen receptor I (ESR1), creating a source of false negatives. 
For this reason, gene family stem terms were created for all 
genes that have an alpha or numerical suffix (e.g., IL2RA, TGFfi, 
ESRi, etc.) and then used to search the literature. The family 
stem terms were handled separately from the specific gene 
names so that it would be clear when linkages were made to 
the gene family versus a specific member in that family. 

To improve performance and accuracy, some pre-selection 
was applied to the records that were scanned. First, review 
articles were eliminated to avoid redundant treatment of 
citations. Second, non-English journals were removed because 
the natural language filters were only relevant to English 
publications. Finally, journals unlikely to contain primary data 
about gene-disease relationships were also removed (e.g.. Int. 
I Health Educ, Bedside Nurse, and /. Health Econ.). Together, 
these filters reduced the 12 198 221 Medline publications (July 
2002) by 37%. 

Ranking the Relative Strengths of Gene-Disease Associa- 
tions. In total, there were 618 708 gene-disease co-citations, 
in which 16% (8297) of all studied genes had been associated 
to a disease and 96% (3875) of all diseases had been associated 
to at least one gene. To rank the relative strengths of gene 
disease relationships, we tested several different statistical 
methods and examined the results. With the exception of the 
relative risk estimates, the methods provided similar results 
with respect to the rank order of the gene-disease association 
strengths. However, after comparing the results to other 
databases and after consulting disease experts, the log of the 
product of frequency (LPF) was selected for further analysis 
because it gave the best results overall. 

Validation of MedGene. In developing this tool, it was 
important to minimize the number of missed genes (false 
negatives) and miscalled genes (false positives). However, in 
situations when these goals were in conflict, Inclusiveness was 
prioritized. To determine the false negative rate In MedGene, 
breast cancer was used as a test case because it was associated 
with more genes than any other human disease and because 




Figure 1. Estimation of the false negative rate by comparison 
with hand-curated databases. The breast cancer-related genes 
identified by MedGene were compared with those listed in 
several other databases including the Tumor Gene Database 
(TGD), 2 the Breast Cancer Gene Database(BCG), 1 GeneCards 
(GC) 17 and Swissprot. 18 Genes were considered false negatives 
if they were represented in at least one of these other databases 
and not in MedGene and their link to breast cancer was sup- 
ported by at least one literature reference. All literature references 
were verified by manual review to confirm their validity. The 
number of genes in each database or shared by more than one 
database is indicated. The false negative rate was calculated by 
genes missed at MedGene (26)/total number of nonoverlapping 
genes in other databases (285). 

there were several public databases that link genes to breast 
cancer. We compared die list of breast cancer-related genes 
from MedGene to these databases, illustrated in Figure 1. 
Among the 285 distinct breast cancer- related genes that were 
supported by at least one literature citation in these hand- 
curated databases, 26 were absent from MedGene, suggesting 
a false negative rate of approximately 9%. To determine why 
these were missed, all literature references for these genes (80 
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papers) were reviewed manually (see the Supporting Informa- 
tion, Supplemental Table 2, or visit http://hipseq.med. 
harvard.edu/MedGene/publication/s_Table 2.html). Among 
these papers, most false negatives were caused by nonstandard 
gene terms or gene terms eliminated by our specificity filters. 
Few genes were missed because they were only mentioned in 
review papers (0,4%) or they appeared only in the body of the 
manuscript but not the abstract or title (1.1%). Of note, 
MedGene identified approximately 2000 additional breast 
cancer-related genes not listed in any other database. 

To assess the false positive error rate, two complementary 
approaches were used: a detailed analysis of one disease and 
a global examination of 1000 diseases. The detailed approach 
examined the false positive error rate and its sources, whereas 
the global approach tested whether the overall results made 
biomedical sense. 

Using the LPF, 1467 genes related to prostate cancer were 
assembled in rank order. We then retrieved approximately 300 
Medline records each for the highest ranked 100 and the lowest 
ranked 200 genes and manually reviewed the tides and 
abstracts to determine the verity of the association. Nearly 80% 
of the highest ranked 100 genes fell into one of the five 
categories that reflect meaningful gene-disease relationships 
(see the Supporting Information, Supplemental Table 3, or visit 
http://hipseq.med.harvard.edu/MedGene/publication/ 
sJTable 3,html). Among the lowest ranked 200 genes, ap- 
proximately 70% reflected true relationships. Of the 600 records 
reviewed, there were only two in which the association between 
the gene and the disease was described as negative. Both were 
genes with very low scores. In both cases, the authors did not 
argue the absence of any relationship, but rather that a 
particular feature of the gene or protein was not shown to be 
related to human prostate cancer. 1314 

The coincidence of some gene symbols with medical ab- 
breviations, chemical abbreviations and biological abbrevia- 
tions resulted in most of the false positives (see the Supporting 
Information, Supplemental Table 4, or visit http://hipse- 
q.med.harvard.edu/MedGene/publication/s_Table 4.html), em- 
phasizing the importance of the filters that were added in the 
search algorithm (Table 1). Without the filters, the false positive 
rate more than doubled, and the false negative rate rose 
dramatically (data not shown). For example, among the papers 
about breast cancer, there were only 12 Medline records that 
referred to ESR1 and 10 to ESR2, whereas almost 2000 papers 
mentioned estrogen receptor without specifying ESR1 or ESR2, 
this latter group was detected by the family stem term filter. 

To further validate these results, a global analysis of the gene- 
disease relationships described by MedGene was performed. 
For this experiment, it was reasoned that the more closely 
related the diseases are to one another, the more they will be 
related to the same gene sets. Thus, if the relationships defined 
by MedGene accurately reflected the literature, then an unsu- 
pervised hierarchical clustering of the gene data should group 
diseases in a manner consistent with common medical think- 
ing. Conversely, if the clustered diseases do not make sense 
biologically or medically, it may reflect excessive false positives, 
false negatives, or inappropriate scoring of the data. 

To execute this experiment, the gene sets and the corre- 
sponding LPF values for 1000 randomly selected diseases (each 
with at least 50 gene relationships) were used as a dataset for 
clustering the diseases. A review of the results showed that the 
resulting disease clusters were indeed logical based upon 
common medical knowledge (see the Supporting Information. 
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Supplemental Figure 1, or visit http://hipseq.med.harvard.edu/ 
MedGene/publication/s_Figure l.html). For example, in one 
such cluster shown in Figure 2, diabetes and its complications 
grouped together and were also closely linked to diseases 
associated with starvation states. 

The number of genes associated with a given disease can 
be estimated by adjusting the MedGene number up by the false 
negative rate (-9%) and down by the false positive rate (-26% 
on average), Using this, the average disease has 103.7 ± 45,3 
(mean ± s.d.) genes associated with it, although the range is 
quite broad with 2359 genes related to breast cancer, 2122 
genes related to lung cancer and no genes related to a number 
of diseases. 

Applying MedGene to the Analysis of Large Datasets. Access 
to a comprehensive summary of the genes linked to human 
diseases provided an opportunity to analyze data obtained from 
a high-throughput experiment. We compared the MedGene 
breast cancer gene list to a gene expression data set generated 
from a micro-array analysis comparing breast cancer and 
normal breast tissue samples. Micro-array analysis identified 
2286 genes that had greater than a 1-fold difference in mean 
expression level between breast cancer samples and normal 
breast samples. Using MedGene, we sorted the 2286 genes into 
four classes: 555 genes directly linked to breast cancer in the 
literature by gene term search (first-degree association by gene 
name); 328 genes directly linked by family term search (first- 
degree association by family term); 1021 genes linked to breast 
cancer only through other breast cancer genes (second-degree 
association); and 505 genes not previously associated with 
breast cancer. (See the Supporting Information, Supplemental 
Figure 2, or visit http://hipseq.med.harvard.edu/MedGene/ 
publication/sJFigure 2.html.) Among the 505 previously un- 
related genes. 467 were either newly identified genes or genes 
that had not previously been associated with any disease. 
Among the remaining 38 genes, 9 had been related to other 
cancers, specifically esophageal, colon, uterine, skin, and cervix. 

To determine whether the genes highlighted by the micro- 
array analysis were more likely to have been previously linked 
to breast cancer in the literature, we created a two-dimensional 
plot of the fold change of expression level between breast 
cancer and normal tissue versus the literature score (LPF) 
(Figure 3A), There was a broad spread of expression changes 
among the genes directly linked to breast cancer ranging from 
less than 1-fold change (68%) to over 40-fold (0.3%). Notably, 
the majority of genes with greater than 10-fold expression 
changes were linked to breast cancer by first-degree associa- 
tion. 

Among all 754 genes directly Unked to breast cancer in the 
literature, there was no correlation between LPF and micro- 
array fold change (r = 0.018, p-value = 0.62). However, when 
we stratified the analysis based on the magnitude of the fold 
change, we observed an increasing trend in correlation (Figure 
3B) suggesting that genes with a more substantial change in 
expression level were more likely to have a stronger association 
in the literature. For genes that had 10-fold change or more in 
expression level, the correlation increased to 0 41 (p-value = 
0.05). 

When we evaluated the micro-array data separately for ER 
positive and ER negative tumors, the trend in correlation 
between fold change and literature score was highly dependent 
on estrogen receptor status. Interestingly, there was a similar 
trend in correlation for ER positive tumors, but no trend in 
correlation for ER negative tumors. 
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Figure 2. Global validation by clustering analysis. 2(A). The gene sets and the corresponding LPF values for 1000 diseases, each with 
at least 50 gene relationships, were used in an unsupervised clustering of the diseases based on the gene patterns associated with 
them. A sample of the data is shown here. 2(B). One of the resulting clusters is shown that corresponds to blood sugar states. Diabetes 
terms (above the line) and starvation states, terms (under the line) clustered together. Within these groups, there is also clustering of 
diabetic small vessel complications, altered serum chemistries, nutritional disorders, etc.(Supplemental Figure 1: http://hipseq.med. 
harvard.edu/MedGene/publication/s_Figure 1 .html). 

Finally, to validate our findings, we computed similar cor- disease unrelated to breast cancer. As expected, we did not 
relations between the breast cancer expression data and observe an increasing trend in correlation for hypertenr 
LPF scores generated by MedGene for hypertension, a sion. 

Journal of Proteome Research * Vol. 2, No. 4, 2003 409 



research articles 
A rr— /A 



Hu et al. 



OL 



2> 

8 

a; 
=> 
±* 

? 



■ ■ ( 



courtf 




-/A 



I 

couiai ; 



-//- 



0>l!8ttW 0«M HOC** ■aiMiago IEWX 



■ ■-■-I — 



i 



Microarray Fold Change (Cancer/Normal) 



B 





0.8 ■ 




0.7- 


««-» 


0.6 - 


§ 


0.5 • 






d) 




o 
O 


0.4 • 


c 




€ 


0.3- 


rrel£ 


0.2 ■ 


o 
O 


0.1 






-0.1 5 




Microarray Fold Change 



Figure 3. Relationship between literature score and functional data for breast cancer. 3A. The data from an exoression analvste of 
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Discussion 

The Human Genome Project heralded a new era in biological 
research where the emphasis on understanding specific path- 
ways has expanded to global studies of genomic organization 
and biological systems. High-throughput technologies can 
provide novel insight into comprehensive biological function 
but also introduces new challenges. The utility of these 
technologies is limited to the ability to generate, analyze, and 
interpret large gene lists. MedGene, a relational database 
derived by mining the information in Medline, was created to 
address this need. MedGene users can query for a rank-ordered 
list of human gene-disease relationships fTable 2) for one or 
more diseases. Each entry is hyperlinked to the original papers 
supporting each association and to other relevant databases. 

MedGene is an innovative extension of previous text mining 
approaches. Perez-Iratxeta et al. used the GO annotation and 
their chromosomal locations to predict genes that may con- 
tribute to Inherited disorders, 8 MedGene takes a broader view 
and Includes all diseases and all possible gene-disease relation- 
ships. Furthermore, MedGene utilizes co-citation to indicate a 
relationship rather than GO annotation, which is limited to the 
subset of genes that have GO annotation. Our approach is 
complementary to that taken by Chaussabel and Sher, who 
used the frequency of co-cited terms to cluster genes into a 
hierarchy of gene-gene relationships. 6 

A unique aspect of this tool is the ability to assess the relative 
strengths of gene-disease relationships based on the frequency 
of both co-citation and single citation. This presupposes that 
most co-citations describe a positive association, often referred 
to as publication bias 15 and is supported by our observations 



that negative associations are rare (Supplemental Table 3: 
http://hipseq.med.harvard.edu/MedGene/publication/s_Ta- 
ble 3.html). Of course, relationships established by frequency 
of co-citation do not necessarily represent a true biological link; 
however, it is strong evidence to support a true relationship. 

Another important feature of MedGene is the implementa- 
tion of software filters that substantially reduced the error rate. 
We estimate that less than 10% of all associations were missed 
and at least 70% of even the weakest associations were real. 
For this study, all of the filters that we applied were general 
ones, e.g., expanding the list of all gene names to address the 
different syntax forms used by different journals, eliminating 
gene names that correspond to common English words, etc. 
The majority of the remaining search term ambiguities were 
idiosyncratic and difficult to identify systematically without 
causing a significant rise in false negatives. Alternative ap- 
proaches, such as the examination of the nearest neighbor 
terms, need to be considered to further reduce the false positive 
rate. 

It is not uncommon to see expression changes in micro- 
array experiments as small as 2-fold reported in the literature. 
Even when these expression changes are statistically significant, 
it is not always clear if they are biologically meaningful. When 
comparing expression levels of disease to normal tissue, one 
expects an enrichment of known disease-related genes to 
appear in the altered expression group. MedGene provided a 
unique opportunity to test this notion in the context of existing 
knowledge on a novel breast cancer micro-array dataset. For 
genes displaying a 5-fold change or less in tumors compared 
to normal, there was no evidence of a correlation between 
altered gene expression and a known role in the disease. This 
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Table 3. Genes with Large Expression Changes in ER— but 
Not in ER+ Breast Tumors 



gene symbol 



fold change (ER+) 



fold change (ER-) 



KRTHB1 

BRS3 

DKK1 

ZIC1 

TLRl 

KIAA0680 

CDKN3 

EBI2 

GZMB 

STK18 

GPR49 

MYO10 

LAD] 

POLE2 

HMG4 

BCL2L11 

LRP8 

CCNB2 

CCNE2 

FGB 

KNSL6 

HIF5 . 

SERPINH2 

YAPl 
LPHB 

TCEA2 

TFF1 

C0L17A1 

POPS 

BPAG1 

PDZK1 

VEGFC 

MUC6 

SERPINA5 

MEIS1 

CA12 



1.0 
1.2 
1.2 
1.9 
1.0 
2.6 
1.0 
4.0 
3.8 
4.7 
1.0 
1.6 
-1.0 
4.2 
4.4 
-1.2 
2.9 
1.0 
4.0 
-4.3 
2.9 
3.0 
4.6 
1.0 
-1.3 
-1.1 
1.3 
-4.1 
11 
-4.6 
-1.1 
-2.8 
-1.4 
-1.0 
-1.6 
2.4 



610.8 
89.4 
69.8 
59.6 
38.5 
33.2 
30.6 
27.9 
21.9 
18.6 
14.6 
14.4 
13.5 
13.0 
12.9 
12.3 
12.2 
11.8 
11.6 
11.1 
10.9 
10.2 
10.2 
10.0 
-10.4 
-10.8 
-11.4 
-15.7 
-16.2 
-22.3 
-36.8 
-51.5 
-64.9 
-83.1 
-85.9 
-150.3 



Table 3. MedCene identified a set of relatively understudied vet hiehlv 
jessed genes in ER negative, but not ER positive breast tumors. /UI of 
these genes have either never been co-cited with breast cancer or have a 
weak association except those marked with an *. 

reflects the many genes whose role in breast cancer may not 
involve large changes in expression in sporadic tumors (e.g.. 
BRCA1 and BRCA2) and genes whose modest changes in 
expression may be unrelated to the disease. Strikingly, among 
genes with a 10-fold change or more in expression level, there 
was a strong and significant correlation between expression 
level and a published role in the disease, providing the first 
global validation of the micro-array approach to identifying 
disease-specific genes. 

The results derived from MedGene have two implications. 
First, a careful hunt for corroborating evidence of a role in 
breast cancer should precede any further study of genes with 
less than 5-fold expression level changes. Second, any genes 
with 10-fold changes or more are likely to be related to breast 
cancer and warrant attention. It is likely that this threshold will 
change depending on the disease as well as the experiment. 

Interestingly, the observed correlation was only found among 
ER-positive tumors, not ER-negative. This may reflect a bias 
in the literature to study the more prevalent type of tumor in 
the population. Furthermore, this emphasizes that caution 
must be taken when interpreting experiments that may contain 
subpopulations that behave very differently. The MedGene 
approach identified a set of relatively understudied, yet highly 
expressed genes in ER-negative tumors that are worthy of 
further examination (Table 3). 
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In conclusion, we have developed an automated method of 
summarizing and organizing the vast biomedical literature. To 
our knowledge, the resulting database is the most comprehen- 
sive and accurate of its kind. By generating a score that reflects 
the strength of the association, it provides an important tool 
for the rapid and flexible analysis of large datasets from various 
high-throughput screening experiments. Furthermore, it can 
be used for selecting subsets of genes for functional studies, 
for building disease-specific arrays, for looking at genes com- 
mon to multiple diseases and various other high- throughput 
applications. In the future, it will be possible to enhance the 
utility of the MedGene database by building links between 
genes and other MeSH terms as well as other biological 
processes and concepts, such as cell division and responses to 
small molecules. 
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sive catalog was prepared of the gene 
expression changes that occur during 
morphologic maturation. To do this, 3'- 
end differential display, oligonucleotide 
chip array hybridization, and 2-dlmen- 
sional protein electrophoresis were used. 
A large number of genes whose mRNA 
levels are modulated during differentia- 
tion of MPRO cells were Identified. The 
results suggest the involvement of sev- 
eral transcription regulatory factors not 



previously Implicated In this process, but 
they also emphasize the importance of 
events other than the production of new 
transcription factors. Furthermore, gene 
expression patterns were compared at 
the level of mRNA and protein, and the 
correlation between 2 parameters was 
studied. (Blood. 2001;98:513-524) 
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Although the mature neutrophil Is one of 
the better characterized mammalian cell 
types, the mechanisms of myeloid differ- 
entiation are incompletely understood at 
the molecular level. A mouse promyelo- 
cyte cell line (MPRO), derived from mu- 
rine bone marrow cells and arrested devel- 
opmentally by a dominant-negative 
retinole acid receptor, morphologically 
differentiates to mature neutrophils In the 
presence of 10 nM retinoic acid. An exten- 

Introduction 

Studies of normal myeloid maturation from many laboratories have 
identified genes that may play critical roles in myeloid differentia- 
tion. 1 " 4 Current srudies suggest that* these events are dependent on a 
cascade of molecular changes that involve complex modulation of 
mRNA transcription. Furthermore, studies of acute leukemia have 
suggested that the disease arises from the accumulation of myeloid 
precursors arrested at early stages of differentiation and associated, 
in many cases, with chromosomal rearrangements that alter the 
■structure of specific transcription factors. 5 Nevertheless, the molecu- 
lar events underlying the production of mature myeloid cells arc 
not well understood and appear to use interacting pathways and 
networks, the elucidation of which requires an extensive descrip- 
tion of the molecular components available to the myeloid cell. 

An extensive body of information is accumulating with respect 
to gene expression profiles of mammalian celts, However, much of 
the information available in public databases has been accumulated 
by the use of techniques such as single oligonucleotide chips or 
cDNA arrays that measure fewer than 6000 of potentially 30 000 to 
1 20 000 transcripts. The more limited range of analyses reported by 
the serial analysis of gene expression (SAGE) 6 - 7 technique accu- 
rately estimates changes in levels of the more abundant mRN As but 
requires extensive redundant analyses to measure changes in the 
patterns of expression of scarce mRNAs. We have used a modified 
polymerase chain reaction (PCR)-based cDNA differential display 
(DD) method in which single restriction fragments derived from 
the 3' end of cDNAs are separated on a sequencing gel. 11 ' 9 Bands 
fi-om the gel can be identified initially by sequencing, but then 



comparison of" patterns from different samples can be made widiout 
further sequencing. This sensitive and reproducible method detects, 
in principle, most cDNAs regardless of whether they arc repre- 
sented in existing databases. 

Systematic analysis of the function of genes can also be 
performed at the protein level. This approach has the advantage of 
being closest to function, because proteins perform most of the 
reactions necessary for the cell. The most common method of 
protcome analysis is the combination of 2-dimensional gel electro- 
phoresis (2DE) to separate and visualize protein and mass spectrom- 
etry (MS) for protein identification.' 0 Several such analyses of 
yeast and of normal or malignant mammalian cells have been 
perfurmed. To date, however, there have been few studies in which 
both mRNA and protein have been compared by applying analyses 
to the same samples. The studies of Anderson 11 and Gygi u showed 
thai there is not a good correlation between mRNA and protein 
levels, in yeast or human liver cells. However, other analyses 
disagree with this conclusion (Greenbauui et al, manuscript 
submitted, and Futcber et al 14 ). Furthermore, global correlations 
between changes in mRNA and protein levels have not been 
examined during the execution of any developmental program. 

The MPRO cell line was derived by transduction of a dominant- 
negative retinoic acid receptor construct into normal moxise bone 
marrow cells. It is a granulocyte-macrophage colony-stimulating 
factor (GM-CSF)-depcndent line arrested at a promyelocyte stage 
of development 15 - 16 After treatment with nW-trans retinoic acid 
(ATRA) most of the cells acquire the morphology of mature 
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neutrophils and begin to produce neutrophil lactoferrin and gelati- 
nase, 2 proteins characteristic of neutrophil secondary granules. 17 
As such, it offers a valuable model for studying neutrophil 
differentiation in vitro. 

We now report the analysis of mRNA expression changes 
during the process of MPRO cell maturation to neutrophils and 
compare the results with a limited analysis of cellular protein 
composition. mRNA expression changes were studied by combin- 
ing the use of oligonucleotide arrays and DD. A database (dbMC) 
with comprehensive genomic information for myeloid differentia- 
tion program was constructed (accessible at http://www.bioinfo.mbb. 
yale.edu/cxpression/neutrophil). We have grouped the changes in 
mRNA levels of a large number of genes into 6 patterns, with 
implicatioas for the genetic program of myeloid differentiation. 

We also compared 2-dimensioual high-resolution gel electro- 
phoretograms from control cells and cells differentiated for 72 
hours in the presence of AXRA. Fifty protein spots whose relative 
intensity changed prominently during differentiation were exam- 
ined by mass spectrometry. The results suggest a poor correlation 
between mRNA expression and protein abundance, indicating 
that it may be difficult to extrapolate directly from individual 
mRNA changes to corresponding ones in protein levels (as 
estimated from 2DE). 



Materials and methods 

Cell lines 

MPRO cells and HM-5 cells provided by Dr Schickwanii Tsai (Fred 
Hutchinson Can err Research Center, Seattle, WA) 15 were used throughout 
the study. The cells proliferated continuously as a GM-GSF-dependent cell 
line at 37 n C in Tscoves modified DoJbecco medium (Gibco BRL, Grand 
Tsland, NY) supplemented with S% to 10% fetal calf serum {Gibco BRL) 
and 10% HM-5~conditicmed medium as a source of GM-CSF. Morphologic 
differentiation of the blocked MPRO promyelocytes was induced hy 
treatment with 10 u,M ATRA (Sigma, St Louis, MO). Controls were 
cultured in the absence of ATRA but with the same volume of ve- 
hicle (cthanol). 

RNA Isolation and differential display 

After exposure to 10 p.M ATRA for 0, 24, 48, or 72 hours, total cellular 
RNA was Isolated from MPRO cells using TRJzol reagent (Life Technolo- 
gies, Gaithersburg, MD). cDNAwas then synthesized using a T-7 Sal-Oligo 
d(T) 32 primer as described previous ty.*-'* The double-stranded cDNA was 
digested with 1 of 9 different restriction enzymes {Apa\ y BgJH, Bam\il y 
Eagl y EcoRl, r/indlll, Xbal> Kpnl, and Sphl) and ligated to Y-shaped 
adaptors with a complementary overhang. DNA fragments were then 
amplified by PCR as described previously.** ,K PCR products were separated 
on a sequencing gel of 6% polyacrylamide with 7 M area. The gel was dried 
and exposed to x-ray film. Genes from differential display gels > whose 
maximum intensity changes equaled 2+ on a scale of 1 + to 8+, were 
recorded as significantly changed" Individual DNA bands were recovered 
from the gels, amplified by PCR, and sequenced 

Oligonucleotide chip analysis of RNA samples 

Ten micrograms lotal RNA from each sample (0, 24, 48, or 72 hours) was 
used to prepare cDNA. This cDNA was transcribed with T7 RNA 
polymerase lo prepare a fluorcsccntly labeled probc. J<WJ Each sample was 
hybridized lo mouse array chip (MulIK Army; Aflymetrrx, Santa Clara, 
CA) containing oligonucleotide probe sets corresponding to approximately 
7000 known genes or ESTs represented by UniGcne clusters. 22 cDNAs 
were considered present if their probe set results were rated as such by the 
GerieChip software (AfTymetrix) and if the average difference (AD) 
between perfect match and mismatch probe pairs was not less 100 U. If a 



gene was represented by more than one array probe set, the average of all 
probe sets for the gene was taken. Genes with AD values between 100 and 
200 were considered unchanged because of their low expression levels. 
Those genes with AD values equal to or more than 200 U at one time point 
were further studied by rescaling, threshold, aud normalization methods 
described in the MIT Center for Genome Research Web site. 1 3 A value of 20 
was assigned to any gene with an AD below 20 at some time point. 

Biofnformatics and database development 

All the sequences or gene fragments were searched using Blast against 
GenBank and TIGR gene indices. A database of genes or ESTs whose 
expression levels! changed during myeloid differentiation was constructed 
containing m formation for each band or gene. This included GenBank 
matches. Locus Link or Unigcne clusters, expression patterns, tissue 
distribution, syncmym(s) protein name, gene name(s), notations of possible 
functions, poly A signal and sequence quality, and hyperlinks to the 
database searches, sequence trace files, and related references. Atl gene data 
were then gathered into a cluster rite. Supplementary information is 
available at http.//bioinfo.mbb7ale.edu/expTession/neutrophil. 

Classification and analysis of DNA fragments 

Sequences from differential display analyses were classified as representing 
known genes, ESTfc, genomic sequences, or novel genes as described. 
Known genes from both differential display and arrays were clustered into 
27 functional categories and searched against SWISS-PROT (http:// 
www. cxpasy.cbr.nrc.ca/egi-bin/sproi-search- fid) or PIR (http://www.pir, 
gcorgctown.edu/). Information such as luncrion, subcellular location, 
family and super family class Hi cation, map position, similarity, synonym(s) 
protein name, gene namc(s), and so on was recorded in a variety 
of databases. 

Northern blot analysis 

Thirty micrograms total cellular RNA per lane from time-course MPRO 
cells were loaded onto 1.2% formaldehyde- agarose gels, then transferred to 
Hybond-N+ membranes (Amcrsham Pharmacia Biotech^ Uppsala, Swe- 
den). Alter standard prehybridization, membranes were hybridized over- 
night at 65 °C with radiolabeled cDNA probes (ordered from Research 
Genetics according to their dbEST tmagc ID). Membranes were washed at a 
final stringency of 60°C in 0.1 X SSC. 

Immobilized pH gradient 2 -dimensional gel electrophoresis 
and mass spectrometry 

Induced MPRO cells collected at 0 and 72 hours were lysed with lysis 
buffer (540 mg urea, 20 mg dithiothrcitol, 20 pLPharmalytc [3-10], 1 .4 mg 
phenyhnethylsnlfonyl fluoride, 1 u,g each aprotinm, leupeptin, pepstatin A, 
and antipain 50 jxg TLCK, and 100 |Lg TPCK/l mL). We apphed 100 pi 
each MPRO cell rysate (2.5 X lO^eils/lOO u.L) to immobilized pH 
gradient (IPG) strips (pH 3-10 L; Amersham Pharmacia Biotech), and IPG 
electrophoresis was conducted for 16 hours (20 100 Vh) using an Inunobi- 
linc Drystrip Kit (Amcreham Pharmacia Biotech), Electrophoresis in the 
second dimension was carried ou: in a 12% sodium dodecyl sulfate-- 
polyacrylamide gel electrophoresis (SDS-PAGE) gel with the Laemmli- 
SDS continuous system in a Protean II xi 2-D cell (Bio-Rad) run at 40 mA 
constant current for 4.5 hours. Proteins were detected by Brilliant Blue 
G-\:olloidal staining. 24 Protein spots were excised from the gel and digested 
with trypsin. ACTH clip (average [M+H] 2466.70) and bradykinin 
(average [M+TT] 1061.23) were used for calibration of peptide masses. One 
microliter sample digest was mixed with 1.0 u,L a-cyano-4 -hydroxy 
cinnamic acid (4.5 mg/mL in 50% CTT 3 CN. 0.05% TFA) matrix solution and 
1 \iL tali brants ( 100 ftnol) each. The spectra of the peptides were acquired 
in reflector/delayed extraction mode on a Voyager- DE STR mass spectrom- 
eter (Pcrseptive Biosystems, Faster City, CA). Peptides were identified 
using the ProFound search engine. 39 
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Results 

Differentiation of MPRO cells 

Figure 1 illustrates the morphologic changes in an MPRO celt 
population representative of those used for RNA expression 
analysis. Undifferentiated MPRO cells resembled promyelocytes 
under 1hc light microscope (Figure 1 A), After induction with ATRA 
for 24 hours > the cells morphologically differentiated into metamy- 
elocytes (Figure IB). At 48 hours, the cells further developed 
into metamyelocytes and band neutrophils (Figure IC). At 72 
hours, nearly (00% of MPRO cells became mature neutrophils 
(Figure ID). 

Identification of tnRNAs by differential display assay 

MPRO cellular mRNAwas analyzed at 0, 24, 48, and 72 hours after 
ATRA treatment. Nine restriction enzymes were used in a 3'-end 
DD approach. During MPRO differentiation, 1109 fragments 
corresponding to 837 transcripts were found to change substan- 
tially in expression levels (Figure 2). These represented approxi- 
mately 279 known genes, 112 ESTs, and 59 putative new genes, 
each with a perfect or fair polyadcnylation signal at an appropriate 
distance from the oligo-dT priming site. The gene information 
detected by DD was collected in database dbMCd. 

Identification of mRN As by oligonucleotide chip assay 

We used an oligonucleotide chip containing 13 179 probe sets 
corresponding to approximately 7000 murine genes to analyze 
patterns of mRN A expression in the same RNA samples used for 
DD. The information obtained by oligonucleotide arrays was 
collected in the database dbMCa. 

We clustered the genes by their similarity to idealized 
expression patterns. For instance, the expression pattern of an 
ideal gene that is ovcrexpressed (high) at time 0 and undercx- 
pressed (low) at 24, 48, and 72 hours, would be high-low-low- 
low (HLLL). Overall we have (2 4 -2) idealized patterns exclud- 
ing HHHH and LLL1-. Pearson correlation was used as the 




Figure 1. Morphology of MPRO ceils during differentiation. MPRO ceOs were 
induced as described In 'Materials and methods," concentrated by cytospln, and 
Wright-Giemsa stained. (A) Uninduced MPRO cells. (D) MPRO cells Induced with 
ATRAfcr 24 hours. (C) MPRO cells induced wfthATRAfor48 hours, (D) MPRO cells 
Induced with ATRA for 72 hours. , 




Figure 2. Distribution of genes obtained by DD assay. MPRO cell mRNA was 
snaked at 0, 24, 48, and 72 hours after ATRA treatment; 1109 fragments 
corresponding to £37 transcripts were found to change substantia By in expression 
levels. The tots I 837 transcripts were classified into 6 categories according to the 
bloinfurmatic analysis. Percentages show the gene distributions In these 6 catego- 
ries. Information for each transcript was collected In database dbMCd. 

measure of similarity of each gene expression pattern, 
x ~ {x\yX2>XiJCi) to each of the 14 idealized patterns 
>' = (yiO^vVjOU)* The 4 entries of x and y corresponded to the 
4-dimensional gene expression levels at 0, 24, 48, and 72 hours, 
respectively. Each gene was assigned to a cluster labeled by the 
idealized pattern that had the maximal correlation with thai 
gene. We selected only genes that hybridized well compared 
with the background (considered "present* 1 by. GeneChip soft- 
ware) and had maximal AD amplitude greater than 200 U in at 
least 1 of the 4 stages. We further tabulated the 14 patterns 
according to whether the gene expression changed at early 
(0-honr), intermediate (24- and 48-hour), and late (72-hour) 
time points and whether gene expression monotonically in- 
creased (up-regulated), monotonically decreased (down-regu- 
lated), or was not monotonic (transient). Table 1 shows 8 
clusters of 104 genes that had significant changes of mRNA 
levels, arranged according to the temporal stage and the 
monotonic/transient changes of expression levels. 

Principal component analysis determined whether we could 
comprehensively present multidimensional data (4-dimcnstonal in 
our case) in a simple 2-dhnensional graph. First, we found the 4 
principal components, which were the axes of the most compact 
4-dimensional ellipsoid diat encompassed the 4-dimensional cloud 
of data. Each axis was a different linear combination of the original 
4 variables- Then we verified that the first 2 principal components 
(the first 2 largest axes of the ellipsoid) captured most (95.2%) of 
the variation of the data. Therefore, the data could be faithfully 
projected (with a minor loss of information) into a 2-dimcnsional 
graph, with the 2 largest principal components as the x- andy-axes. 
As shown in Figure 3, genes tend to coalesce in clusters, according 
to their labels determined by their similarity to an ideal expression 
pattern. In summary, a genomic (global) picture of the distribution 
of genes according to their similarity to predetermined ideal! 'zed 
multidimensional expression patterns is concisely displayed in a 
2-dhnensional graph. 
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Table 1. Genes differently regulated during the different stages of mouse promyelocyte cell line differentiation process 

Timing 



Caiegory Early Middle Late 



Up-re gu lation 


LrlnM (n — 10) 


LLHH (n ---* G) 


LLLH (n =* 13) 




Mad P2nc1 rtgb2 Ihr2 Lcn2 ItprS 


Phat Cybb Pfc PiraS CdS3 Ifngrt 


Ma Cstitto Os/S10Qa8 L-CCR Cfss 




Cebpb H2-DBohiGZyx 




AJdol Rac2 Fprl Ctsd Ubb Ptmb4 


Down-regulation 


HLLL (n = 11) 


HHUL(n=1) 


HHHL (n = 37) 




Tcnj-V4 LyB4 Ctsg Gpi2-1 Mcptft 


Mpo 


Act* M2 EL2 Rpl19 Actb Ly6« Atfi Hlet2 




Myc Myb Tlr4 Npm1 Erh Hap60 




Psrna2 Gnas Zfp35 ll4ra LfbrShfdgl 
Max Rps8 CsfZrbJ Slpi Tctexl Tpl 3!f3 
Cn/fGys3 SldOal Ctsb Seppl Rin3 
Ccnb2 S100a9 Cfi 1 Hlst5-2ax Rela 


Transient 






Copa Gstml Gnb2-rs1 Grn RPL8 




LLHL (n ~ 9) 

Sell WK?Plra6 PirbLstl Ltf Sema4d State Mmp9 
LHHL(n =» 17) 

Cebpa Lyzs Fcgr3 Arf5 Lompl Stat3 Csf2ro Osl 
Acta SfpM Gpx3 PtprcPrtn3 Irf1 RpsBkal 
Ub4rMyln 





Arrays of Affymetrix Mu11k containing 13103 probe seta corresponding to 12002 GenBank accessions were used for hybridization. Arrays were hybridized with 
slrcptavidin-phycoerythrin (Molecular Probes) biotin-labeled RNA and scanned, Intensity for each feature of the array was captured using Genechip software (Aflymet/bt), and 
s single raw expression level for each gene was derived from (he 20 probe pairs representing each gene using a trimmed mean algorithm. For each gene, an AD ef 24-, 4B-, and 
72-hour samples was calibrated by dividing the siope of (he linear regression line for a graph wllh the x-axlslhe AD of 0-hour probe sets and the y-axls the AD of the respective 
time point (24, 4B, or 72 hount). A threshold of 20 U was assigned to any gene with a calculated expression level betaw 20 because discrimination of expression below this tavel 
could not be performed with confidence, 3 * Each gene expression profile was categorized ss described in Tables 3, 4, and S. For Ihe 4 time points, the minimum AD of the 
relattvery higher group (MIN-H) was divided by the maximum AD of the relatively tow group (MAX-L). and those genes whose MIN-H/MAX-L grester than 2 were selected as 
mear.ingfulty regulated. Genes were sorted In descending order based on the MIN-H/MAX-L. Genes In boldface are those whose expression level was in the top 20% (le, 
maximum AD of A time points grester than 3000), and genes in italics are those in the bottom 20% (ie, maximum AD of 4 time points less than 300). The differentiation period 
was grouped into 3 stages; early (0-hour), middle (24-hour and 4B-hour), and late (72-hour) stages. 

AD indicates average difference; gene symbols are expanded In an Appendix at the end of this ortlde. 




Figure 3. Gene clusters In the first 2 principal component spaces. Principal 
component analysis aliowed us to present Ihe multidimensional data (In this case. 
4-dlmensiona! data of each gene expression pattern) In a simple 2-dimens tonal 
graph. We derived the 4 principal components, which are a linear combination of the 
standardized expression Intensities (zero mean and unit variance) at 0, 24, 48. and 
72 hours. The first 2 principal components captured most of the variation of the data 
(approximately 85%). Therefore, the data can be displayed (wth a minor toss of 
information) in a 2-dlroensbnai graph. The first arxS second principal components, d and 
c2, are given by the linear combinations d = 0.747 • rrl - 0.11 - n2 - 0.656 • n3 + 0 * 
n4 and c$ = 0278 ■ n1 + 0353 • n2 + 0233 ■ n3 - 0363 * n4, where n1, n2. n3, 
and n4 are the rescaied and standardized expression levels at 0, 24, 48, and 72 
hours, respectively The axes legends cl and c2 stand for the first 2 principal 
components. In this paper we used the Pearson correlation to measure the similarity 
of each gens with the tdeateed expression patterns, ss opposed to the Euclidean 
distance we used tn a previous wor*,« because clusters were better separated using 
this measure. In both cases, we presented the data In the 2-<Jlmensional space of the 
tawest principal components. The data had a tendency to be circularly distributed 
when we used the Pearson correlation as a distance measure. 



Correlation between array and DD analyses 

We have previously demonstrated a correlation coefficient of 0.93 
between visual estimates of changes in band intensity on DD and 
Phosphorimager System (Molecular Dynamics, Sunnyvale, CA) 
estimates of band intensity and a correlation coefficient of 0.88 
between hybridization intensity changes of mRNA on Northern 
blot analyses and changes in band intensity on DD. 19 In a few cases 
there were clear discrepancies in the pattern of expression of a 
gene, as estimated by DD and by oligonucleotide chip analysis. We 
chose the 6 most extreme cases and examined the levels of mRNA 
change for these genes by Northern blot analysis (Figure 4). In 5 
cases, the Northern blot results agreed with ihe results of the DD 
analysis, whereas the results of Gnb2-rsl disagreed with the 
oligonucleotide array but duplicate bands from DD showed a 
relatively high level of expression in the 0 time sample that did not 
correlate with the Northern blot (Table 2). One possible explana- 
tion for these findings was the change in the relative use of different 
poly adeny lation sites after the addition of ATRAto the MPRO cells. 

Constructing a database for mRNA level changes during 
myeloid differentiation 

Based on the data obtained above, an in-house database (dbMC) 
was constructed that included 2 subdatahases, dbMCd and dbMGi, 
for collecting gene information from DD or oligonucleotide arrays, 
respectively. Each entry in dbMC is accompanied by a so-called 
executive summary. The linkage between dbMCd and dbMCa was 
established by UniGene ID and cluster ID. dbMC contains the 
temporal expression patterns of genes during the MPRO cell 
diflerentiation process, including not only products represented in 
public databases but also novel transcripts. 
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Figure 4. Northern blot analysts of selected mRNAs. Equivalent amounts of RNA 
from MPRO cefls Induced byATRAat different time points (0 hour, 2* hours, 48 hours, 
and 72 hours) were resolved by fbmialdehyaVagarose gel electrophoresis, stained 
to verify the amount of loading. Eleven genes were separately probed on the RNA 
fitters. The gene symbol of each probe was listed at the left of a related Northern blot 
result. Detailed Information on these 11 probes was listed In Table 5. One of the 
RNA'btatted membra no photographs Is shown with methylene blue-stained 28S and 
18S RNA subunits demonstrating the quality and quantity of RNA loaded In 
individual lanes. 

Analysis of gene expression patterns during MPRO 
differentiation 

Many of the genes identified in this study were found in myeloid 
cells or were implicated in myeloid development for the first time. 
We detected 8 cytokines 23 and chemolcines whose mRNA levels 
changed more than 5-fold by arrays and 2-fold by DD during the 
maturation of MPRO cells (see our Web site, http://bioinfo.mbb. 



yale.edti/cxpression/neutrophil). Among these were 2 members of 
the CC chemokine family. Interleukin-1 a (IL-la) was up-regulated 
at the late stage of differentiation (LLLH pattern. Table 1). 

mRNA for approximately 52 receptors was detected by one or 
the other method. A number of the receptors known to be present on 
mature neutrophils showed late induction of mRNA, and their 
levels of induction were high, indicating that the expression of 
these products is a prominent event late in neutrophil maturation 
(Table 3). Rarely was mRNA for receptors down-regulated, 
consistent with myeloid maturation being accompanied by increas- 
ing responsiveness of the ceil to a variety of external stimuli. 

Expression of mRNA for granule proteins 

Neutrophils contain several types of granules that develop at 
different stages of myeloid maturation. 3 ' 17 ' 26 Levels of mRNAs 
encoding secondary granule proteins, such as lactoferrin, increased 
as the colli matured (Table 4). The level of mRNA for Mmp9, 
reported as a tertiary granule protein, increased markedly between 
24 and 48 hours after the induction of differentiation, whereas 
mRNAs for secondary granule proteins either increased less 
markedly or showed a maximum increase by 24 hours. mRNAs for 
several primary granule constituents, such as myeloperoxidase and 
cathepsin G, were present in unstimulated cells and decreased as 
the cells matured. There was a discrepancy in the measurements of . 
proteoglycan mRNA by DD and oligonucleotide chips, but North- 
ern blots showed that it reached a peak at 48 hours and then 
declined (Figure 4). Cathepsin D is repotted as a primary granule 
protein, but its partem of mRNA expression more closely re- 
sembled that of secondary granule constituents. In addition to 
known granule components, mRNAs for several other cathepsins 
were up-Tegulated during myeloid differentiation, in parallel with 
or later than the tertiary granule protein mRNAs. 

mRNAs for transcription factors 

Transcription factor genes, including several identified at the sites 
of consistent chromosome rearrangements in acute myeloid leuke- 
mia, have been implicated in normal myeloid differentiation and in 
the expression of neutrophil proteins/' 5 * 27 However comprehensive 
information concerning the expression of these transcription fac- 
tors during myeloid development is not readily available. There- 
fore, we compared gene names and identifiers in our databases to 
those of the transcription factor database Transfac (http:// 



Table 2. Expression patterns of genes detected by Northern blot analysis 



Gene 
symbol 


Gene 
accession 




AO value by array 






Intensity by DO 




Oh 


24 h 


48 h 


72 h 


Oh 


24 h 


48 h 


72 h 


Cebpa 


M62362 


33 


212 


182 


44 










Cecpb 


X62600 


350 


1248 


1380 


1303 










Cebpd 


X61B00 


157 


262 


168 


430 










Cebpe 




















Myb 


M12848 


892 


356 


230 


435 










Sipf 


U73004 


617 


501 


783 


402 


1 


2 


3 


3 


Pro3 


W45634 


153 


259 


339 


345 


5 


1 


1 


2 


Gnb2-rs1 


X75313 


4231 


3623 


3215 


3403 


4 


4 


1 


1 


Ly6e 


U 04 268 


3061 


5391 


2844 


1252 


3 


2 


1 


1 


Lspt 


M9C316 


65 


376 


840 


28 


2 


3 


5 


6 


Actb 


XD3765 


3095 


3588 


3976 


2434 


1 


2 


3 


2 



Gene symbol end gene accession refer to National Center for Biotechnology Information databases and, In particular, to Locus Link. AO value (s the average difference in 
the value of hybridization intenstty between the set of perfectly matched oligonucleotides and the set of mismatched oligonucleotide in the oligonucleotide array. Band 
intensities from DD were semiquaniified on s scale from 1 (+)toB(-i- + + + + + + -!-). These estimates are shown as boldface numbers in this table. 19 Both AD value and 
Intensity of genes were studied at 4 time points corresponding to MPRO ceils induced for the indicated times. 

DD Indicates differential display; MPRO, mouse prcmyetocytic cell ifna; for gene symbols, see the Appendix at the end of this article. 
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Table 3. Receptors expressed during myeloid differentiation process 



AD volue fay array 



Maximal fold change 



Gene symbol 


Gene accession 


0 h 


24 h 


. .._,._? 8t> 


72 h 


Bzrp 


D21207 


641 


65B 


681 


887 


Cmkar4 


X995Q1 


508 


447 


378 


664 


Crry 


M34173 


433 


384 


506 


506 


Csf2rh1 


W 34 397 


318 


345 


410 


241 


HliSa 


21 8273 


188 


272 


273 


339 


M6pr 


XB4068 


536 


409 


408 


649 


MFPIR 


AA1 16789 


232 


84 


, 63 


381 


TCRGB 


M28053 


16S 


212 


244 


299 


Trifrsfla 


M59377 


0 


1 


1 


1 


Cmkbrl 


U28404 


221 


244 


504 


638 


Cmr 


X72305 


121 


200 


250 


355 


Csf2ra 


MB5078 


171 


372 


402 


254 


Ebl3 


AF0131U 


187 


270 


428 


148 


Gridl 


D10171 


128 


164 


150 


257 


Ifngr 


J05265 


141 


263 


327 


251 


H2rg 


U21795 


205 


184 


231 


477 


Ldlr 


X64414 


13S9 


1653 


1665 


3968 


P40-S 


.102870 


849 


677 


381 


640 


Plaur 


X62701 


312 


443 


476 


734 


Rang 


M34476 


102 


113 


114 


218 


Srbi 


U37799 


126 


232 


132 


258 


Cr2 


M28281 


83 


138 


243 


77 


Csf2rt>2 


M2S655 


209 


248 


437 


111 


Fcerlg 


J05020 


2398 


2766 


3365 


8751 


Fcgr2b 


X0464B 


1703 


1652 


1431 


4605 


Ifngr2 


U6959S 


1 


2 


2 


3 


Nr4el 


XI 6995 


96 


188 


202 


401 


I11r2 


X59769 


482 


1796 


2672 


3818 


C5ii 


L05630 


185 


434 


808 


1078 


Drd2 


X5S674 


0 


0 


0 


219 


Fcgr3 


M14215 


1 


1 


' 1 


2 


Fprl 


U21B1 


0 


69 


141 


671 


GCR 


AA24D71 1 


2 


0 


0 


0 


L-CCR 


AA034B46 


4$ 


175 


314 


2056 


NMDARGB 


AAB20211 


2 


2 


0 


0 


P2TX1 


XB469B 


79 


346 


530 


744 


Piral 


U96632 


0 


43 


172 


378 


Pira5 


U96666 


■ 274 


391 


954 


1874 


PiraG 


U9G687 


122 


635 


2014 


1716 


Ptrb 


U96689 


191 


445 


966 


747 


Sett 


M25324 


46 


104 


570 


20 


Tog-V4 


M54993 


1650 


78 


65 


315 



less than 2 



2 or more, less than 3 



3 or more, less than 4 



4 or more, less than 5 

5 or more 



Receptors are identified as present whose maxima! AD values were more than or equal to 2G0 U in this study. Genes were sorted by their expression patterns as follows; 
first by the average difference vaiue, then by the difference between minimum and maximum AD for the 4 time points, and last by the alphabetical order of gene symbols. Genes 
were ordered according to the maximal fold change of AD values. Abbreviations of gene names are taken from gene symbols listed in the Locus Link portion o! the National 
Center for Biotechnology Information database whe^e available. Numbers in bold denote those gene expression patterns obtained hy differential display rather than by 
oligonucleotide array assays. The other information is presented as in the legend to Table 2. 

AD indicates average difference; gene symbols are expanded In an Appendix at the end cf this article. 



www.transfac.gbf-braiinschweig.de/TRANSFAC) and determined 
which factors contained in this database were present at detectable 
levels in MPRO cell mRNA, using Affymctrix software for the 
criteria for inclusion of inRNAs from approximately 200 murine 
transcription fectors probe seta on the oligonucleotide chip. Of 
these, 54 were expressed and 1 3 showed changes of 3-fold or more 
in chip signal (Table 5). 

The changes in certain transcription factors, such as the moderate 
down-regulation of myb and myc and (he up-regulation of the Max 
dimerization protein MAD, were consistent with the shift of the cells 



fiom a proliferative to a differentiated state. 28 Some changes are more 
difficult to explain, such as the up-regulation of DPI , a partner for E2f 
factors in the regulation of S-phase genes, and the mild up-regulation of 
the Id genes, commonly associated with an inhibition of differentiation 
by competition with bHLH transcriptional activators 

The C/EBP family has been extensively studied with respect to 
myeloid differentiation. 2 ' 30 Absolute levels of the C/EBP a and 5 
mRNAs were low, probably at the borderline of significance for the 
oligonucleotide chip assay, whereas the level of C/EBP p appeared 
higher. In addition, there were discrepancies between the chip 
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Table 4. Granule constituents expressed during mouse pro myelocytic cell line cell differentiation 



AD vslue by array 



Granule constituent 


Gene symbol 


Gene accession 


Oh 


24 h 


48 h 


72 h 


Azumphil (pritnary) granules 
















Man2c1 


AA16t66Q 


178 


134 


99 


164 




Ctsb 


M65270 


442 


480 


595 


389 




Ctsd 


XS2836 


214 


10B7 


1826 


2784 




Cleg 


M96801 


1509 


405 


46 


236 




E12 


UG4562 


658 


1273 


843 


157 




E1a? 


AA583016 


47 


159 


134 


163 




Gus-s 


M63836 


544 


226 


266 


254 




Lyzs 


M21050 


0 


1 


1 


3 




McptB 


X7B545 


831 


268 


66 


491 




Mpo 


X1S37B 


378B 


3009 


776 


692 




Prg 


X16133 


2621 


■ 2653 


2920 


9859 


Possible granule proteins 
















Ctsc 


AA 144 887 


252 


194 


342 


576 




Ctse 


X97399 


1 


3 


4 


5 




Ctsh 


U06119 


45 


124 


195 


156 




C1sl 


XOB086 


16 


11 


31 


237 




Ctss 


AA089333 


12 


9 


88 


463 


Specific secondary granules 
















Cpa3 


J05118 


621 


270 


90 


801 




Cd36l2 


AB008553 


113 


93 


157 


187 




Cnlp 


X94353 


30 


479 


704 


626 




Cybb 


U433B4 


8 


24 


91 


128 




Ear2 




0 


1 


1 


2 




Fpr1 


L221B1 


178 


220 


235 


845 




Ugb2 


X14951 


0 


2 


4 


2 




Lcn2 


W13166 


916 


3513 


3531 


6036 




Ltf 


J03298 


18 


162 


333 


138 




Map 


W45834 


5 


1 


1 


2 




Mmp13 


X66473 


44 


43 


72 


178 




Ngp 


L37Z97 


2661 


4782 


2311 


6912 


Tertiary granules 
















Mrnp9 


227231 


0 


1 


2 


2 



Shown ana the possible granule protein cONAs represented on me ottgtonucleotlde arrays, sorted by thefr expression patterns as follows: first by the average difference AD 
vslue, then by the granule types, and last by the alphabetical order of gene symbols. D2ta are presented as described in the legend to Table 3. 
AD indicates average difference; gene symbols are expanded in an Appendix at the end of this article. 



estimates and the mRNA levels observed by Northern blotting with 
specific probes for these genes. In particular, the latter method, 
more sensitive and specific, showed that C/EBP a began to decline in 
the most mature cells, whereas C/EBP 6 mRNA declined progressively 
beginning at 24 hours after (he onset of differentiation. 

C/EBP e Is a more recently cloned C/EBP family member Previous 
studies indicated it is expressed in a large array of human leukemia ceil 
lines blocked at various stages of differentiation and that it is up- 
regulated during granulocytic differentiation. 31 A C/EBP € probe was 
not included in the ohgonucleotide chips, and this mRNA was not 
detected by DD. Therefore, we examined the C/EBP € expression 
patterns by quantitative PCR and Northern blot analysis (Figure 4). 
C/EBP e exon i was PCR amplified from MPRO RNAs using primers 
RY48 (AGCCCXCGACACCCTTGXTGA) and RY49 (TGCCACACT- 
GCGGGCAGACAG). 35 The results showed that C/EBP € is expressed 
throughout myeloid differentiation, with expression levels increased 
moderately in the later stages. 

We detected a number of other transcription factors that are 
broadly expressed or that have been reported in other studies of 
hematopoiesis (Table 5). Some of the factors that were most 
strongly induced during differentiation have been studied in other 
contexts but not previously implicated in hematopoiesis, such as a 
mammalian homologue to the Drvsophila enhancer of split gene, a 
transcriptional silencer. The mammalian gene is expressed at 
relatively high levels as measured by the oligonucleotide chip and 



is a candidate for mediation of the silencing of growth-related 
genes in the maturing neutrophil. Another candidate transcriptional 
silencer, Tiflb, may serve as a corepressor for the KRAB domain 
family of zinc finger transcription factors and also may mediate 
binding of the heterochromatin protein HP1 to DNA. 33 

There were 26 transcription factors whose mRNAs showed no 
significant changes by oligonucleotide chip analysis and were not 
identified as differentially regulated genes by differential display 
assays. PU.l, a factor necessary for the production of neutrophils 
and the expression of several neutrophil genes, 34 showed less than a 
3-fold increase in mRNA, below the threshold for a significant 
change. Other candidate hematopoietic transcription factors, such 
as PEBP1&B2 (AMU), GATA-1, and SP-2, were represented on 
the oligonucleotide chips, but their mRNA levels were so low that 
they were reported as absent in this study. The possibility that small 
changes in the levels ot ratios of some transcription factors could 
produce marked changes in transcription potentially liinits the 
ability of d2ta generated by present methods to explain transcrip- 
tional changes during differentiation. 

Protein expression patterns of MPRO cells during 
ATRA Induction 

We visually compared the 2DE patterns from MPRO cells at the 
same time points used for mRNA analysis. In most cases the 
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Table 5. Transcription modulators presented during myeloid differentiation 



Maximal fold change 



AD vatus by array 



Gene symbol 


/2An* Ckrr *^^lrm 


Oh 


24 h 


48 b 


72 h 


Zfp11-S 


AB020542 


2630 


2989 


2795 


2515 


8tf3 


W13502 


3 


3 


2 


1 


Gala2 


AB000096 


562 


770 


472 


730 


Hmgt 


J04179 


337 


. 346 


177 


232 


Idb1 


M31B85 


455 


787 


721 


637 


Max 


M63903 


256 


224 


312 


172 


Nfatc2 


AA560093 


2313 


3218 


2396 


2542 


Pm1 


U33626 


173 


2B1 


329 


305 




M34476 


102 


113 


114 


218 


Rota 


M619C9 


297 


280 


304 


244 


Sox 15 


W53527 


419 


461 


j 484 


837 


Ybx1 


M62B57 


643 


489 


472 


496 


Zfp162 


Y12B38 


671 


734 


720 


992 


e pd 


X61B00 


157 


262 


168 


430 


Idb2 


IYIU9jC 


244 


210 


310 


604 


Jundl 


VV^JJuu 


1274 


2Q02 


1434 


3085 


Lyi1 


X5T76B7 


399 


342 


347 


891 


Nfe2 


L09600 


458 


743 


1042 


505 


Nfkbl 


L28117 


953 


2044 


1B76 


2034 


Pbx1 


AF0201B6 


611 


303 


345 


212 


stpll 


A34693 


375 


784 


891 


529 


Trflb 


Uo f JU J 


673 


659 


420 


663 


Trp53 


P10351 


259 


149 


125 


361 


Usf2 




129 


185 


285 


192 


Ybx3 


L35549 


96 


169 


210 


119 


Zfp216 


AA510137 


62 


151 


204 


106 


Irfi 


M21065 


85 


207 


278 


198 


Ktf2 


U25096 


62 


86 


246 


77 


Myb 


M1284B 


892 


356 


230 


435 


Stal3 


AA396028 


484 


1057 


1012 


290 


Tfdpl 


C08639 


307 • 


560 


505 


1093 


Cobpb 


X62600 


390 


1248 


1380 


■ 1903 


Stral4 


Y07636 


223 


383 


510 


936 


Cebpa 


MG2362 


33 


212 


1B2 


44 


Grg 


X73359 


99 


565 


916 


1005 


Mad 


X8310B 


0 


111 


167 


327 


Myc 


L00039 


314 


112 


62 


173 


EtohtO 


W896H7 


169 


386 


313 


1003 


TBX1 


AA542220 


0 


0 


1 


2 



Less than 2-fold 



2 or more, lass than 3 



3 or more, lass than 4 



4 or more, less lnan 5 



Shown are the transcription factors identified as present by the oligonucleotide array analysis whose maximal AD between perfect match and mismatch oligonucleotide 
sets was greater than or equal to 200 U in this study. Data are presented as described in the legend to TaWe 3. 
AD Indicates average difference; gene symbols are expanded In an Appendix at the end of this arbcle. 



peptides identified foT a given protein were derived from regions 
along the entire length of the protein, indicating the observed 
products were not the result of proteolytic degradation. These 
data must be considered with several caveats: membrane and 
other hydrophobic proteins and very basic proteins are hot well 
displayed by the standard 2DE approach, and proteins present at 
low levels will be missed. 35 In addition, to simplify MS analysis, 
we used a Coomassie dye stain rather than silver to visualize 
proteins, and this decreased the sensitivity of detection of minor 
proteins. The MS method we used was sufficiently sensitive to 
identify proteins that could barely be visualized by colloidal 
blue staining. However, a limitation of the method for the mouse 
is that the current database lacks predicted amino acid sequences 
for a substantial fraction of murine genes. In addition, very 
small proteins give only a few peptides, making statistically 
confident identification difficult. 



Figure 5 shows the analytical colloidal blue-stained 2DE IPG 
reference maps of differentiated MPRO cells. Expression patterns 
of more than 500 protein spots were detected and observed through 
the entire series of gels. Protein spots could easily be cross- 
matched to each other, indicating the reproducibility of the method. 
As marked on the gel pictures (Figure 5), 50 proteins with a wide 
range of molecular weights (I to 200 kd), isoelectric points (4 to 9), 
and abundances were subjected to MS protein identification. The 
results are presented in Table 6. 

Comparing the theoretical value of the molecular weight and pi 
of each protein to that of the observed value, we confidently 
identified 28 proteins in the expected position on the gels (spots I to 
28). Some of (he other proteins with strong matches to the murine 
databases migrated to a somewhat unexpected pi position. Nine 
spots gave clear peptide peaks on mass spectroscopy but did not 
match any known gene. Their identification will require amino acid 
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Figure 5. 2DE electrophoretograms of MPRO cells. 
MPRO cell lysate (2.5 X 10° cell/sample) was loaded for 
2DE analysis. Gels were stained with brilliant blue G-- col- 
loidal dye. (A) 2DE map of urrinduced MPRO ceil {0 hour). 
(B) 2DE map of matured MPRO ceils (72 hours). Protein 
spots marked In the maps were considered differentia Uy 
expressed and ware subjected to MS analysts. The 
rssuttent protein information is listed in Table 6. 
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sequence analysis or availability of more extensive murine data- 
bases. We searched for the expression patterns of the genes cognate 
to the expressed proteins in dbMC (Table 6). Nineteen genes were 
found in dbMC, the mRNA foT 5 genes was reported as absent, and 
13 genes were present during MPRO differentiation. Comparison 
of the expression patterns showed only 4 genes of 1 8 present on the 
oligonucleotide chips whose expression was consistent at the RNA 
level and protein leveL None of these was on the Hst of the genes 

Table 6. Correlation of expression patterns between mRNA level and protein level 



mat were differentially expressed significantly (5-fold or greater 
change by aTray or 2-fold ot greater change by DD). 



Discussion 

We explored the temporal patterns of gene expression during 
myeloid developmenL A database has been developed to provide n 



Spot 


Protein definition 


Gi number 


Predicted 
value 

kd pi 


Percentage 
(%) 


2DE pattern 
Oh 72 h 


cDNA expression 
pattern 

Oh 72 h 


Afl 


1 


GRP7B 


2506545 


72.4 


5.1 




1 


3 


1321 


1043.3 


N 


2 


Artln. gamma, cytoplasmic 


6752954 


41.77 


5.3 


40 


3 


6 


0 


2 


Y 


3 


RHO GDI 2 


2494703 


22.83 


4.9 


33 


3 


3 


341 


441.6 


Y 


4 


Proliferating cetl nudear antigen 


7242171 


26.77 


4.7 


42 


1 


0 


544 


430.9 


Y 


5 


APS kinase 


4038346 


69.8 


7.1 


24 


2 


1 


43 


50.7 


N 


6 


Pyruvate kinase 3 


6755074 


57.9 


7.2 


48 


6 


4 


3047 


5880.3 


N 


7 


Melanoma X-actin 


6671509 


41.72 


5.3 


39 


1 


3 


2539 


341.3 


N 


B 


Glyceraidehyde-3-phosphate dehydrogenase 


6679937 


35.79 


B.7 


39 


B 


7 


3073 


5742.3 


N 


9 


Steftn 3 


461911 


10.99 


5.9 


48 


0 


4 


N/A 


N/A 




10 


Guanine nucleotide binding protein, beta-2, 






















related sequence 1 


6680047 


35.06 


7.9 


21 


4 


2 


139 


303 1 


N 


11 


TrtosephcspHate iscmerase 


6678413 


28.69 


6.9 


26 


3 


3 


3312 


2650.1 


Y 


12 


Testis -derived c-abl protein 


1196524 


17.19 


7 


51 


2 


3 


152 


126.9 


N 


13 


RNA binding motif protein 3 


794S121 


16.59 


6.8 


25 


1 


0 


628 


812.4 


N 


14 


Coliapstn response mediator 


6681019 


62.16 


6.4 


36 


2 


0 


Absent 


Absent 


N 


15 


Lamln A 


22C474 


47.52 


6.6 


35 


2 


0 


Absent 


Absent 


N 


16 


47-kd keratin 


52783 


35.82 


4.8 


29 


3 


0 


Absent 


Absent 


N 


17 


sid478p 


5931565 


31.3 


6.7 


30 


1 


2 


Absent 


Absent 


N 


16 


MHC dass il H2-iA-beta-5 


3169662 


28.6 


7.1 


39 


1 


2 


N/A 


N/A 




1B 


Androgan-binding protein; subunrt alpha 


739346 


8.04 


6.4 


68 


0 


2 


Absent 


Absent 


N 


20 


Neuronal apoptosls inhibitory protein 


5932010 


158.7 


6 


17 


1 


0 


N/A 


N/A 




21 


PAD type IV 


675501 B 


74.46 


7.2 


21 


1 


3 


N/A 


N/A 




22 


Human serum albumin homologcue 


3212625 


66.45 


5.7 


24 


0 


6 


N/A 


N/A 




23 


syncrip 


6576815 


62.53 


7.2 


33 


2 


1 


N/A 


N/A 




24 


Transaminase 


1730203 


48.22 


7.2 


31 


3 


1 


N/A 


N/A 




25 


PGK crigr phosphoglycerete 


1730519 


44.54 


6.3 


47 


5 


4 


1088 


1402.3 


N 


26 


Proliferation-associated gene A 


6754976 


22.16 


8.6 


53 


3 


1 


N/A 


N/A 




27 


Putaluve peroxisomal antioxidant enzyme 


3913065 


17 


7.8 


55 


0 


3 


N/A 


N/A 




28 


IgE chain C2 region 


2137430 


12.1 


5.2 


38 


0 


1 


N/A 


N/A 





The proteins listed here are represented by the spots marked In the eiectrophoretograms shown in Figure 5. 

Protein definiiion, GI number, and predicted varus refer to the protein name, accession number, and properties derived from the National Center for Biotechnology 
Information protein database. The column labeled % shows the porcentago of peptides predicted from the protein sequence that wcro detected by mass spectroscopy. Tho 
expression level of protein spots expressed in mouse promyelocyte cell line csH Induced by ad-fmns rctmoic acid for 0 hours and 72 hours (Figure 5) were scored on a scale of 
1 (+) to 8 (+ + + + + -r + +) in the 2DE pattern column. The cDNA expression patterns of the cognate mRNAs are listed in the cON A expression pattern column abstracted from 
the dbMC database. The genes not represented on the oligonucleotide arrays were marked as N/A Ag showed the correlation of gene patterns at mRNA level or protein level 

Y indicates agreement and N discrepancy between changes in cDNAand protein spot intensity. The numbers in bold were obtained with DD. 2DE indicates 2-dimensk»»ai 
gel electrophoresis; IgE, immunoglobulin E; DD, differential display. 
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reference for later research on the molecular mechanisms underly- 
ing normal myeloid development. 

The MPRO cell system morphologically mimics normal myeloid 
differentiation and biochemically proceeds further toward mature neutro- 
phils than most olher in vitro systems. Because the arrest in differentia- 
tion of MPRO cells growing in the absence of ATRA is not physiologic, 
there is a theoretical risk that gene expression in these cells is not 
cxxirdinated in the way that it is in normal differentiation. It is 
encouraging that, for the most part, me timing of expression of genes for 
proteins of the various neutrophil granules is consistent with the timing 
of the morphologic and biochemical appearance of these granule 
components during normal myeloid differentiation. 

The DD technique provides certain advantages for detecting 
and comparing mRNA levels in different samples. First, the method 
is, in principle, similar to competitive RT-PCR, and, with the use of 
stringent PCR conditions, is expected to be about as reliable. 
Second, display patterns are reproducible. Third, the method 
detects the levels not only of RNAs already represented in the 
database but also of unknown RNA species that may represent 
"new" genes. Fourth, closely Telated genes can be distinguished 
regardless of cross-hybridization, provided there are some single 
nucleotide differences in the 3' end sequence. Limitations associ- 
ated with this technique are that numerous gels are necessary to get 
complete information and that comparison of the levels of different 
mRNAs is only approximate because of the differential amplifica- 
tion of bands of different size or sequence. 

Oligonucleotide chip analysis is a fast and effective means of 
accessing mRNA expression patterns. 20 Cluster analysis of groups 
of samples by this approach is effective; However, the present 
results indicate that alternative methods of verification are desir- 
able before the data on an unexpected change in a particular gene 
are definitively accepted. 

To obtain the broadest range of information from the myeloid 
differentiation process, both differential display and oligonucleotide 
chip techniques were applied in the current study. As a result, 65.3% of 
the observed changes in mRNA levels came from the differential display 
method and 41 .5% came from oligonucleotide chip assays. 

Our data showed in general that changes in expression pattern 
by the 2 methods agreed qualitatively but that there was some 
quantitative variation. Our results indicate that DD may be a more 
accurate way to detect changes in levels of gene expression than the 
oligonucleotide chip assay. However, improvements in the types of 
oligonucleotides used in arrays may close this gap in the future. 

The mRNAs for a limited number of transcription factors vary in a 
pattern correlating with that of the mRNAs for primary or .secondary 
granule proteins. However, more detailed infon nation is needed, and the 
underlying mechanisms of granule gene regulation remain unclear. The 
number of potential positive and negative regulatory factors found here 
is sufficiently small as to make it feasible to perform in vivo studies, 
such as chromatin rrmnunoprecipitatioa 

The oligonucleotide chip used in this study focused on known 
genes, whereas the DD method samples all polyadcnylated tran- 
scripts. The latter method generated a large number of products not 
associated with known genes, in part because the mouse genome is 
not as well represented in the database as the human genome. 
However, our experience with DD and human mRNAs indicates 
that substantia] fractions of the products represented as ESTs or not 
represented at all in the public databases are cDNA copies from 
introns, bnRNA, or other RNA with internal A runs. 

Approximately 59 sequences obtained from gel-display bands 
had significant changes in the level of expression and a sequence 
that did not match that for any named gene in the public databases. 
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Of these, 38 had plausible or excellent polyA signals. This is only 
an approximate estimate of the number of new genes found 36 
because a fraction of the mRNAs for known genes still had poor 
polyA signals. In addition, the full 3' untranslated region is often 
not known for characterized genes, and in some cases these new 
genes may prove to be identical to products identified by the 
oligonucleotide chips when more complete sequences are obtained. 
At the least, their presence indicates that a substantial fraction of 
the regulatory or functional circuitry of maturing myeloid cells 
remains unexplored and that valuable tools for their investigation 
will emerge from a combination of RNA expression studies and 
analysis of emerging genomic sequences. 

The desired end point for the description of gene expression in a 
biologic system is not only the analysis of mRNA transcript levels 
but also the accurate measurement of protein abundance. The 
developments in 2DE and new MS instrumentation make it 
possible to accomplish this work rapidly and efficiently. In this 
study, we attempted to identify a number of the proteins differen- 
tially expressed between uninduced and ATRA-differentiated MPRO 
cells and to examine the relation between mRNA and protein expression 
levels for these genes representing the same state. 

For protein levels based on estimated intensity of Coomassie dye 
staining in 2DE, there was poor correlation between changes in mRNA 
levels and estimated protein levels. Other groups have studied the 
correlation between mRNA and protein levels in yeast and liver 
cells. 'U 2,14 m the liver cell eaqseriments, 11 ^ 2 correlation coefficients of 
0.4 to less than 0.5 were observed. Tn an extensive study in yeast, 1 M2 the 
correlation coefficient was high if the most abundant mRNAs and 
proteins were considered. If a handful ofthese products was omitted, the 
remaining correlation coefficient was 0.4 or less. However, one 
could restore some of the correlation by averaging individual 
data points into broad proteomic categories. 37 

The discrepancies between mRNA and protein levels in MPRO cells 
appear to be substantially larger than those observed for yeast Possible 
causes for the discrepancies include translational regulation, differential 
expression of certain mRNAs at various stages of cell growth hi vitro, 
post-translational protein modification that varies with the stage of 
inaturation of the cells, and selective degradation or excretion of proteins 
in vivo. Innthermorc, here we are focusing on a developmental 
time-course, whereas the yeast study concentrated on the organism in 
vegetative growth. New techniques, equipment, and bioinformatic 
analysis tools must be developed to make such systematic, global, and 
quantitative analyses feasible. 

The initial studies of protein expression presented here provide a 
cautionary note for efforts to interpret cell composition and function in 
relation to mRNA levels. Discrepancies we observed between gene 
expression and protein abundance suggest that selective post4ranscrip- 
tional controls may be at least as important as changes in mRNA levels 
in determining the protein compasrfion of neutrophils and that they are 
phenomena less well explored than uanscriptional controL Analysis of 
mRNA expression patterns is itself only a small beginning toward a 
genome-wide description of cellular components. 
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Appendix 

Gene symbols used in tables: Actb: actin, beta, cytoplasmic; Actg: actin, gamma, 
cytoplasmic; Acct: niclanania X -actin; Aidot: aldolase 1, A isoihrm; ArS: 
ADP-ribosylauon factor 5; AJttl: activating transcription fector 1; AtQ: activating 
transcription factor 2; BtO: basic transcription factor 3a; Bzrp: pcrir^ierdt-type 
bciizodiazqinetccq^tac; C5rl: coinpianent enrnnonrnt 5 > receptor 1/G protein- 
coupled receptor (C5a); Ccnbi cyclin B2; Cd36L2: CD36 antigen (collagen type 
I Teceptor, thrombospondin receptar)-lilte 2; C<I53: CD53 antigen; Cebpa: 
CCAAT/cnhanccr bindmg protein CEBP t alpha; Ccbpb: CCAAT/cnhanccr 
binding protein (C/EBP), beta; Ccbpd: CCAAT/cnhanccr binding protein (O 
EBPX delta; Ccbpc: CCAAT/enhancer binding protein (C/EBP), cpsilon; Cfll: 
cofiiin 1, norinxuscle; CrnkaM: chemokine (C-X-Q receptor 4; Cmkbrl: chemo- 
kine (C-C) receptor 1/Mipla receptor, Cnlp: cscncHn-likc protein; Cntf: ciliaiy 
neurotropic factor/zinc finger protein PZF; Copa: coatomcr protein complex 
siibuiiit alpha; Cpa3: carbcrxypcptidasc A3, mast cell; Cr2: complement receptor 
2; Crhr: ciirtico tropin releasing hormone receptor; Crry: complement rcccptar- 
rclatcd protein; Csflr. CSF 1 (M-CSF) nxcrAv/c-rms/CDl 15; Csf2ra: CSF 2 
(GM-CSF) receptor, arpha, low-afTinify/CDl 16; Csf2rbl: CSF 2 (GM-CSF) 
receptoc, bete X low-attiniiy/U. 3 rccrpinr-likc protein (AlC2B)/CDwl3J; 



^f2rb2: CSF 2 (GM-CSF) receptor, beta 2, low-affinity/IL-3 receptor (A1C2A); 
"Ctsb: cathepsin B; Ctsc: catbepsin C; Ctsd: cathepsin D; Ctsc: caaicpsin E; Cosg: 
aahepsin G; Ctsh: latbxpsin H; OsL cathepsin L; Ctss: cathepsin S; Cybb: 
r/iochrome b-245, beta; Drd2; dopamine receptor 2; E2tl: H2i : transcription 
factor 1; Ear2: eosmcphil-associated ribcnucleasc 2; Ebi3: Epstein-Barr virus- 
induced gene 3/cytokine nrccptor-like molecule (EBI3>, E12: Balb/c ceatrophil 
elastise; Ela2: elastase 2; Erh: enhancer of rudimentary homolog (Drosophila); 
Etohi6: cthanol induced cVslcrol rcguhuory clement binding transcription factor 1 
(SKEBF I ) homolog; F2ri2: coagalauon factor TI (thrombin) rcccptor-likc 2; 
Fcer 1 g: Fc receptor, IgE, high affinity \ ganima polypeptide; Fcgr2b: Fo receptor, 
IgG, low affinity lib: Fcgr3: Fc nxeptor, IgG, low aifinity III; Fprl: fornryl 
peptide receptor L y lMLP recrptar, Gabpbl: G A repeat binding protein (GABP- 
bctal subunitX Gata2: GATA-binding protein 2; Gnas: guanine nucleotide 
binding protein, alpha stimulating; Gnb2-rsl : guanine nucleotide binding protein, 
beta-2, related sequence 1; Gpx3: glutathione peroxidase 3; Grg: related to 
Drosophita groudMi («cnc; Gridl: ghitamau: receptor diannel subwnit delta I; 
Gm: granulin; Gstml: glutarhiono-S-lransfcrase, mu I; Gus-s: beta-ghicuroni- 
dasc structural; Gys3: gheogen bTnlhasc 3, brain; H2-D: liistoct>inptU&iury 2, D 
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region locua 1 ; Uist2: historic gene complex 2; Hist5~2ax: rl2A hist one family, 
member X; Hingi: high mobility group protein I; Hsp60: heat shock protein, 60 
kDa; Htr5a: 5-hydroxytryptamine (serotonin) receptor 5A; Idbl: inhibitor of 
DNA binding l/helix-loop-helix DNA binding protein regulator (Id); Idb2: 
inhibitor of DNA binding 2; Ifhgr. interferon gamma receptor; Ifhgr2: interferon 
gamma receptor 2; Ii: la-associated invariant chain; Ilia: ILl alpha; Illr2: IL1 
receptor, type II; H2rg: IL2 receptor, gamma chain; IWra: IL4 receptor, alpha; 
111 Orb: IL10 receptor, beta; JJ 17r. ILl 7 receptor, Irfl: interferon regulatory factor 
1; lrC: interferon regulatory fector-2; Itgb2: integrin beta 2 (Cdl8); ItprS: inositol 
1 ,4,5-trisphosphate receptor (type 2); Jundl: Juu rwto-oncogene-related gene 
dl/transcriptbri factor JUN-D; Klf2: Kruppd-like Sartor LKLF; L-CCR: lipopoly- 
saccharide inducible C-C chemokine receptor-related; Loi2: lipocalin 2; Ldlr: 
low density lipoprotein receptor; Lspl: Lyrnpnocyte-specific l/$37/pp52; Lstl: 
kiicocyte-specific transcript 1 ; Ltb4n leiikotricne B4 receptor; Lthn lymphotoxin- 
beta receptor, Lrf: lactotransferrin; Ly64: lymphocyte antigen 64; Ly6e: lympho- 
cyte antigen 6 complex, tocus E; Lyll : JymphoblastQmic leukemia/bHLH fector; 
Lyzs lysozyme; M6pr rnanno»-6-phosphate receptor, cation dependent; Mad: 
Max dimcrization protein; Man2cl: rnannosidase, alpha, class 2C, member J; 
Max.: Max protein; Mar MYC-associated zinc finger protein (purine-binding 
transcription factor); MBP: coainophij granule major bask protein precursor; 
McptS: mast cell protease 8; Mil: myeloid/lymphoid or mixed-lineage letikemia; 
Mmp 1 3: matrix mctalloproteinase 13/collagcnase; Mmp9: matrix metallo protein- 
ase 9/gelatinase B; Mpo: myeloperoxidase; Myb: myeloblastosis oncogene; 
Mybl2: myeloblastosis oncogene-like 2; Myc: myclocytomatosis oncogene; 
Myln: myosin light chain, alkali, rionmuscle; Nfatc2: nuclear factor of activated T 
cells, cytoplasmic 2; Nft2: nuclear factor, erythnjid -derived 2, 45 kDa; Nfkbl : 
NF-kappa-B (p!05); Ngp: neutrophilic granule protein; NMDRGB: N-methyl-D- 
aspanate receptor gbdamate-binding chain homolog; Npml: nudeonhosmin 1; 
Nr4al: nuclear receptor subfamily 4, group A, member 1; Osi: oxidative stress 
induced; P2rxl: purinergic receptor P2X, %aud-gated km channel, I; P2ry2: 
rxirinergk receptor P2Y, G-nrotein-<oiipled 2; P40-8: P40-8, firndkmal/laminin 
receptor; Phxl: pre B-cell leukemia transcription factor 1; Ptb: properdin factor, 
complement; Piral: paired-Ig-like receptor AI; PiraS: paired-Ig-like receptor 



A5; Pira6: paired-Ig-like receptor A6; Pirb: rjaired-Ig- like receptor B; Plaur. 
urokinase plasminogen activator receptor; PMI: putative receptor protein (SP: 
P 17 1 52 ); Pml: promyelocyte leukemia; Prg: proteoglycan, secretory granule; 
Prg3: proteoglycan 3/cosinophil major basic protein 2; Prtn3: proteinase 3; 
Psma2: proteasome (prosome, macropain) sub unit alpha type 2; PtrnM: pro thy- 
mosin beta 4; Ptprc: protein tyrosine phosphatase, receptor type, C; Rac2: 
RAS-related C3 botulinum substrate 2; Rarg: retinok acid receptor, gamma; 
Rela: avian reu^tdwndoaVaiosis viral (v-rel) oncogene homolog A/NF-kjippa-B 
p65; Rpll9: rirxraomal protein L19; RPL8: ribosomal protein L8; Rps6kal: 
ribosornai protein S6 kinase polypeptide 1; Rps8: rtbosonuil protein S8; Rtn3: 
reticulon 3; S100a8: S100 calcium binding protein A8 (cajgranulin A); SV00a9: 
SI 00 calciiinvbiuding protein A9 (calgranuJin B); Sdfr2: stromal cell-derived 
fector receptor 2; Sell: selectin L (lymphocyte adhesion molecule 1); Sema4d: 
semaphorin 4D; Seppl: selenoprotein P, plasma, 1; Sfpil: SFFV proviral 
integration I; Shfdgl: split hand/foot deleted gene 1; SklOal: solute carrier 
family 10 (sodium/bile acid co transporter family), member 1; Sipi: secretory 
leukocyte protease inhibitor, Soxl5: SRY-box containing gene 15; Spi2-1: serine 
protease inhibitor 2-1 ; Srb 1 : scavenger receptor class B 1 ; Stat3 : signal transducer 
and activator of transcription 3; Stat5a: signal transducer and activator of 
transcription 5A; Stat6: signal transducer and activator of transcription 6; Stral4: 
basic-hehx-loop-helix protein-retinoic acid induced; Hod: TBX1 protein/LPS- 
induced TNF-aJpha factor homolog; Tfcrgb: T-cell-receptor germline beta-chain 
gene constant region; Tcrg-V4: T-ceU-receptor gamma, variable 4; Tctexl: 
t-complex testis expressed I; Tfdpl: transcription fector Dp I; Tiflb: transcrip- 
tional intermediary fector 1, beta; Tlr4: toil- like receptor 4; Tnfhfla: TNT 
receptor superfamily, member la; Tnfisflb: TNF superfamily, member lb; 
TbmmTOa: transtocase of outer mftochondrial membrane 70 (yeast) homolog A; 
Tpu triosenhosphate isamerase; Trp53: rransfonriation-relatcd protein 53; (Jbb: 
ubiquhin B; Usf2: upstream transcription fector 2; Yhxl : Y box transcription 
factor; Yhx3: Y box binding protein; Zfpl 1--6: zinc finger protein si 1-6; Zfpl8: 
zinc finger protein 1 8 homolog; Zfp36: zinc ringer protein 36; Zfpl 62: zinc finger 
protein 162; Zfp216: zinc finger protein 216; Zfpml: zinc finger protein, 
rmiltitvpe 1; Znfiil al: zinc finger protein, sub&mily 1A, I (Bcaros); Zyx: zyxin 
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Bacterial lipopolysaccharide (LPS) evokes several 
functional responses in the neutrophil that contribute 
to innate immunity. Although certain responses, such as 
adhesion and synthesis of tumor necrosis factor-**, are 
inhibited by pretreatment with an inhibitor of p38 mi- 
togen-activated protein kinase, others, such as actin as- 
sembly, are unaffected. The aim of the present study was 
to investigate the changes in neutrophil gene transcrip- 
tion and protein expression following lipopolysaccha- 
ride exposure and to establish their dependence on p38 
signaling. Microarray analysis indicated expression of 
13% of the 7070 Affymetrix gene set in nonstimulated 
neutrophils, and LPS up- regulation of 100 distinct 
genes, including cytokines and chemokines, signaling 
molecules, and regulators of transcription. Proteomic 
analysis yielded a separate list of up-regulated modula- 
tors of inflammation, signaling molecules, and cytoskel- 
etal proteins. Poor concordance between mRNA tran- 
script and protein expression changes was noted. 
Pretreatment with the p38 inhibitor SB203580 attenu- 
ated 23% of LPS-regulated genes and 18% of LPS-regu- 
lated proteins by £=40%. This study indicates that p38 
plays a selective role in regulation of neutrophil tran- 
scripts and proteins following lipopolysaccharide expo- 
sure, clarifies that several of the effects of lipopolysac- 
charide are post-transcriptional and post-translational, 
and identifies several proteins not previously reported 
to be involved in the innate immune response. 



Lipopolysaccharide (LPS), 1 a component of the outer cell wall 
of Gram-negative bacteria, evokes a variety of functional re- 
sponses in the human neutrophil (PMN) after binding to a 
plasma membrane receptor complex that involves the Toll-like 
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"advertisement" in accordance with 18 U.S.C. Section 1734 solely to 
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receptors (TLRs) (1-5). These "immediate" functional re- 
sponses, including actin assembly, adhesion, activation of nu- 
clear factor-kappa B (NF-kB), and priming for an enhanced 
secretory response and for release of reactive oxygen interme- 
diates, appear to be central both to the innate immune re- 
sponse and to the pathogenesis of several inflammatory human 
diseases, including sepsis and the acute respiratory distress 
syndrome (6). p38 mitogen-activated protein kinase (p38 
MAPK) has been shown to mediate LPS-induced PMN adhe- 
sion, NF-kB activation, and TNF-a and IL-8 translation and 
release (7), and its hlockade attenuates LPS-induced PMN 
accumulation in the airspace (8). However, other cascades al- 
most certainly lead to downstream effectors of the LPS signal; 
for example, actin assembly appears to be p38 MAPK-inde- 
pendent (9). An improved understanding of the transcriptional 
and translational responses of the neutrophil to LPS and the 
modulation of these responses by p38 MAPK might carry 
pathogenetic and therapeutic implications. 

Historically, it has been believed that the downstream PMN 
transcriptional response to LPS is static and that PMN func- 
tional responses to LPS that depend on de novo protein syn- 
thesis are primarily limited to the release of cytokines (10). 
However, recent studies indicate a robust transcriptional re- 
sponse (11). To date, most studies have relied upon and re- 
ported a short list of functional assays of the LPS-exposed 
PMN; therefore, no exhaustive investigation of either the tran- 
scriptional response or protein synthetic repertoire of the PMN 
has been reported. Although several techniques have been used 
to evaluate transcripts, the screening of global changes in 
mRNA by microarray analysis has only recently become possi- 
ble. In this way, thousands of genes can be screened in an 
unbiased fashion for transcript abundance. Such genomic 
screens in mammalian ceils have previously been applied to 
define altered expression profiles in response to agonists (12) 
and to drug action (13) and during cell cycle progression (14). 

Although DNA microarray technology is expected to provide 
insight into the response of the human PMN to LPS (15), 
inliibition of LPS-stimulated IL-1 and TNF-tr production by 
p38 MAPK inhibitors in THP-1 cells (16) and of TNF-a synthe- 
sis in human PMNs (9) occurs at a translational level and 
would therefore not be detected by DNA microarrays. Further- 
more, in other systems, such as yeast and human liver, mRNA 
and protein levelB show poor correlation (17, 18). Proteomics is 
a complementary tool for assessing global changes in cellular 
protein expression, thereby providing additional insight into 
cellular signal regulation. A proteomic approach has proven 
useful in different systems for dissecting signal transduction 
cascades and describing their output (19, 20) and has even 
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recently been used to detect novel upstream messengers in- 
volved in LPS signal transduction (21). Wo have applied DNA 
microarrays and proteomics to define and compare transcrip- 
tional and post-transcriptional alterations in the LPS-exposed 
PMN and to establish the dependence of these alterations on 
p38 MAPK signaling. 

EXPERIMENTAL PROCEDURES 

MateriaZs^Endotoxin-free reagents and plastics were used in all 
experiments. Aprotmin, leu pep tin, AEBSF, E-64, peps ta tin, and besta- 
tin protease inhibitors} spermine HC1, and a-cyano-4-hydroxydnnamic 
acid (CHCA) were all purchased from Sigma Chemical Co. (St. Louis, 
MO). SB203580, a p38 MAPK inhibitor, was purchased from Calbio- 
chem-Novabiochem Corp. (San Diego, CA). For two-dimensional PAGE, 
rehydration buffer, equilibration buffers, vertical electrophoresis solu- 
tions, and 10% homogeneous poly aery 1 amide slab gels were purchased 
from Genomic Solutions, Inc. (GSI, Ann Arbor, MI). Sequencing grade 
porcine trypsin was purchased from Promega (Madison, WI). 

LPS Incubation — PMNs were isolated by the plasma Percoll method 
(22), a technique that yields less tlian 5% monocytic contamination, and , 
resiiflpended at a concentration of 15.4 X 10°/ml in RPMI 1640 culture 
medium (BioWhittaker, Walkers ville, MD) supplemented with 10 mM 
HEPES (pH 7,6) and 1% heat-inactivated platelet-poor plasma. After 
addition of 100 ngfanl Escherichia coli 0IH:B4 LPS (List Biological), 
incubation was carried out with continuous rotation (4 h, 37 *C) both in 
the presence and absence of SB203580. Both Affymetrix analysis and 
proteomic analysis utilized 75 X 10 6 cells. For microarray analysis, 
nonstiinulated and 4-h-treated PMNs were collected from three sepa- 
rate donors. A more detailed time course following LPS exposure was 
performed using polymerase chain reaction. For proteomic analysis, 
LPS incubations from separate donors (n — 6) were performed and then 
analysed individually. Control and post-LPS incubation PMNs were 
washed (0.34 M sucrose/1 inM EDTA/10 mM Tria) and then lysed in a 
modified rehydration buffer (GSI, Ann Arbor, MT) supplemented with 2 
M thiourea, 50 mM dithiothreitol (DTD, 22.6 mM spermine HC1, and a 
mixture of six protease inhibitors (10 ^tg/ml aprotiniii, 10 ;tg/ml leupep- 
tin, 2 mM AEBSF, 5 ;iM E-64, 1 fM pepstatin, 10 }xu bestaun). DNA was 
pelleted by cejitrifugation at 260,000 X g for 60 min (23). 

Affymetrix Oligonucleotide Array — Five micrograms of total RNA was 
isolated with TRIzol (Invitrogen) and RNeaay columns (Qiagen) and 
subsequently labeled with biotin as described by AfFymetrix. Briefly, 
first-strand synthesis was accomplished with Superscript Tl reverse tran- 
scriptase (Invitrogen) using a TT-oligotdT)^ primer for 1 h at 42 a C 
followed by second-strand synthesis using E. coli DNA polymerase I end 
KNase H (Invitrogen) at 16 *C for 2 h. Double-stranded DMA was used as 
a template for in vitro transcription with T7 RNA polymerase in the 
presence of biotin-labeled IJTP and OTP using the BioArray High Yield 
RNA transcript labeling kit (Enzo). Fifteen micrograms of cRNA was 
fragmented and used for hybridization to Asymetrix HuOene 6800FL 
Genechipa. Each sample was hybridized initially using a Test2 Genechip 
to test for sample degradation and full-length in vitro translation. Data 
were analyzed using Affymetrix Genechip software. Results from three 
separate donors were analyzed. 

Reverse Transcription and Polymerase Chain Reaction — cDNA was 
prepared by reverse transcription using 2 /ig total RNA, derived from 
20 X 10° cells that were treated as indicated. Polymerase chain reac- 
tions were performed using specific primers for Mr- J, TNF-ol, MCP-1, 
p65 t SlQQA4t and glyceraldehyde-3-phoaphate dehydrogenase. 

Twa-dimetisiGiial PAGE— The protein concentration of the lyaatea 
was measured as described by Bradford et al. (24). Poor isoelectric 
focusing (IEF) results were encountered unless the porycationic sperm- 
ine was diluted (data not shown); therefore, lysates were diluted with 
rehydration buffer (GSI, Ann Arbor, MI) to achieve a final spermine 
• concentration of 6 mM. Equal protein loads (1.6 rog) of control and 
IPS-stimulated neutrophils were used to rehydrate IEF gels overnight 
(18 cm, pH 3-10 nonlinear Immobiline Dry Strip IEF gels, Ameraham 
Biosciences; Pis ca taw ay, NJ). IEF was performed at 20 *C to 100-kVh 
(Phaser, GSI) under mineral oil, followed by two 10-min SDS equilibra- 
tion steps (DTT and then iodoacetamide-containing equilibration buff- 
ers, GSI) and then by vertical electrophoresis on 10% homogeneous 
polyacrylamide slab gels (GSI) at 600 V. Protein spots were visualized 
by agitation in colloidal Coomassie Brilliant Blue G-250 (16 h) (26), 
followed by des taint ng in deionized water (20 h). In separate experi- 
ments, control and LPS-stimulated PMN lysates from three donors 
were pooled and then analyzed by two-dimensional PAGE using over- 
lapping narrow isoelectric point (pi) ranges (IS cm, pH 5.0-6.0, 5.5- 



6.7, and 6-11, Ameraham Biosciences, Piscataway, NJ). Identical TEF 
and vertical electrophoresis parameters were used for all gels. 

Image Analysis of Two-dimensional Gels — Colloidal Coomassie- 
stained gels were digitized using a Powerlook II (UMAX Data Systems, 
Inc., Taiwan) flatbed scanner with 8-bit dynamic range and 150-dpi 
resolution Biolmage (GSI, Ann Arbor, MI) 2D-Anaiyzer software was 
used to locate, quantitate, and match protein spots on the control and 
LPS gel images. Analysis was performed by assigning 50 common 
anchor spots between paired images; the remaining spots were com- 
pared by a constellation- matching algorithm. All data were then care- 
fully reviewed by the operator to account for any discrepancies. Protein 
loading between control and experimental gels may have varied be- 
cause of inconsistencies in rehydration of the different IEF gel strips; 
therefore, gel images were normalized so that the sum of the integrated 
intensities of all matched spots on paired gels was made equaL Control 
and LPS-stimulated gel images from individual donor experiments 
were matched to generate composite images; composite images were 
then matched into a master composite image to track the LPS response 
of protein spots among different donors (26). Only those spots that were 
common (image-matched) to all original 12 (pH 3,0-10.0) gels were 
considered for further analysis. For these spots, the LPS-induced 
change in integrated intensity in the six experiments was subjected to 
statistical analysis with a two-tailed Students t test, and those spots 
withp < 0.05 were identified by peptide mass fingerprinting (described 
below). For the narrow range (pH 5.0-6.0, 5.5-6.7, and 6-11) two- 
dimensional PAGE experiments using pooled donors, only those spots 
with concordant regulation exceeding 1.5-fold or that appeared de novo 
in the LPS gel in two repeat experiments were further analyzed. 

In-gel Tryptic Digestion — In-gel digestion of protein spots was per- 
formed with sequencing grade porcine-modified trypsin using the 
method of Hellman et al. (27). Tryptic peptides were then extracted (50 
fd of 50% acetonitrile/5% triXLuoroacetic acid, 2 h), and the supernatant 
was taken to dryness in a vacuum centrifuge and then re die solved in 
trifluoroacetic acid (20 p\, 0.5%). Peptides were then purified and con- 
centrated using ZipTip clfl pipette tips (Millipore, Bedford, MA). 

MALDI-TOF Afass Spectrometry — Analyses were performed on an 
Applied Biosystems matrix-assisted laser desorption ionization time-of- 
flight (MALDI-TOF) Voyager-DE PRO mass spectrometer (Framing- 
ham, MA) operated in delayed extraction mode. Samples (0.5 pX) were 
spotted onto a sample plate to which matrix (0.5 p\ of 10 mgfanl CHCA) 
was added. The sample-matrix mixture was dried at room temperature 
and then analyzed in reflector mode. CHCA was also spotted alone as a 
negative control. Spectra were the sum of 100 laser shots, and those 
peaks with a signal-to-noise ratio of greater than 3:1 were selected for 
data base searching. Spectra were internally calibrated using autolytic 
trypsin peptides (m/z 842.51, 2211.10). 

Data Base Searching Algorithm — The monoisotopic masses for each 
protons ted peptide were: (a) entered into the program MS-Fit (available 
at prospector.ucsf.edu) for searches against the Swiss-Pro t, NCBI, 
and GenPept databases, and (b) entered into Mascot (available at 
matrixscience.com), an algorithm testing statistical significance of pep- 
tide mass fingerprinting identifications. For MS-Fit searches, masses 
derived from trypsin, CHCA, keratin, and Coomassie Brilliant Blue 
G-250 were excluded. Search parameters included a maximum allowed 
peptide mass error of 0. 1 Da (0.8 Da in the few instances in which linear - 
mode was used), consideration of one incomplete cleavage per peptide, 
pi range of 3.0-10.0, and molecular mass range of 1-200 kDa. Accepted 
modifications included carbamidomethylation of cysteine residues 
(from iodoacetamide exposure following IEF) (28) and methionine oxi- 
dation, a common modification occurring during SDS-PAGE (29). Pro- 
tein identifications were assigned when three criteria were met: 1) 
statistical significance (p < 0.05) of the match when tested by Mascot 
(matrixscience.com); 2) >20% sequence coverage by the tryptic pep- 
tides; and 3) concordance (±15%) with the molecular weight and pi of 
the parent two-dimensional PAGE protein spot. The following special 
exceptions were considered: (a) protein identifications not fuUTlling 
criterion 2 were still assigned if criteria 1 and 3 were fulfilled and no 
other Homo sapiens proteins with peptide mass-matched p values < 
0.05 were identified by Mascot; Kb) if criterion 3 was not fulfilled (lower 
than expected molecular weight), a cleavage product of the identified 
protein was inferred, and the cumulative molecular weight of the tryptic 
peptides was compared with that of the two-dimensional-PAGE spot to 
ensure that it was not exceeded; (c) if criterion 3 was not fulfilled (isolated 
discordance between theoretical and observed pi), post-translational mod- 
ification of an unrecovered peptide was inferred; and (d) if two or more H. 
sapiens protein assignments with > 4 mutually exclusive matching pep- 
tides were identified, a protein mixture in the two-dimensional PAGE 
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spot was inferred and further analysis halted (quantitative ainclusioiis 
regarding the individual protein constituents could not be drawn). 

RESULTS 

Genes Differentially Expressed in LPS 'Stimulated Neutro- 
phils-- Human PMNs were left untreated or incubated in the 
presence of 100 ng/ml LPS for 4 h. As a control to confirm that 
the PMNs were quiescent at baseline and that LPS resulted in 
normal stimulation, mRNA was isolated, cDNA was prepared, 
and PGR for TNF-ot was performed. Little TNF-a expression 
was seen in nonstimulated cells, whereas LPS treatment led to 
an increase in expression in each of the donors subsequently 
used tor microarray analysis (data not shown). No macrophage- 
colony stimulating factor receptor transcript was detected by 
oligonucleotide microarray analysis, confirming there was no 
significant monocytic contamination. 

Human PMNs express a limited repertoire of mRNA tran- 
scripts at baseline but respond to LPS with differential expres- 
sion of genes in many families. Considering only those genes 
present by microarray analysis in all three donors, unstimu- 
lated PMNs expressed 13.0% (923 of 7070 genes) of the Af- 
fymetrix gene set. Gene classes represented at baseline include 
metabolic enzymes, structural proteins, receptors, signaling 
proteins, and transcription factors. By comparison, human 
monocytes expressed -40% and human fibroblasts -35% of the 
represented genes (data not shown). By the criterion of a >3- 
fold increase in expression in all three donors on Asymetrix 
oligonucleotide array analysis, exposure of PMNs to LPS for 4 h 
resulted in the up-regulation of 100 genes (Table I). 

Genes from several different functional classes were induced 
in PMNs following LPS exposure. Of interest, a number of 
transcriptional regulators were induced, including transcrip- 
tion factors of the NF-kB family. The transcriptional NF-kB 
complex has previously been implicated in the regulation of the 
genes induced by LPS (11). The genes for several cytokines and 
chemoklnes were also found to be up-regulated. TheBe include 
TNF-a, IL-lfr IL-6, MCP-2, MIP-3a, and MIP-1& (Table I). 
PCR was performed to confirm the results from the microarray 
analysis. PCR analysis on selected genes indicates that the 
time course for changes can be rapid or delayed but parallel the 
changes found in the array at the 4-h time point (data not 
shown). Other up-regulated genes included those for metabolic 
enzymes, immune response molecules, kinases, phosphatases, 
signaling molecules, adhesion and cytoskeletal components, 
interferon-etimulated genes, and those with unknown or mis- 
cellaneous function (Table I). 

LPS stimulation of PMN also resulted in the down-regula- 
tion of 56 genes (Table II). Down-regulated genes were identi- 
fied as transcriptional regulators, protein and lipid kinases and 
phosphatases, structural molecules, and signaling molecules. 
Genes for metabolic proteins were also evident, as were several 
uncharacterized genes. 

Two-dimensional PAGE and Image Analysis — In contrast to 
the limited number of transcripts found at baseline, PMNs 
were found to express a large number and variety of proteins in 
the nonstimulated state (Fig. 1, A and C, and Tables III-V). 
Reproducible protein expression patterns were found on the pH 
3.0-10.0 gels, and the majority of proteins fell in the pH 5.0- 
7.0 range (Fig. 1A). The basic region (pH > 7.0) consistently 
exhibited poor resolution, precluding meaningful image analy- 
sis and further workup (data not shown). Depending on the 
spot-finding parameters (minimum spot intensity, filter width) 
selected on the image analysis software, spot-by-spot manual 
editing was found to be necessary to avoid over- and underde- 
tected spots; moreover, further manual editing was performed 
to screen for unmatched and mismatched spots following 
matching of paired control and LPS-stimulated gels. After spot* 



editing, -1200 well-resolved spots were evident on each pH 
3.0-10.0 gel. In an attempt to improve resolution of the pi 
range bearing the greatest number of well-resolved spots, over- 
lapping narrow pH range gels (pH 5.0-6.0, 5.5-6.7, 6-11) were 
also run. Of interest, a similar number of well-resolved spots 
(-1200) were detected on the narrow pH range gels (Fig. 1. C 
and D). Assuming a detection limit for Coomassie of 15 ng (0.25 
pinol, or 1.5 X 10 1A molecules, for a 60-kDa protein) and a 
protein load per gel corresponding to 75 X 10 s PMNs, we 
estimate a detection limit on our gels of 2000 molecules/cell for 
a 60-kDa protein. As investigators have suggested in other cell 
lines with the use of high resolution two-dimen3ional-PAGE 
methods (30), we estimate that > 10,000 proteins are expressed 
in the resting PMN. 

Human PMNs respond to LPS with the differential expres- 
sion of a large number of proteins. In the six individual pH 
3.0-10.0 experiments, the number of protein spots that in- 
creased in integrated intensity by at least 50% following LPS 
exposure was 185, 122, 104, 104, 96, and 131, respectively. The 
number of protein spots that decreased by at least 50% follow- 
ing LPS exposure was 72, 151, 102, 98, 128, and 97, respec- 
tively. Although gel-to-gel regional variability in resolution was 
expected to account for individual spots not being well visual- 
ized on particular gels, only those spots that were matched to 
all 12 original gels were analyzed further. Overall, the number 
of spots matched to all 12 original gels was 125. The numbers 
of spots that were both matched to all 12 original gels and that 
increased by at least 50% in integrated intensity in the indi- 
vidual experiments following LPS exposure were 46, 13, 17, 27, 
22, and 20, respectively. The numbers of spots that were 
matched to all 12 gels and that decreased by at least 50% were 
6, 22, 17, 22, 34, and 28, respectively. The LPS-induced change 
in integrated intensity of the 125 spots that were matched to all 
12 original gels was subjected to statistical analysis with a 
two-tailed Student's t test, and those spots with statistically 
significant (p < 0.05) regulation among the six experiments 
were identified by peptide mass fingerprinting (Table III). 

Identification of LPS-regulated Proteins— Several proteins 
were consistently up-regulated on the pH 3.0-10.0 gels (Table 
HI), including regulators of inflammation (annexin HI) and 
signaling molecules (Rab-GDP dissociation inhibitor /3). Sev- 
eral actin fragments were seen to be consistently up-regulated 
in the six experiments following LPS exposure (Table IH). Of 
interest, the proteasome 0 chain was also consistently up- 
regulated. Down-regulated proteins included other signaling 
molecules, such as Rho GTPase activating protein 1. 

On the pH 5.0-6.0 and 5.5-6.7 gels, several proteins were 
found to show increases of greater than 1.5-fold following LPS 
exposure (Tables IV and V), including cytoskeletal proteins, 
such as moesm, nonmuscle myosin heavy chain, and a putative 
phosphorylated form of nonmuscle myosin heavy chain, and 
signaling molecules, such as protein phosphatase 1 and P0 4 - 
etathmin. The putative phosphorylated form of nonmuscle my- 
osin heavy chain (spot #1101) was positioned 0.03 pH unit more 
acidic than the unmodified protein (spot #1102) (Fig. ID) and 
was distinguished by a tryptic peptide (mlz 1366.74) not pres- 
ent in the unmodified protein, consistent with phosphorylation 
of serine 685. Serine 685 is predicted by NetPhos 2.0 Prediction 
Server (available at www.cbs.dtu.dk/services/NetPhos/(3D) to 
be a lrigh probability phosphorylation residue and by Scan- 
Prosite (www.expasy.ch/tools/scnpsite.html) to be a substrate 
for protein kinase C. The tryptic phosphopeptide identified in 
PO^-stathmin, extending from residues 15 to 27 (1468.7 Da), is 
consistent with phosphorylation of either serine 16, a known 
substrate for Ca 2+ /calmodulin (CaM)-dependent kinases (32), 
or serine 25, a known substrate for p38S and ERK (Fig. 2A) 
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Tabus I 

Human neutrophil genes induced after 4h of LPS exposure 



Description GenBunk™ no. Chunge-fuld 



Transcriptional regulation 

Plciomorphic adenoma gene -like 2 D83784 16.8 

NFKB2 S76638 12.3 

NFKBJE U91616 11.5 

p65 L19067 8.4 

BCL3 U05681 7.7 

X-bax binding protein 1 M31627 7.5 

Metal-regulatory transcription factor 1 X78710 7.4 

Ets-2 J04102 7.4 

c,Rel X75042 G.2 

NFKB1 M5S603 5.8 

Basic leucine zipper transcription factor, ATF-like U15460 4.7 

1KB M69043 ■ 3.8 

MAX dmicrization protein L0GS95 3.6 

DIF2 S81914 3.1 

Cytokines and receptors 

MCP-1 M69203 78.7 

M1P-1& M72885 48.8 

aHelix coiled-cutt rod homolog AF014958 20.8 

IL-lfl XO4500 17.6 

GRQ3 (beta) M57731 17.3 

TNF-a X02910 14.5 

MlP-3« U64197 8.1 

MORA U00672 7.3 

IL-6 Y00081 6.3 

GROa X54489 4 

mm D10923 3.8 



Immune response 
Orosomucoid 

Complement component C3 

Protease inhibitor 9 

Complement component 3a receptor 1 

Protease inhibitor 3 

SZ-PZ/anttfeukoprotease 

ELANH2/c\a&taac inhibitor 

CD56 

Complement component PFC 

Kinases 
CNKI FNKfPJJC-like 
Cot 
Pim-2 
L1MK2 



X02544 
K02765 
U71364 
U28488 
L10343 
X04470 
M93056 
Y00636 
M83652 



U56998 
D14497 
U77735 
D45906 



20.2 
12.8 
9.5 
6.1 
4.9 
4.7 
4.6 
3.8 
3.5 



16.2 
11.9 
9.5 
4.3 



Phosphatases 
PAC-I JDUSP2 
DUSP5 
PHAl 



LI 1329 
U15932 
U73477 



11.8 

5.3 
3.4 



Signaling molecules 
TNFAJP1/A20 
TRAF1 
RonBP2 
GNA1S 
PTAFR 

Adhesion and cytoskeleton 



M59465 
U19261 
D42063 
M63904 
D10202 



10 

6.2 
5.6 
5.2 
3.9 



ICAM1 


M24283 


22.4 


CEACAM1 (bilary glycoprotein) 


XI 6354 


6.3 


UMS1 


U09284 


6.1 


SNUactin bundling protein 


U03057 


5.9 


Golectin-J/LGALSl 


M57710 


4.7 


MEMD/ALCAM 


U30999 


4.2 


CD44 


HG2981— HT3125 


3.9 


TSG-6 


M31165 


3.7 


Metabolic 






GTP cyclohydrolase I 


U19523 


13.5 


NDUlhf2f ubiquinone reductase 


M22538 


8.6 


ASMARA prot cos ome iota) 


X59417 


8.4 


UDP-gaiactcse transporter (SLC35A2) 


D84454 


7.3 


PLAU (urokinase) 


X02419 


6.4 


/OWtZ/Lrkynuretiin© hydrolase 


U57721 


5.5 


AMP03 


D12775 


5 


P4HA 7 /prolyl 4-hydroxylase 


M24486 


4.7 


y Glutamylcysteine synthetase 


L35546 


4.5 


ATPGD 


J05682 


4.2 


ATP6S1 


D16469 


4 
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Table I — continued 



Description GenBank 1 * no. Change- fold 



Glycerol kinase X68285 3.6 

FACL1 L09229 3.5 

AK3 X60673 3.3 

Interferon- inducible 

ISG1S M13755 22.5 

Mxl M33882 19.4 

IF156 M24594 12.1 

INDO M34465 5.2 

GBPJ M55542 4.3 

PJRKR U50648 3.7 

IFIT4 U52513 3.6 

IFJ54 M14660 3.5 

IFI58 U34605 3.5 

/FP35 U72882 3 

Other 

Gos2 M72885 48.8 

MWC/1AP1 U37546 7.2 

IOAA01Q6 D14661 5.1 

KIAA0118 D42087 5 

SNAP23 U55936 5 

CASP5 U28015 4.8 

KIAA0U3 D30755 4.8 

WAAQ255 D87444 4.7 

Hepatoma-derived GF D16431 4.7 

D28235 4.6 

CD4S M37766 4.3 

UNCI 19 homolog U40998 4.2 

KIAAQ151 D63485 3,9 

J2o626 XM035G60 3.8 

^rniexinVT/ J04543 3.7 

/OAAOiiO D14811 3.7 

Adrnnoxnedullin D14874 3.7 

AIM! U83115 3,6 

KIAA0250 D87437 3.2 

P5-7 L06175 3.2 

Scavenger receptor expressed hy endothelial cells D63483 3.2 

VHL L15409 • 3.1 



(33). Assuming that no other multiply phosphorylated stath- 
min species had escaped detection, analysis of the integrated 
intensities of the P0 4 -stathmin and stathmin spots indicates 
that the percentage of the P0 4 form of total cellular stathmin 
increased from 11% to. 38% with LPS stimulation (Fig. 2B). 
This is similar to a previous report of an increase from <10% to 
35—40% of the Ser^-phosphorylated form in Jurkat cells stim- 
ulated with anti-CD3 (34). 

Effect of SB203580 on LPS-stimulated Gene Expression- 
Gene expression analysis of PMNs stimulated with LPS indi- 
cated that the majority of genes induced by LPS were* unaf- 
fected by prior treatment of PMN with SB203580. Of the 100 
genes up-regulated by LPS, the up-regulation of 23 was inhib- 
ited by greater than 40% (Table VI). The majority of these 
genes affected by SB203580 were inhibited by less than 60%, 
whereas only six were inhibited by greater than 80%, all of 
which represent previously identified interferon-stimulated 
genes. Induction of cytokine genes by LPS, with the exception 
of IL-6, was generally unaffected by SB203580. 

Effect ofSB203580 on LPS-stimulated Protein Expression-— 
Similar to the effect of SB203580 on LPS-stimulated gene 
expression, little effect of SB203580 was seen on expression 
levels for the majority of LPS-regulated proteins (Table VID. 
Two exceptions are annexin ID. and a-enolase, for which LPS- 
stimulated expression was attenuated in the presence of the 
p38 MAPK inhibitor. 

Comparison of Microarray and Proteomics Results — Of the 
LPS-regulated proteins identified by peptide mass fingerprint- 
ing for which probes were present on the oligonucleotide mi- 
croarray, poor concordance was found at the niRNA level (Table 
VIII). For 13 LPS- up-regulated proteins, 2 corresponding 



mRNA transcripts were up-regulated, 1 was down-regulated, 5 
were unchanged, and 5 were not detected by the Affymetrix 
chip. For 6 down-regulated proteins, 3 corresponding tran- 
scripts were down-regulated, 1 was unchanged, and 1 was not 
detected. Varying patterns of LPS regulation emerge for those 
candidates detected at both the transcript and protein level. 
Proteasome 0 chain was up-regulated at both the transcript 
and protein levels (Table VIED, with no notable effect of 
SB203580 on expression at either level. Similarly, CAPl, Rho- 
GAP1, and ficolin 1 were down-regulated at both the mRNA 
transcript and protein level (Table VQI), with no notable effect 
of SB203580. Annexin HI was down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Tables VII 
and Vm). 

DISCUSSION 

Interaction of bacterial LPS with the human PMN repre- 
sents a model system for studying the activation and output of 
the innate immune system during infection and inflammation. 
A recent publication (35) describes the gene expression changes 
of a cultured monocytic cell line after infection by the Gram- 
positive bacterium Listeria monocytogenes. The cell wall com- 
ponents of Gram-positive bacteria, like Gram-negative-derived 
LPS (i.e. from E. co#), are known to signal through TLRs (36, 
37). Importantly, many of the expression changes found in 
LPS-stimulated PMNs in the present study were also described 
in the bacteria-exposed monocytic cells, indicating that many of 
the gene expression changes seen in bacterial infection are 
likely mediated by TLRs (38, 39) and that the LPS model 
system accurately reflects exposure of immune cells to infee- 
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Human neutrophil genes repressed C>4-fold) after 4 h of LPS exposure 



Description 



GenBunk™ no. 



Kinases 

CAMK, II t gamma 
Diacylglycerol kinase, delta 
PHKCL2tVT&2 protein kinase C-like 2 
MAPKAPK3 

Protein kinase Ht31, cAMP-dependent 
CAMK II 

Transporters 

SLC25A5,'Bo\ute carrier family 25, member 5 

SLC19AJ; folate transporter 

SLC2A3; facilitated glucose transporter 

Metabolic 

Carbonic anhydrase IV 
KNase A family, kS 
Glycogen phosphorylase; liver 
Inositol polyphosphate S-phosphatase 
Inositol 1,3,4-trisphosphaie 516-kinase 
Tranakctolase 

Protein phosphatase 4 t reg. eubunit 1 (clone 23840) 

Cytidine deaminase 

MQAT1 

HMOX1 

MAN2A2 

Glycogenin (also represents U31525) 

Structural 

Fibrinogen-like protein (pT49 protein) 

H2AFZ. 

PaxUlin 

Jjamin B R 

Dynamin 2 

Actinin 1 

a-Tubulin 

Tubulin, atl, isoforra 44 

Transcriptional regulators 
Ijymplwblastic leukemia-derived sequence 1 
MAX-interacting protein 1 
Nuclear factor cryihroid 2 isoform f 
Transducer of ERBB2, 1 
NFATC4 
ATF-2 tCRE-Bpa) 

Receptors 
Lympkotoxin. 0 receptor 
Folate receptor 3 (gamma) 



U50360 
D63479 
U33052 
U09578 

HG2167-HT2237 
1,07044 



J02683 
U17566 
M20681 

L10955 

U64998 

M14636 

U57650 

U51336 

L12711 

TJ79267 

L27943 

M55621 

X06985 

L28821 

HG4334-HT4604 



Z3G531 

M37583 

U14588 

L25931 

L36983 

M95178 

X01703 

HG225S*-HT2348 



M22638 
L07648 
S77763 
D38305 
L41067 
L05515 



L04270 
U08471 
1311875 



Change 



-fiUd 

-4 

-4.2 

-4.3 

-6.3 

-8 

-9.8 



-4.2 
-4.4 

-5 



-4.4 
-4.5 
-4.6 
-4.6 
-4,7 
-4.8 
-4.9 
-5.4 
-5.4 
-5.4 
-5.8 
-5.9 



-4.2 
-4.7 
-4.9 
-5.9 
-6.2 
-6,7 

-10 

-15 



-4.4 

-4.5 

-6 

«6.9 

-7.8 

-9.6 



-4.4 

-5 

-5.3 



Signaling 

Pix-o;cooZ^aaAA0006) D25304 -4.5 

ARHBWxoB M12174 -4.5 

TNFSF10; TRAIL U37B18 -6.6 

Ca 3 "" binding 

ANXII L19605 -4.8 

S100A4 M80563 -4.8 

ANX1 X05908 -4.8 



Other 
Proteolipid protein 2 

Protein phosphatase l t a catalytic subunit 

TIMP2 

KIAA0199 

Lipin 2 (KIAA0249) 

LKAfP (Jawl) 

CUGBP2 

Clone 23933 

PECAM1 

Delta sleep-inducing peptide 

DiGeorge synd. critical region gene 2 (KIAA0163) 

SELPLG; CD162; selectin P ligand 



Ij09604 


-4.9 


HG1614-HT1614 


-5 


M32304 


-5.1 


D83782 


-5.2 


D87436 


-5.6 


V 10485 


-5.8 


U69546 


-6.9 


U79273 


-7 


L34657 


-8 


Z50781 


-8.7 


079985 


-9 


U25956 


-32 



tion. Nevertheless, the reliance upon DNA microarrays alone 
affords insight only into the transcriptional response without 
corroboration at the protein level. In the present study, appli- 



cation of both DNA microarray and proteomics technology to 
our model system provides unique insight into both the cellular 
biology of the activated PMN and the responsiveness and reg- 
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Fig. 1. Two-dimensional PAGE of LPS -exposed human PMNs. 

A and B, colloidal Coomasaie Blue-Stained pH 3.0-10.0, two-dimen- 
sional PAGE gels (A, control; B, LPS- exposed) with up-regulated isolid 
arrows) and duwn-rcgulatcd (hatched arrows) proteins indicated. These 
results are representative of six separate experiments. C and D, colloi- 
dal Goomassie Blue-stained pH 5.0-6.0, two-dimensional PAGE gels 
iC, control; D, UPS-exposed) with up-regulated ( solid arrows), new 
(solid arrow x open aiTowhead\ and down- regulated (hatched arrows) 
proteins indicated. LPS-exposed PMNs from three blood donors were 
pooled. 

ulation of its transcriptional and translational machinery. As 
will be discussed below, our study identifies, in particular, 
novel aspects of the LPS-stimulated PMN transcriptional reg- 
ulation, activity in the innate immune response, signaling, 
cytoskeletal reorganization, and priming for granule release. 

In the present study, the increase in NF-kB transcript abun- 
dance (Table I) detected by the microarrays corroborates the 
findings of other studies of PMNs and monocytes (40) and 
indicates a mechanism for the responsiveness and scope of the 
PMN transcriptional machinery following LPS exposure. NF- 
kB, recently described to be activated by LPS through the 
TLR/MyD88/interleukin-l receptor-associated kinase pathway 
(1, 4), is the only transcriptional complex reported to be in- 
duced by LPS in the PMN. However, because the transcrip- 
tional NP-kB complex has been implicated in the regulation of 
only a portion of the genes induced by LPS in this study (data 
not shown), the importance of alternative transcriptional reg- 
ulators in the PMN is clear. Of interest, several other known 
and putative transcriptional regulators with less well defined 
functions were also up-regulated in the present study, includ- 
ing PLAGL2, a putative zinc-finger protein, XBP-1, MTF-l f 
Ets-2, B-ATF, and DIF2, On the other liand, LPS-down-regu- 
lated genes include ATF-2 (a known target of p38), NFATC4, 
TOB-1, NF-E2, MXI-1, and LYL1. Although the exact role of 
these gene products in regulating cell function is unknown, 



these data indicate that the range of transcriptional responses 
in the LPS-stimulated PMN is much broader than previously 
suggested and that the signaling capabilities of the PMN in the 
immune response are thereby likely extended in scope and 
specificity. 

As expected from the literature, the genes for several cyto- 
kines and chemokines, including IL- 7/3, 7L-6 1 , and AfZP-7 0, were 
found to be up-regulated (Table I). On the other hand, the 
notable absence of up-regulated cytokines in the proteomics 
experiments reflects their removal in the post-LPS incubation 
wash performed prior to lysis for two-dimensional-PAGE. Up- 
regulation of these inflammatory mediators is well documented 
in PMNs exposed to LPS and in animal models of LPS-induced 
sepsis syndrome and acute respiratory distress syndrome, a 
PMN-mediated illness (41, 42). Several genes in this family 
were up-regulated that have not, to our knowledge, been de- 
scribed in LPS-stimulated cells, including MCP-1, GR03, 
IL-lORAj arid HM74, an orphan G protein-coupled receptor 
with homology to chemokine receptors. The down-regulation of 
TNFSF10, lymphotoxin b receptor, and TNFAIP1 were also 
observed. The modulation of genes involved in cytokine signal- 
ing, including the adapter molecules TRAF1 (LPS and TNF 
receptor signaling) and TNFAIPl (TNF receptor signaling) and 
several kinases and phosphatases, may indicate a change in 
cytokine responsiveness after LPS treatment. Relevant in this 
regard from the proteomics data are: 1) the up-regulation of 
protein phosphatase 1, which has been shown to regulate PMN 
NADPH oxidase activation and translocation (43, 44) and to 
regulate LPS-induced NF-kB activation (45); 2) the down-reg- 
ulation of Rho-GAPl, which has been shown to regulate 
NADPH oxidase activity in the PMN (46); and 3) the up- 
regulation of P0 4 -stathrnin (Table IV), a phosphoprotein pos- 
tulated to function as a relayer and integrator of multiple 
signal transduction pathways (34). Several noncytokine, 
nonchemokine genes involved in the immune response were 
also up-regulated, including the complement pathway mem- 
bers C3, C3AR1, and PFC\ the protease inhibitors ELANH2 
(elastase inhibitor), SLPI t PI-3, and PI 9; and the acute phase 
protein orosomucoid. LPS regulation of C3AR1 and orosomu- 
coid expression have not previously been reported. In the pro- 
teomics experiments, the down-regulation of ficolin-1 (Table HI), 
a collec tin-like cell surface protein reported to activate the com- 
plement system and to mediate adhesion and phagocytosis in 
monocytes but not previously reported in granulocytes (47), may 
represent negative modulation of the innate immune response. 
The finding that genes other than cytokines and chemokines are 
regulated by the PMN in response to LPS indicates that the PMN 
plays a more sophisticated role in host-defense and immunity 
than previously thought 

Treatment of the PMN with LPS lead to the induction of a set 
of genes associated with the anti-viral Type I interferons, 
IFNa/0. This induction occurs independently of the release of 
IFN or another unidentified soluble factor. 2 Furthermore, the 
set of genes expressed is smaller than that induced by IFNa/0, 
as described by Der et al. (12). This may be due to differences in 
the scope of the signaling systems activated by LPS and 
JFNa//3, or the time course of analysis of genes in the LPS- 
stimulated PMN. The implication that LPS treatment of PMN 
allows PMN to express anti-viral activity is currently being 
tested. Of interest was the finding that induction of interferon- 
stimulated genes was blocked by pretreatment of PMNs with 
SB203580. Work from our laboratory has indicated that signal 
transducers and activators of transcription activation does not 
occur in response to LPS in PMNs. 2 In addition, interferon- 



2 K. C. Malcolm and G. S, Worthen, manuscript in preparation- 
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Tabus III 

Analysis ofpH 3.0-10.0 two- dimensional PAGE gels 



Mean change{-fold) in expression level among six PMN donors is reported. The change in expression for the proteins listed was statifltically 
significant {p < 0.05) as measured by a two-tailed Student's t test. 



Identification [spot no.) 


Swias-Prot no. 


Estimated 


Tktioretical 


Peptides matched/ 
submitted 


Protein 
covered 


Mean 
change 










% 


% 


fold 


Up-regulated 














Proteasorae £1 chain [646] 


P28070 


27/5,7 


29.2/5.72 


9/12(75%) 


36% 


1.51 


Annexin III [550] 


P12429 


31/5.7 


36.4/5.6 


14/18 (78%) 


42% 


1.37 


Actin fragment 15441" 


P02570 


32/5.5 


(41.7/5.29) 


13/15 (87%) 


(34%) 


1.74 


Actin fragment [5911° 


P02570 


30/5.4 


(41.7/5.29) 


14/18(78%) 


(29%) 


1.60 


n-Enolase [3B0] 


P06733 


41/5.7 


47.2/7.01 


9/10 (90%) 


24% 


1.65 


Rab-GDP dissociation inhibitor 0 (289] 


P50395 


50/6.1 


50.7/6.11 


10/11 (91%) 


25% 


1.24 


Glutathione S-transferase P [648] 


P09211 


23/5.5 


23.4/5.43 


6/8 (75%) 


41% 


1.54 


Pre-B-cell colony enhancing factor 11152] 


P43490 


53/7.0 


55.5/6.69 


12/16 (75%) 


25% 


1.29 


Down-regulated 














Adenylyl cyclase-associated protein 1 [256] 


Q01518 


55/7.3 


51.7/8.07 


16/22 (73%) 


34% 


0.53 


Rho-GAPl [283] 


Q07960 


50/5.8 


50.4/5.86 


7/9 (78%) 


22% 


0.67 


Ficoiin 1 [511] 


000602 


33/6.5 


35/6.39 


10/12 (83%) 


25% 


0.74 



° The theoretical pi and M K of native actin are indicated. Protein coverage indicates coverage of native actin. 



Table TV 

Analysis of pH 6.0-6.0 two-dimensional PAGE gels 



Results are from pooled samples for control (n=3) and LPS-exposed \n - 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1.5-fold following LPS exposure in two repeat experiments. "New" designates proteins seen in the LPS gel in two repeat experiments but 
not detectable in the corresponding control gels. 



Identification [spot no.} 


Swiss-Prat 

no. 


Estimated 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Change 










% 


% 


-fold 


Up-regulated 














Protein- tyrosine kinase 9-like (4681 


Q9Y3F5* 1 


34/5.81 


39.5/6.37 


10/14 (71%) 


34% 


1.8 


Protein phosphatase 1, catalytic summit, p is o form 


P37140 


38/5.73 


37.2/5.84 


7/10 (70%) 


22% 


2.0 


[378] 














PO r stathmin 15771 


P16949 4 


18/5.36 


17.3/5.76 


9/12(75%) 


. 42%. 


2.1* 


Nonmuscle myosin heavy chain [1102] 


189036* 


145/5.32 


145/5.23 


20/21 (95%) 


17% 


New 


Putative P0 4 -nonmuscle myosin heavy chain [1101]' 1 


I89036 fc ' r 


145/5.29 


145/5.23 


14/16 (87%) 


13% 


New 


Leukocyte eiastase inhibitor [318] 


P30740 


42/5.71 


42.7/5.9 


9/13 (69%) 


22% 


2.4 


Grancalcin [1004] 


P2S676 


24/5.36 


24.0/5.02 


7/10 (70%) 


31% 


New 


Dawn-regulated 














Adenosyttiomocysteinase [324] 


P23526 


48/5.82 


47.7/6.04 


7/9 (78%) 


14% 


0.4 


PEST phosphatase interacting protein homolog [234J 7 


4100162' 


48/5.30 


47.6/5.35 


11/13 (85%) 


30% 


0.5 



n TrEMBL accession number. 

h Accession number and theoretical pi and Af R foT the unmodified protein are indicated. 
c NCBl accession number. 
d See text for explanation. 

"■ Among three experiments, the ratio of P0 4 -stathmin expression increase, following LPS exposure in the presence of SB203580 divided by that 
in the absence of SB203580, was 0.93. 
f Genpept accession number. 

* This search was performed using average masses measured by linear mode MA1J.11-TOF MS. 

Table V 

Analysis of pH 5.5-6.7 two-dimensional PAGE gels 



Results are from pooled samples for control (n = 3) and LPS-exposed (n = 3) PMNs from human donors. Expression of the reported proteins was 
altered > 1,5-fold following LPS exposure in two repeat experiments. 



Identification [spot no.} 


Swiss- Prut 
no. 


Estimated 


Theoretical 


Peptides matched/ 
submitted 


Protein 
covered 


Change 










% 


% 


-fold 


Up-regulatcd 














Transaidolase [475] 


P37837 


38/5.95 


37.5/6.36 


13/17 (76%) 


33% 


2.5 


Ioocitrate dehydrogenase [431] 


075874 ^ 


4676.25 


46.7/6.35 


7/7(100%) 


13% 


2.3 


Moesin [201] 


P26038 


61/6.09 


67.8/6.07 


11/13 (85%) 


17% 


2.1 


/(•Enolase [459] 


P06733 


43/5.64 


47.2/7.01 


7/10 (70%) 


17% 


3.8 


Down-regulated 
Calponin H2 [240] 














Q99439 


34/6.65 


33.7/6.94 


10/11 (90%) 


27% 


0.5 



regulatory factor 3, a known regulator of interferon- stimulated 
gene transcription, is not a direct target of p38 kinase. 2 There- 
fore, gene expression analysis of LPS-stimulated PMNs has 
uncovered a previously uncharacterized signal transduction 
system that is sensitive to inhibition of p38 MAPK. 

Knowledge of the genes down-regulated by LPS permits the 



development of further hypotheses addressing PMN function in 
the face of infection. Strikingly, several down-regulated genes 
and gene products are structural in nature (e.g. paxillin, acti- 
nin, calponin 112) (Tables II and V). A known consequence to 
the PMN of LPS exposure is decz-eased motility (48). Up-regu- 
lation of genes for adhesion molecules [ICAM-l, CD44, AL- 
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Tabie VI 

Effect ofSB203580 on LPS-stimulated gene expression 
Genes arc reported for which the SB20S680/control expression ratio 
is ^ 0.60. 



-fold change ratio Change in absence 

(SB20356OWro]> of £3203500 







-fold 


ISG15 


0.09 


22.5 


HCH 


0.38 


20.8 


Mxl 


0 


19.4 


IFI66 


0 


12.1 


PI-9 


0.57 


9.5 


Ets-2 


0.59 


7.4 


1L-6 


0.45 


6.3 


Rel 


0.50 


6.2 


UMS1 


0.58 


6.1 


C3AR1 


0.49 


6.1 


INDO 


0.35 


5.2 


EIAA0105 


0.41 


5.1 


SNAP23 


0.58 


6.0 


SLPI 


0.58 


4.7 


ELNAII2 • 


0,49 


4.6 


HM-74 


0.57 


3.8 


PKR 


0 


3.7 


MAD 


. 0.21 


3.6 


IFIT4 


0.12 


3.6 


Glycerol kinase 


0 


3.6 


IFJ54 


0 


3.5 


WISH 


0.39 


3.5 


IPF35 


0.46 


3.0 




Fig. 2. A, the predicted sequence of the tryptic phosphopeptide in 
PCVatathmin (1468.72 Da). The peptide mass measured by MALDI- 
TOF MS and the predicted mass differed by 14 ppm. As indicated, two 
alternate phosphorylation sites are possible: serine 16 and serine 25. B, 
P0 4 -Btathmin and stathmin were identified on the control and LPS- 
exposed pH 5.0-6.0 gels. Consistent with phosphorylation, the P0 4 - 
stathmin spot was distinguished by a peptide of mass 1468.72 Da 
80 Da greater than the peptide of 1388.72 Da seen in the stathmin spot). 
Aasiuning that no other multiply phosphorylated stathmin species have 
escaped detection, analysis of the integrated intensities of the P0 4 - 
stathmin and stathmin spots indicates tliat the percentage of the P0 4 
form of total cellular stathmin has increased from 11% to 38% with LPS 
stimulation. The decrease in integrated intensity for stathmin was 
equal in amount to the increase in PO^-stathmin following LPS 
exposure. 

CAM, and TSG-6), and down-regulation of genes for structural 
proteins, indicates a genetic basis for this observation. Down- 
regulation of two genes implicated in cytoskeletal regulation, 
Pix-a and RhoB t was also observed. The calcium-binding pro- 
tein S100A4, down-regulated in LPS-treated PMNa (Table II), 
has been implicated in cell motility and metastasis (49). De- 
creased motility may be beneficial in sustaining the inflamma- 
tory response at sites of infection. In addition, LPS treatment 
results in an inhibition of apoptosis (50). Therefore, the longer 
residence time of the PMN at sites of infection is consistent 
with the long term genetically coded changes seen in these 
gene-profiling experiments and indicates that the changes in 
gene expression are functionally relevant to host defense and 
immunity. 

By providing information on post-translational modification, 
the proteomics data may provide further insights into the cy- 



Table VII 

Effect of SB203580 on LPS stimulated protein expression 



Protein mime 


-fcJd rhanee rjitio 
(S62U35B0/ccntroi) 


Change in 
ubetacc of 
SB203580 






-fold 


Up-regulated 


0.8 


1.51 


Proteaaome £ chain 


Annex in III 


0.6 


1.37 


Actin fragment 15441 


0.8 


1.74 


Actin fragment [6911 


0.8 


1.60 


a-Enolofie 


0.6 


1.65 


Rab-GDP dissociation inhibitor 0 


1.1 


1.24 


Glutathione S- transferase P 


1.2 


1.54 


Pre-B-cell colony enhancing factor 


1.2 


1.29 


Down-regulated 




0.53 


Adenylyl cyclase- associated protein 1 


1.3 


Rho-GAPl 


0.8 


0.67 


Ficolin 1 


1.0 


0.74 



toskeletal remodeling effects of LPS upon the PMN. We con- 
tend that the actin fragments identified (Table III) are unlikely 
to represent technical artifacts. Rather, their specificity (iden- 
tical molecular weight/pi among different experiments), statis- 
tically significant up-regulation by LPS, as well as the use of a 
lysis buffer containing chaotropes and multiple protease inhib- 
itors argue instead that these fragments are physiologic con- 
sequences of LPS exposure in the human PMN. More specifi- 
cally, the up-regulation of these fragments following LPS 
exposure (Table HI) suggests that LPS may activate an actin- 
cleaving enzyme, which, in turn, remodels the cytoskeleton. 
Intriguing in this vein, calpain has recently been reported to 
play an important role in cell migration and cytoskeletal orga- 
nization of fibroblasts (51). The possibilities that LPS may 
induce calpain activation and that calpain activation may reg- 
ulate cytoskeletal reorganization and motility are currently 
under investigation. An alternative possibility is that actin 
cleavage is a marker of neutrophil apoptosis (52). 

Other LPS-regulated proteins may play important roles in 
cytoskeletal reorganization. The up-regulation of protein-ty- 
rosine kinase 9-like (A6-related protein) may modulate LPS- 
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Table VHJ 

LPS-regulated proteins for which a probe was present on the 
Asymetrix chip 

A comparison of corresponding protein and mRNA transcript changes 
following LPS exposure is shown. 

I>rot B m JESS mRNA Chimee 



-fold 

Up-regulated 

Protoasome 0 chain 1.5 1.9 \ 

Leukocyte elastase inhibitor 2.4 4.6 f 

Rab-GDl 0 1.24 NC° 

Granealcin New NC 

Transaidalase 2.5 NC 

Moesin 2.1 NC 

Nonmuscle myosin heavy chain New NC 

Glutathione S-transferase P 1.54 Absent 

Pre-B cell enhancing factor 1.29 Absent 

Isocitrate dehydrogenase 2.3 Absent 

P0 4 -statiunin 2.1 Absent (stathmin) 

Protein phosphatase 1, (5 catalytic sub unit 2 Absent 

Awiexinffl - 3.1 3.1 i 

Down-regulated 

Adenvlyl cyclase-associatRd protein 1 1.9 2.1 I 

Rho-GAF i 1.5 2.7 1 

Ficolinl 1.4 1.7 i 

Adenosylhomocyoteinase 2.5 Absent 

Calponin H2 2 NC 



° NC, no measureable change. 

induced actin polymerization, because it bears a high degree of 
homology to twinfUin (A6), an actin monomer-binding protein 
that localizes to sites of rapid filament assembly in cells and is 
believed to regulate actin filament turnover (53). In turn, LPS- 
induced down-regulation of Rho-GTPase activating protein 1 
(Table III) may regulate twinfilin (and protein-tyrosine kinase 
9-like) activity, because twinfilin has been shown to colocalize 
with Racl and Cdc42 and to be regulated by active Racl in NIH 
3T3 cells (53). Activation of Rho proteins may be facilitated by 
LPS up-regulation of moesin (Table V), because moesin report- 
edly induces the dissociation of Rho from GDI (54). Racl may, 
in turn, promote activation of the actin filament-nucleating 
Arp2/3 complex through interactions with WASP (Wiskott-Al- 
drich syndrome protein) family proteins (55) and, interestingly, 
is postulated to regulate the dynamics of both the actin and 
microtubule cytoskeletons via phosphorylation of stathmin (Ta- 
ble IV) (56). Calponin H2 is an actin-binding protein not pre- 
viously reported in PMNs that is postulated to play a role in 
cytoskeletal organization (57). Its down-regulation by LPS (Ta- 
ble V) likely modulates LPS-induced cytoskeletal reorganiza- 
tion. The up-regulation of nonmuscle myosin heavy chain and a 
putative phosphorylated form of myosin heavy chain (putative 
protein kinase C substrate by prediction rules) in the LPS- 
exposed PMN (Table IV) is of uncertain significance; myosin 
has been implicated in multiple functions in the PMN, includ- 
ing locomotion, fluid pinocytosis, and phagocytosis (58). Of 
interest, however, S100A4 (down-regulated, Table It) lias been 
reported to regulate cytoskeletal dynamics by inhibiting pro- 
tein kinase C-mediated phosphorylation of nonmuscle myosin 
heavy chain (59). 

LPS induction of stathmin phosphorylation (Table IV and 
Pig. 2) may represent another mechanism by which the cy- 
toskeleton is remodeled. Stathmin is a phosphoprotein report- 
edly involved in both signal transduction and in regulation of 
the microtubulin filament network; furthermore, phosphoryla- 
tion of stathmin has been reported to modulate its tubulin- 
binding avidity (60). Inferences can be made about both the 
phosphorylation site on P0 4 -stathmin and the responsible ki- 
nase induced by LPS. Four phosphorylation sites in stathmin 
have been well described: Ser 16 , Ser 26 , Ser 38 , and Ser 63 (32, 33). 



Ser 16 has been reported as a substrate for Ca 2+ /calmodulin 
(CaM)-dependent kinases (32), and Ser 25 as primarily a sub- 
strate for p38 and ERK (33), with p34 ,5df<2 also active but bear- 
ing a 5-fold preference for Ser 3 * 1 (34). As stated above, the 
phosphopeptide identified in P0 4 -stathmin, extending from 
residues 15 to 27 (1468.7 Da), is consistent with phosphoryla- 
tion of either Ser 16 or Ser 26 (Fig. 2). Although both p388 and 
p38a MAPK isoforms are expressed in the human PMN, LI'S 
has been shown to selectively activate the p38« isoform in 
human PMNs (9). The p38a isoform, however, has been shown 
to be relatively inactive at Ser 25 ; in fact, p38S is ~ 100-fold more 
active at Ser 25 , and selective p38a inhibitors do not inhibit the 
stress-activated phosphorylation of stathmin in 293 cells (33). 
Further support for the lack of involvement of p38 signaling in 
phosphorylation of stathmin in our system is the apparent lack 
of effect of SB203580 (a selective p38a and p38/3 inhibitor) on 
LPS-induced expression of P0 4 -8tathmin (Table IV). Because 
p34 cdc2 is relatively inactive at Ser 26 (34), we conclude that the 
phosphorylation site is likely to be Ser 16 , a reported substrate 
of CaM-dependent kinase. Although CaM kinases have previ- 
ously been implicated in gene activation in LPS-exposed my- 
elomonocytic HD11 cells (61), stathmin signaling has not, to 
our knowledge, been previously reported in either PMNs or 
lipopolysaccharide signal transduction. 

Cytoskeletal reorganization, a well-described regulator of 
granule release (62), may underlie LPS-induced priming for 
PMN granule release, but several LPS-regulated proteins may 
provide more specific clues. LPS exposure led to increased 
levels of grancalcin, a calcium-binding protein previously de- 
tected in PMNs and shown to translocate to granules and 
plasma membrane in the presence of physiologic concentra- 
tions of calcium (63). Similarly, annexin HI, a calcium-binding 
protein highly expressed in PMN granule membranes and im- 
plicated in calcium-mediated secretion (64) and in granule fu- 
sion (65), was also found to be up-regulated. Exocytosis of 
granule contents may also be facilitated by LPS up-regulation 
of Rab-GDP dissociation inhibitor (Table III), which has been 
proposed to recycle Rab after vesicle fusion by extracting it 
from the membrane and loading it onto newly formed transport 
intermediates (66). 

Parallel use of DNA microarrays and proteomics affords a 
powerful strategy for comparison of corresponding mRNA tran- 
scripts and proteins, thereby affording new insight into the 
mechanisms by which the cell regulates its signaling responses 
to the external environment. Of interest, a poor correlation was 
found between corresponding transcripts and proteins (Table 
VHI), as reported in other systems (17, 18). The finding in some 
cases of unchanged transcript abundance in the face of regu- 
lated protein levels indicates post-transcriptional modulation 
following LPS exposure. The finding of undetected transcripts 
in the face of regulated levels of the corresponding proteins 
may indicate previous transcription of these genes in an earlier 
state of the myeloid maturation of the PMN, producing stable 
protein specieB that have undergone post-translational alter- 
ation following LPS exposure. The use of SB203580, a p38 
inhibitor, adds further insights into the mechanisms of LPS 
regulation. At the level of mRNA expression, SB203580 inhib- 
ited 23% of LPS-stimulated genes by >40% and 11% of genes 
by .^60%; therefore, p38 plays a specific role in gene regulation 
in the PMN. In particular, proteasome p chain was up-regu- 
lated at both the mRNA transcript and protein level (Table 
VIII), with no notable effect of SB203580 on expression at 
either level, consistent with a non-p38-mediated pathway of 
primary transcriptional up-regulation induced by LPS. Simi- 
larly, CAP1, Rho-GAPl, and ficolin 1 were down-regulated at 
both the mRNA transcript and protein level (Table VIII), with 
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no notable effect of SB203580, consistent with a non-p38-me- 
diated pathway of primary transcriptional down-regulation. 
Interestingly, anriexin III waB down-regulated at the transcript 
level and up-regulated at the protein level, with an inhibitory 
effect of SB203580 seen only at the protein level (Table VII), 
consistent with a p38-mediated post-transcription al up-regula- 
tion induced by LPS. 

Limitations of the present study should be noted. Gene ex- 
pression analysis by cDNA microarrays does not distinguish 
between transcriptional regulation and mENA stabilization; 
similarly, two-dimension al PAGE proteomies by itself does not 
distinguish among transcriptional, translational, or post-trans- 
lational regulation of protein abundance. Transcript detection 
by microarray technology is limited to the probes included; 
protein identification by two-dimensional PAGE proteomies is 
limited to well-resolved regions of the gel, may perform less 
well with hydrophobic and high molecular weight proteins, and 
tends to select for more abundant protein species (30). Harvest- 
ing of the LPS-incubated PMNs at 4 h may have prevented 
detection of earlier, transient changes and may have thereby 
introduced artifactual transcript-protein discordance. Further- 
more, the post-LPS incubation, pre-two-dimensional PAGE cell 
washes would be expected to remove secreted proteins from 
further analysis, with uncertain effects on detected protein 
abundance depending on such factors as the degree of de novo 
synthesis and extent of degranulation/exocytosis. Because pro- 
tein binding of Coomassie Blue has a limited dynamic range 
and is typically not linear throughout the range of detection, 
image analysis of Coomassie Blue-stained protein spots should 
be considered semi-quantitative. For some protein spots, the 
apparent magnitude of regulation by LPS may have been 
blunted by the spot approaching staining saturation in the 
control gel. By limiting our analysis to those protein spots 
common to all twelve pH 3.0-10.0 two-dimensional gels, we 
likely excluded some LPS-regulated proteinB that happened to 
be either poorly resolved on a subset of the gels or unmatched 
by the image analysis software. By further limiting the analy- 
sis to those matched spots on the pH 3.0—10.0 gels for which a 
two-tailed t test demonstrated p < 0.05, the list of regulated 
proteins was likely also limited by statistical power. In addition 
to those regulated proteins listed in Table III, three others were 
up-regulated and three down-regulated with p < 0.09 (data not 
shown). 

Limiting our reported results to those changes that met 
statistical significance among the donors carries further impor- 
tant implications. We have encountered a two order of magni- 
tude range of response in unselected donor LPS-induced PMN 
functions, such as TNF-a and superoxide anion release (data 
not shown). The sources of this physiologic heterogeneity re- 
main uncertain but may possibly include such factors as nat- 
ural mutations of the LPS receptor component, TLR4 (67). By 
selecting for LPS effects common to all donors, we may not have 
characterized the range of genomic and proteomic heterogene- 
ity present in the population and thereby may have focused on 
only a narrow portion of a broader biological response to LPS. 
We contend that this reductionist approach is valid because it 
would be expected to enrich for biologically integral responses 
of the PMN to LPS. Nevertheless, correlation of genomic and 
proteomic profiles with functional phenotypes of the PMN may 
bear, important diagnostic and therapeutic implications and 
will be pursued in future studies. 

Widespread regulation of numerous honcytokine/chemokine 
genes and proteins in the LPP-stimulated human PMN is a 
novel finding. These data indicate that, despite a narrow scope 
of gene expression in the nonstimulated state, the terminally 
differentiated, short-lived PMN likely plays a role in the innate 



immune response that is far more sophisticated and dynamic 
than the simple release of preformed inflammatory mediators. 
Although gene expression appears to be an important mecha- 
nism by which PMNs respond acutely to infection, mRNA tran- 
script/protein concordance is limited, and post-transcriptional 
(and post-translational) modifications also play an important 
role. The alteration of multiple transcriptional regulators, G- 
protein regulators, P0 4 -stathmin, and protein phosphatase 1 
indicates that one of the responses to LPS exposure is to modify 
subsequent signaling events by bacterial components or by 
other cytokines and chemokines. Finally, the finding that p38 
MAPK mediates LPS regulation of a limited subset of tran- 
scripts and proteins underlines the continuing need to define 
signal transduction cascades in the neutrophil. 
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The relationship between gene expression measured at 
the mRNA level and the corresponding protein level Is not 
well characterized In human cancer. In this study, we 
compared mRNA and protein expression for a cohort of 
genes fn the same lung adenocarcinomas. The abun- 
dance of 165 protein spots representing 98 Individual 
genes was analyzed In 76 lung adenocarcinomas and nine 
non-neoplastic lung tissues using two-dimensional poly- 
acrylamlde gel electrophoresis. Specific polypeptides 
were Identified using matrix-assisted laser desorption/ 
ionization mass spectrometry. For the same 85 samples, 
mRNA levels were determined using oligonucleotide mi- 
croarrays, allowing a comparative analysis of mRNA and 
protein expression among the 165 protein spots. Twenty- 
eight of the 165 protein spots (17%) or 21 of 98 genes 
(21.4%) had a statistically significant correlation between 
protein and mRNA expression (r > 0.2445; p < 0.05); 
however, among all 165 proteins the correlation coeffi- 
cient values (r) ranged from -0.467 to 0.442. Correlation 
coefficient values were not related to protein abundance. 
Further, no significant correlation between mRNA and 
protein expression was found (r = -0.025) if the average 
levels of mRNA or protein among all samples were applied 
across the 165 protein spots (98 genes). The mRNA/ 
protein correlation coefficient also varied among pro- 
teins with multiple fsoforms, indicating potentially sep- 
arate Isoform-specffic mechanisms for the regulation of 
protein abundance. Among the 21 genes with a signifi- 
cant correlation between mRNA and protein, five genes 
differed significantly between stage I and stage III lung 
adenocarcinomas. Using a quantitative analysis of mRNA 
and protein expression within the same lung adenocarci- 
nomas, we showed that only a subset of the proteins 
exhibited a significant correlation with mRNA abundance. 
Molecular & Cellular Proteomlcs 1:304-313, 2002. 



Lung cancer is the leading cause of cancer death for both 
men and women in the United States. Adenocarcinomas of 
the lung comprise -40% of all new cases of non-small cell 
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lung cancer and are now the most common histologic type. 
Functional genomics, broadly defined as the comprehensive 
analysis of genes and their products, have become a recent 
focus of the life sciences (1). Application of these approaches to 
lung adenocarcinomas has the potential to aid in the identifica- 
tion of high risk patients with resectable early stage lung cancer 
that may benefit from adjuvant therapy, as well as to identify 
new therapeutic targets. In human lung cancer, however, little is 
currently understood regarding the relationship between gene 
expression as determined by measuring mRNA levels and the 
corresponding abundance of the protein products. 

A number of powerful techniques for analysis of gene ex- 
pression have been used including differential display (2), 
serial analysis of gene expression (3), DNA microarrays (4), 
and proteomics via two-dimensional polyacryiamide gel elec- 
trophoresis and mass spectrometry (5). Bioinformatics tools 
have also been developed to help determine quantitative 
mRNA/protein expression profiles of all types of cells and 
tissues (6) and now can be applied to benign and malignant 
tumors. DNA microarrays (cDNA and oligonucleotide) permit 
the parallel assessment of thousands of genes and have been 
utilized in gene expression monitoring (7), polymorphism anal- 
ysis (8), and DNA sequencing (9). Recent studies have fo- 
cused on classification or identification of subgroups of lung 
tumors using DNA microarrays (10, 11). The use of mRNA 
expression patterns by themselves, however, is insufficient for 
understanding the expression of protein products, as addi- 
tional post-transcriptional mechanisms, including protein 
translation, post-translational modification, and degradation, 
may influence the level of a protein present in a given cell or 
tissue. Proteomic analyses, a complementary technology to 
DNA microarrays for monitoring gene expression, involves 
protein separation and quantitative assessment of protein 
spots using 2D 1 -PAGE and protein identification using mass 
spectrometry. By combining proteomic and transcriptional 
analyses of the same samples, however, it may be possible to 
understand the complex mechanisms influencing protein ex- 
pression in human cancer. 

in this study, we determined mRNA and protein levels for 
165 proteins (98 genes) in 76 lung adenocarcinomas and nine 



1 The abbreviations used are: 2D, two-dimensional; MALDI-MS, 
matrix-assisted laser desorption/ioni2ation mass spectrometry. 
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Protein and mRNA Correlation in Lung Adenocarcinomas 



Table I 

Correlation coefficients of protein and mRNA where only one spot was present on 20 gels 
t*. correlation coefficient value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 



Spot Unigene Gene name r* Protein name 
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Spot 



' 0278 
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Unigene 
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Hs.9614 

Hs.74335 
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Hs.16488 
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Hs.77060 



Gene name 



TCP1 

NPM1 

HSPCB 

FABP5 

CALR 

GSTM4 

PSMB6 



Table I — continued 



Protein name 



-0.1237 T-complex protein I, a subunit 

-0.1738 B23/numatrin 

-0.2049 Hsp90 

-0.2109 E-FABP/FABP5 

-0.2344 Calreticulin 32 

-0.2438 Glutathione S-transferase M4 (GST m4) 
-0.2512 . Macropatn subunit A 



non-neoplastic lung tissues. Protein levels were determined 
using quantitative 2D-PAGE analysis, and the separated pro- 
tein polypeptides were identified using matrix-assisted laser 
desorption/ionization mass spectrometry (MALDI-MS). The 
corresponding mRNA levels for the identified proteins within 
the same samples were determined using oligonucleotide 
microarrays. Correlation analyses showed that protein abun- 
dance is likely a reflection of the transcription for a subset of 
proteins, but translation and post-translational modifications 
also appear to influence the expression levels of many indi- 
vidual proteins in lung adenocarcinomas. 

EXPERIMENTAL PROCEDURES 
77sst/es-Rfty-seven stage I and 19 stage III lung adenocarcino- 
mas, as well as nine non-neoplastic lung tissue samples, were used 
for protein and mRNA analyses. Patient consent was obtained, and 
the project was approved by the Institutional Review Board. All tis- 
sues were obtained after resection at the University of Michigan 
Health System between May 1991 and July 1998. Tissues were all 
snap-frozen in liquid nitrogen and then stored at -80 Q C. The patients 
included 46 females and 30 males ranging in age from 40.9 to 84,6 
(average 63.8) years. Most patients (66/76) demonstrated a positive 
smoking history. Sixty-one tumor samples were classified as bron- 
chial-derived, 14 were classified as bronchoalveolar, and one had 
both features. Eighteen tumor samples were classified as well differ- 
entiated, 38 were classified as moderate, and 19 were classified as 
poorly differentiated adenocarcinomas. Hematoxylin-stained cryostat 
sections (5 pm), prepared from the same tumor pieces to be utilized 
for protein and mRNA isolation, were evaluated by a pathologist and 
compared with hematoxylin- and eosin-stained sections made from 
paraffin blocks of the same tumors. Specimens were excluded from 
analysis if they showed unclear or mixed histology (e.g. adenosqua- 
mous), tumor cellularity less than 70%, potential metastatic origin as 
indicated by previous tumor history, extensive lymphocytic infiltration, 
or fibrosis or if the patient had received prior chemotherapy or 
radiotherapy. 

Oligonucleotide Array Hybridization -The HuGeneFL oligonucleo- 
tide arrays (Affymetrix, Santa Clara, CA) containing 6800 genes were 
used in this study. Total RNA was isolated from all samples using 
Trizol reagent (Invitrogen). The resulting RNA was then subjected to 
further purification using RNeasy spin columns (Qiagen). Preparation 
of cRNA, hybridization, and scanning of the HuGeneFL arrays were 
performed according to the manufacturer's protocol (Affymetrix, 
Santa Clara, CA). Data analysts was performed using GeneChip 4.0 
software. The gene expression profile of each tumor was normalized 
to the median gene expression profile for the entire sample. Details of 
data trimming and normalization are described elsewhere (11). 

2D-PAGE and Quantitative Protein /Vja/ys/s-Tissue for both pro- 
tein and mRNA isolation came from contiguous areas of each sample. 
Protein separation using 2D-PAGE, silver staining, and digitization 



were performed as described previously (12, 13). Our 2D-PAGE sys- 
tem allows us to run 20 gels at one time (one batch). Spot detection 
and quantification were accomplished utilizing Bio Image Visage Sys- 
tem software (Bioimage Corp., Ann Arbor, Ml). The integrated inten- 
sity of each spot was calculated as the measured optical density 
units X mm 2 . Of the total possible 2000 spots detectable on each gel, 
820 spots on the gel of each sample were matched using a Gel-ed 
match program with the same spots on a chosen "master" gel. In 
each sample, 250 ubiquitously expressed reference spots were used 
to adjust for variations between gels, such as that created by subtle 
differences In protein loading or gel staining. Slight differences be- 
cause of batch were corrected after spot-size quantification. 

Mass Spectrometry and 2D Western Blotting- Preparative 2D gels 
were run using extracts from A549 lung adenocarcinoma cells (ob- 
tained from ATCC) and using the identical experimental conditions as 
the analytical 2D gels, except 30% more protein was loaded. The 
resolved protein gels were silver-stained using successive incuba- 
tions in 0.02% sodium thlosulfate for 2 mln, 0.1 % silver nitrate for 40 
min, and 0.014% formaldehyde plus 2% sodium carbonate for 10 
min. For protein identification, protein polypeptides underwent trypsin 
digestion followed by MALDI-MS using a MALDI-TOF Voyager-DE 
mass spectrometer (Perseptive Biosystems. Framingham, MA). The 
masses were compared with known trypsin digest databases using 
the MS-FIT database (University of California, San Francisco; 
prospector.ucsf.edu/ucsfhtml3.2/msfrt.htm). Some of the polypep- 
tides included in the analysis had been Identified prior to this study on 
the basis of sequencing (14). The identified protein spots used in this 
paper are shown in Fig. 1A The method for 2D-PAQE Western Wot 
verification was as described previously (1 5). The 2D Western blots of 
GRP58 and Op18 are shown in Fig. 1 , C and E ; the others, such as 
GRP78, GRP75, HSP70, HSC70, KRT8, KRT18, KRT19, Vimentin, 
ApoJ, 14-3-3, Annexin I, Annexin II, PGP9.5, DJ-1. GST-pi, and 
PGAM, are described elsewhere. 2 

Statistical Analysis- Missing values were replaced with the mean 
value of the protein spot. The transform x -* log (1 + x) was applied 
to normalize all protein expression values. The relationship between 
protein and mRNA expression levels within the same samples was 
examined using the Spearman correlation coefficient analysis (1 6). To 
identity potentially significant correlations between gene and protein 
expression, we used an analytical strategy similar to SAM (signifi- 
cance analysis of microarrays) (17), which uses a permutation tech- 
nique to determine the significance of changes In gene expression 
between different biological states. To obtain permuted correlation 
coefficients between gene and protein expression, genes were ex- 
changed first In such a way that permutated correlation coefficient 
were calculated based on pseudo pairs of genes and proteins. The 
distribution of permutated correlation coefficients became stable after 
60 permutations. This procedure was then repeated 60 times to 
obtain 60 sets of permutated correlation coefficients. For each of the 
60 permutations, the correlations of genes and proteins were ranked 



2 Chen et at. , submitted for publication. 
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Table II 

Correlation coefficients of protein and mRNA where multiple isoforms were present on 2D gefs 
coefficient value > 0.2445; p < 0.05. Values in boldface are significant at p < 0.05. 



Spot 



Unigene 



Gene name 



Protein name 



1494 
0957 
0353 
0855 
1198 
1203 
0523 
1492 
1493 
1181 
0439 
0505 
0593 
1874 
0935 
2524 
2324 
1192 
0350 
0992 
0861 
0853 
2503 
0381 
0371 
1179 
0762 
0760 
2506 
0772 
0723 
1239 
1237 
1234 
0428 
0427 
0424 
0863 
0780 
1527 
1484 
1728 
1712 
0947 
1232 
1229 
1595 
1810 
1459 
1458 
0619 
0615 
1250 
0549 
0338 
0333 
0331 
2381 
0535 



Hs.81915 

Hs.77899 

Hs.289101 

Hs.169476 

Hs.41707 

Hs.83848 

Hs.65114 

Hs.81915 

Hs.81915 

Hs.78225 

Hs.242463 

Hs.297753 

Hs.297753 

Hs.75313 

Hs.75544 

Hs.78225 

Hs.65114 

Hs.41707 

Hs.289101 

Hs.75313 

Hs.75313 

Hs.75313 

Hs.76392 

Hs.76392 

Hs.76392 

Hs.78225 

Hs.78225 

Hs.78225 

Hs.217493 

Hs.217493 

Hs.217493 

Hs.93194 

Hs.93194 

Hs.93194 

Hs.25 

Hs.25 

Hs.25 

Hs.75106 

Hs.75106 

Hs. 119140 

Hs.119140 

Hs.5241 

Hs.5241 

Hs.169476 

Hs.75207 

Hs.75207 

Hs.158300 

Hs.75990 

Hs.75990 

Hs.75990 

Hs.75990 

Hs.75990 

Hs.41707 

Hs.79037 

Hs.79037 

Hs.79037 

Hs.79037 

Hs.65114 

Hs.65114 



lapis 

TPM1 

GRP58 

GAPD 

HSPB3 

TPI1 

KRT18 

LAP 18 

LAP18 

ANXA1 

KRT8 

VIM 

VIM 

AKR1B1 

YWHAH 

ANXA1 

KRT18 

HSPB3 

GRP58 

AKR1Q1 

AKR1B1 

AKR1B1 

ALDH1 

ALDH1 

ALDH1 

ANXA1 

ANXAl 

ANXA1 

ANXA2 

ANXA2 

ANXA2 

AP0A1 

APOA1 

APOA1 

ATP5B 

ATP5B 

ATP 58 

CLU 

CLU 

EIF5A 

EIF5A 

FABP1 

FABP1 

GAPD 

GLOl 

GLOl 

HAP1 

HP 

HP 

HP 

HP 

HP 

HSPB3 
HSPD1 
HSPD1 
HSPD1 
HSPD1 
KRT18 
KRT18 



0.4003 
0.3930 
0.3802 
0.3693 
0.3668 
0.3395 
0.3335 
0.3234 
0.3154 
0.3102 
0.3049 
0.2939 
0.2809 
0.2780 
0.2775 
0.2612 
0.2601 
0.2558 
0.2516 
-0.2460 
0.0761 
-0.0675 
-0.0565 
-0.0371 
-0.0680 
0.2052 
-0.0739 
-0.0228 
0.2223 
0.2080 
0.0701 
0.1133 
-0.0373 
-0.0894 
0.0080 
0.0122 
-0.0992 
-0.0483 
-0.0443 
-0.0726 
-0.0376 
-0.1916 
-0.0473 
0.1745 
0.2249 
0.0450 
-0.0137 
-0.4672 
0.0802 
-0.0305 
0.0461 
-0.0034 
-0.1024 
0.1074 
0.2265 
0.1383 
0.1603 
0.2016 
0.1106 



OP18(Stathmln) 
Tropomyosins 1-5 

Protease disulfide isomerase (GRP58) 
Glycera!dehyde-3-phosphate dehydrogenase 
Hsp27 

Triose phosphate isomerase (TP I) 

Cytokeratin 18 

OP18(Stathmin) 

OP18(Stathmin) 

Annexin variant I 

Cytokeratin 8 

Vimentin 

Vimenttn 

Aldose reductase 
14-3-3 v 
Annexin I 
Cytokeratin 18 
Hsp27 

Phospholipase C (GRP58) 
Aldose reductase 
Aldose reductase 
Aldose reductase 
Aldehyde dehydrogenase 
Aldehyde dehydrogenase 
Aldehyde dehydrogenase 
Annexin variant I 
Annexin I 
Annexin I 

Upocotin (annexin II) 
Upocotin (annexin II) 
Lipocotin 

Apolipoprotein A1 (ApoA1) 

Apolipoprotein A1 (ApoA1) 

Apolipoprotein A1 (ApoA1) 

ATP synthase p subunit precursor 

ATP synthase p subunit precursor 

ATP synthase p subunit precursor 

Apolipoprotein J (ApoJ) 

Apolipoprotein J (ApoJ) 

elF-5A 

elF-5A 

L-FABP 

L-FABP 

Glyceraldohyde-3-phosphate dehydrogenase 

Glyoxalase-I 

Glyoxafase-1 

Huntingtin-associated protein 1 (neuroan 1) 

a-Haptoglobin 

a-Haptoglobin 

a-Haptoglobin 

B-haptoglobin 

B-haptoglobin 

Hsp27 

Hsp60 

Hsp60 

Hsp60 

Hsp60 

Cytokeratin 18 
Cytokeratin 18 
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Table II— continued 

Correlation coefficients of protein and mRNA where multiple isoforms were present on 2D gels 
r\ correlation coefficient value > 0,2445; p < 0,05. Values In boldface are significant at p < 0.05 



Spot 


1 IfrinAtiB 
vji <iyci !c 


Gene name 


r* 


0529 


Hs.65114 


KRT18 


0.1279 


0528 


Hs.65114 


KRT18 


0.0414 


0527 


Hs.65114 


KRT18 


0.0436 


0514 


Hs.65114 


KRT18 


0.0733 


0451 


Hs.242463 


KRT8 


-0.01 1 1 


0446 


Hs.242463 


KRT8 


0.0347 


0444 


Hs.242463 


KRT8 


-0.1311 


0443 


Hs.242463 


KRT8 


0.0942 


1488 


HS.B1915 


LAP18 


0.0495 


0321 


Hs.75655 


P4HB 


-0.0546 


0320 


Hs.75655 


P4HB 


-- 0.0041 


1063 


Hs.75323 


PHB 


0.0441 


0837 


Hs.75323 


PHB 


0.1402 


0326 


Hs.297681 


SERPINA1 


-0.0227 


0322 


Hs.297681 


SERPINA1 


-0.0277 


0241 


Hs.297681 


SERPINA1 


-0.0148 


1280 


Hs.301254 


SFTPA1 


-0.1488 


1278 


Hs.301254 


SFTPA1 


-0.2040 


0866 


Hs.73980 


TNNT1 


0.1162 


0778 


Hs.73980 


TNNT1 


0.0740 


1213 


Hs.83848 


TPI1 


0.0024 


1210 


Hs.83848 


TPI1 


0.0490 


1207 


Hs.83848 


TP 11 


-0.1615 


1204 


Hs.83848 


TPI1 


0.0209 


1202 


Hs.83848 


TPI1 


0.0721 


1161 


Hs.83848 


TPI1 


0.2265 


1052 


Hs.77899 


TPM1 


-0.1040 


1039 


Hs.77899 


TPM1 


-0.2999 


1035 


Hs.77B99 


TPM1 


-0.3821 


0783 


Hs.77B99 


TPM1 


0.0757 


1574 


Hs. 194366 


TTR 


-0.0065 


0809 


Hs.194366 


TTR 


0.0399 


2202 


Hs.76118 


UCHL1 


-0.0220 


1246 


Hs.76118 


UCHL1 


-0.1261 


1242 


Hs.76118 


UCHL1 


0.1473 


0606 


Hs.297753 


VIM 


0.0951 


0594 


Hs.297753 


VIM 


-0.2664 


0508 


Hs.297753 


VIM 


0.1008 


0419 


Hs.297753 


VIM 


0.0032 


1279 


Hs.75544 


YWHAH 


0.0059 



Protein name 



Cytokeratin 18 
Cytokeratin 18 
Cytokeratin 18 
Cytokeratin 18 
Cytokeratin 8 
Cytokeratin 8 
Cytokeratin 8 
Cytokeratin 8 
OP18(Stathmln) 
PDI (pro!y-4-OH-B) 
PDl (proly-4-OH-B) 
Prohibitin 
Prohibitin 
a-1-Antitripsin 
n-1 -Antitripsin 
«-1-Antttripsin 

Pulmonary surfactant-associated protein 
Pulmonary surfactant-associated protein 
Troponin T 
Troponin T 

Triose phosphate isomerase (TPI) 
Triose phosphate isomerase (TPI) 
Triose phosphate isomerase (TPI) 
Trios9 phosphate isomerase (TPi) 
Trios9 phosphate Isomerase (TPI) 
Triose phosphate isomerase (TPI) 
Tropomystn clean-product 
Cytoskeietal tropomyosin 
Tropomyosin 
Tropomyosins 1-5 
Transthyretin 
Transthyretin multimere 

Ubiquitin carboxyl-terminal hydrolase isozymB L1 
Ubiquitin carboxyl-terminal hydrolase isozyme L1 
Ubiquitin carboxyl-terminal hydrolase isozyme L1 
Vimentin 

Vimentln-derived protein (vid4) 
Vimentin-derived protein (vid2) 
Vimentin-derived protein (vidl) 
14-3-3 rj 



such that p p (i> denotes the rth largest correlation coefficient for pth 
permutation. Hence, tl>e expected correlation coefficient, p fc -(f), was the 
average over the 60 permutations, p^) - 2*. , ^60. A scatter plot of 
observed correlations (p(0) versus the expected correlations is shown in 
Fig. 2D. For this study, we chose threshold A - 0.1 15 so that correlation 
would be considered significant if absolute value of difference between 
tfj) and was greater than the threshold. Twenty-nine (including one 
with observed correlation coefficient -0.4672) of 1 65 pairs of gene and 
protein expression were called significant in such criteria, and the 
permuted data generated an average of 5.1 falsely significant pairs of 
gene and protein expression. This provided an estimated false dis- 
covery rate (the percentage of pairs of gene and protein expression 
identified by chance) for our data set. 

RESULTS 

Correlation of Individual Proteins and mRNA Expression 
within Each Tumor- We have examined quantitativety 165 



protein spots on 2D gels representing 98 genes and com- 
pared protein levels with mRNA levels for a cohort of 85 lung 
adenocarcinomas and normal lung samples. Of the 165 pro- 
tein spots, 69 proteins were represented by only one known 
spot on 2D gels for an individual gene, whereas 96 protein 
spots showed multiple protein products from 29 different 
genes. 2D Western blotting verified the proteins identified by 
mass spectrometry when specific antibodies were available. 
Spearman correlation coefficients of the proteins and their 
associated mRNA for each protein spot were generated using 
all 76 lung adenocarcinomas and nine non-neoplastic lung 
tissues (see Tables I and II, and see Rgs. 1 and 2). The 
correlation coefficients (r) ranged from -0.467 to 0.442 (Fig. 
2D). A total of 28 protein spots (21 genes) were found to have 
a statistically significant correlation between expression of 
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Fig. 1 digrta .rnage of a sHver-stained 2D-PAGE separation of a stage I lung adenocarcinoma showing protein spots separated bv 
molecular mass (MW> and .soelectric point <P/). Twenty-eight protein spots whose expression levels are MM^^^^^ 
indicated by the black arrows B, the outlined areas of A showing protein GRP58. C, 2D Western blot of GRpT3 
adenocarcnoma cell l.ne. D, the outlined areas of A showing the protein isoforms of 0 P 18. E, 2D Western blot o ^ ^18 from Z^clls 



their protein and mRNA (r > 0.2445; p < 0,05). This accounts 
for 17% (28/165) of the 165 protein spots. Among the 69 
genes for which only a single protein spot was known (Table 
I), nine genes (9/69, 13%) were observed to show a statisti- 
cally significant relationship between protein and mRNA 
abundance (r > 0.2445; p < 0.05). The proteins whose ex- 
pression levels were correlated with their mRNA abundance 
included those involved in signal transduction, carbohydrate 
metabolism, apoptosis, protein post-transiational modifica- 
tion, structural proteins, and heat shock proteins (Table III). 

individual Isoforms of the Same Protein Have Different 
Protein/mRNA Correlation Coefficients— Of the 165 protein 
spots. 96 represent protein products of 29 genes with at least 
two isoforms. Among these 96 protein spots, 19 (19/96 pro- 
tein spots, 20%) showed a statistically significant correlation 
between their protein and mRNA expression if > 0.2445; p < 
0.05) (Table II) and represented 12 genes (12/29, 41%). Individ- 
ual isoforms of the same protein demonstrated different 
protein/mRNA correlation coefficients. For example, 2D -PAGE/ 
Western analysis revealed four isoforms of OP18 differing in 
regards to isoelectric point but similar in molecular weight. 
Three of the four isoforms (spots 1 492, 1 493, and 1 494) showed 
a statistically significant correlation between their protein and 
mRNA abundance (r = 0.3234, 0.3154, and 0.4003, respective- 
ly). The forth isoform (spot 1488) showed no correlation be- 



tween protein and mRNA expression (r = 0.0495), Similarly, just 
one of five quantified isoforms of cytokeratin 8 (spot 439) dem- 
onstrated a statistically significant correlation between protein 
and mRNA abundance (r = 0.3049; p < 0.05) (Table II). 

In addition to differences In the relationship between mRNA 
levels and protein expression among separate isoforms, some 
genes with very comparable mRNA levels showed a 24-fold 
difference in their protein expression. Genes with comparable 
protein expression levels also showed up to a 28-fold vari- 
ance in their mRNA levels. 

Lack of Correlation for mRNA and Protein Expression when 
Using Average Tumor Values across Alt 165 Protein Spots (98 
Genes)-lhe relationship between mRNA and protein expres- 
sion was also examined by using the average expression 
values for ail samples. To analyze this relationship using this 
approach, the average value for each protein or mRNA was 
generated using all 85 lung tissue samples. The range of 
normalized average protein values ranged from -0.0646 to 
0.0979 (raw value 0.0036 to 4.1 947), and the range for mRNA 
was from 0 to 15260.5 for ail 165 individual protein spots. The 
Spearman correlation coefficient for the whole data set (165 
protein spots/98 genes) was -0.025 (Rg. 3A). Even for the 28 
protein spots (Rg. 2D) that were found to have a statistically 
significant correlation between their mRNA and protein, use of 
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A: 0H8 (spot #149) 
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Ra. 2. A-C, plots showing the correlation between mRNA and protein for the three selected genes Op18, Annexin IV, and GAPD for all 76 
lung adenocarcmomas and mne nonneoplastic lung samples (p < 0.05). O, distribution of all 165 Spearman correlation^^ 
ve«on analysis us.ng SAM. A more detailed description of the method is provided under "Experimental Procedures/ Approxima 17% 
of he 1 65 proteins demonstrate a significant correlation between mRNA and protein levels as demonstrated by the valuesThown beyond the 
outer range of threshold A = 0.1 15. Normalized protein values were used, thus negative values for some proteins a/e TrJe^ed 



the average value resulted in a correlation coefficient value of 
-0.035 t which was not significant (Fig. 38). 

Lack of a Relationship between Protein/mRNA Correlation 
Coefficients and Average Protein Abundance- To determine 
whether an absolute protein level might influence the corre- 
lation with mRNA, the mean value of each protein (relative 
abundance) and the Spearman protein/mRNA correlation co- 
efficients among all 85 samples were examined. No relation- 
ship between the protein abundance and the correlation co- 
efficients was observed (r = 0.039; p > 0.05). A detailed 
analysis of separate subsets of proteins with differing levels of 
abundance (less than -0.0014, larger than -0.0014, or larger 
than 0.0077) also showed a lack of correlation between mRNA 
and protein expression among the 83 (50%), 82 (50%), and 41 
(25%) of 165 total protein spots, respectively (r = 0.016, 0.08, 
and 0.172, respectively). 

Stage-related Changes in the Protein/mRNA Correlation 
Coefficients—To determine whether the 21 genes (28 protein 
spots) showing a significant correlation between the protein 
and mRNA expression among all samples demonstrate 
changes in this relationship during tumor progression, the 
correlations were examined separately for stage I {n = 57) and 



stage It) {n = 19) lung adenocarcinomas (Table III). The num- 
ber of non-neoplastlc lung samples (n = 9) was insufficient for 
a separate correlation analysis of this group. Many of the 
protein spots represent one of several known protein isoforms 
for a given gene. The majority of genes (16/21) did not differ in 
the protein/mRNA correlation between stage I and stage III 
tumors indicating a similar regulatory relationship between the 
mRNA and protein spot GRP-58, PSMC, SOD1, TPI1, and 
VIM, however, were found to demonstrate significant differ- 
ences in the correlation coefficients between stage I and 
stage III lung adenocarcinomas. For GRP-58, PSMC, and VIM 
the change in the correlation coefficient was because of a 
relative increase in protein expression in stage III tumors. For 
SOD and TPI the change resulted from a relative decrease in 
expression of this specific protein in stage III tumors. 

DISCUSSION 

Relatively little is known about the regulatory mechanisms 
controlling the complex patterns of protein abundance and 
post-translational modification in tumors. Most reports con- 
cerning the regulation of protein translation have focused on 
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Table III 

Stage-dependent analysis of protein-mRNA correlation coefficients 
r, correlation coefficient. Values in boldface indicate a significant difference between stage I and stage III. 



opOt 


Gene name 


r (btage i) 


r fCtarta lift 

t (oiage uij 


1874 


AKR1B1 


0.269 


0.106 


2524 


ANXA1 


0.184 


0.572 


0994 


ANXA4 


0.660 


0.362 


0963 


ANXA5 


0.241 


0.390 


1314 


DJ-1 


0.363 


0.354 


1405 


FTL 


0.126 


0.358 


0855 


GAPD 


0.243 


0.581 


0350 


GRP58 


0.327 


-0.087 


0264 


HNRPK 


0.360 


0.243 


1192 


HSPB3 


0.457 


0.633 


0523 


KRT1B 


0.115 


0.371 


0439 


KRT8 


0.323 


0.436 


1492 


LAP 18 


0.483 


0.663 


153B 


LGALS1 


0.200 


0.528 


1252 


PSMC 


0.253 


0.060 


1104 


SFN 


0.465 


0.475 


1454 


SOD1 


0.352 


0.079 


1203 


TPI1 


0.378 


0.009 


0957 


TPM1 


0.475 


0.225 


0593 


VIM 


-0.054 


0.556 


0935 


YWHAH 


0.283 


0.210 



one or several protein products (18). Cells et a/. (19) found a 
good correlation between transcript and protein levels among 
40 well resolved, abundant proteins using a proteomic and 
microarray study of bladder cancer. By comparing the mRNA 
and protein expression levels within the same tumor samples, 
we found that 1 7% (28/165) of the protein spots (21/98 genes) 
show a statistically significant correlation between mRNA and 
protein. These proteins appear to represent a diverse group of 
gene products and include those involved in signal transduc- 
tion, carbohydrate metabolism, protein modification, cell struc- 
ture, heat shock, and apoptosis. These results suggest that 
expression of this subset of 165 proteins is likely to be regulated 
at the transcriptional level in these tissues. The majority of the 
protein isoforms, however, did not correlate with mRNA levels, 
and thus their expression is regulated by other mechanisms. We 
also observed a subset of proteins that demonstrated a nega- 
tive correlation with the mRNA expression values; for example 
a-haptoglobin demonstrated a strong negative correlation with 
its mRNA expression values. This may reflect negative feedback 
on the mRNA or the protein or the presence of other regulatory 
influences that are not understood currently. 

Post-translational modification or processing will result in 
individual protein products of the same gene migrating to 
different locations on 2D- PAGE gels (20). Because the identity 
of all possible isoforms for each protein examined has not 
been characterized completely, this may influence the corre- 
lation analyses performed in this study. This is partly because 
of limitations of the 2D-PAGE and mass spectrometry tech- 
nologies (21, 22). Potential inconsistencies between mRNA 
and protein correlations that have been reported may also be 
because of differences, even in the same gene, in the mech- 



Function 

Carbohydrate metabolism; electron transporter 
Phospholipase inhibitor; signal transduction 
Phospholipase inhibitor 

Phospholipase Inhibitor; calcium binding; phospholipid binding 
Signal transduction 
Iron storage protein 

Carbohydrate metabolism {glycolysis regulation) 
Signal transduction; protein disulfide isomerase 
RNA-binding protein (RNA processing/modification) 
Heat shock protein 
Structural protein 
Structural protein 

Signal transduction; eel! growth and maintenance 
Apoptosis; ceil adhesion; cell size control 
Protein degradation 

Signal transduction (protein kinase C inhibitor) 

Oxidoreductase 

Carbohydrate metabolism 

Structural protein (muscle); control of heart 

Structural protein 

Signal transduction 



anisms of protein translation among different cells or as 
measured in different laboratories (23). 

In this study, we examined 165 protein spots identified in 
lung adenocarcinomas. Ninety-six protein spots, representing 
the products of 29 genes, contained at least two protein 
isoforms. Nineteen of 96 protein spots, representing 12 
genes, were shown to have a statistically significant correla- 
tion between their protein and mRNA expression, suggesting 
that the levels of these proteins reflects the transcription of the 
corresponding genes. Differences in protein/mRNA correlations 
were found among the Individual isoforms of a given protein. For 
example, of the four OP18 isoforms, three showed a statistically 
significant correlation between the protein and mRNA expres- 
sion levels. The lack of relationship for the one isoform, how- 
ever, indicates that individual protein isoforms of the same gene 
product can be regulated differentially. This is not unexpected 
and likely reflects other post-translational mechanisms that can 
influence isoform abundance in tissues and cancer. 

In addition to the analyses of the correlation of mRNA/ 
protein within the same tumor samples, we also tested the 
global relationship between mRNA and the corresponding 
protein abundance across all 165 protein spots in the lung 
samples. A protein and mRNA average value for each gene 
was generated using all 85 lung tissues samples. We ob- 
served a very wide range of normalized average protein and 
mRNA values. The correlation coefficient generated using this 
average value data set was -0.025, and even for the 28 
protein spots that showed a statistically significant correlation 
between individual mRNA and proteins, the correlation value 
was only -0.035. This suggests that it is not possible to 
predict overall protein expression levels based on average 
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Fig. 3, The overall correlation of 
mRNA and protein levels across all 
165 protein spots (A) and across 28 
protein spots that contained Individ- 
ual r values larger than 0.244 (B) are 
shown. Each protein or mRNA mean 
value was calculated based on all 76 
lung adenocarcinomas and nine non- 
neoplastic lung samples using quantita- 
tive 2D-PAGE and Affymetrix oligonu- 
cleotide microarrays. The Spearman 
correlation coefficients for the two data 
sets (A and S) were -0.025 and -0.035, 
respectively, indicating a tack of correla- 
tion if mean values for mRNA and protein 
for all samples is used. 
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mRNA abundance in lung cancer samples. This conclusion is 
also supported by previous results from Anderson and Seil- 
harrier (24), who examined 19 genes in human liver cells, and 
by Gygi er a/. (25), who examined 106 genes in yeast. Both 
studies found a lack of correlation between mRNA and protein 
expression when average or overall levels were used. 

A good correlation was reported when the 11 most abun- 
dant proteins were examined in yeast (25), suggesting that the 
level of protein abundance may be a factor that may influence 
the correlation between mRNA and protein. In the present 
study, a fairly wide range of mean protein values among 165 
protein spots in lung adenocarcinomas was observed, and 
the correlation coefficients also varied from -0.467 to 0.442. 



A comparison between the mean value of each protein and 
the correlation coefficient generated using all 85 tissue sam- 
ples did not reveal a strong relationship between the overall 
protein abundance and the correlation coefficients (r = 0.039; 
p > 0.05). Detailed analysis of different subsets of protein abun- 
dance also failed to show a correlation between mRNA and 
protein expression. Thus in contrast to yeast, a relationship 
between mRNA/protein correlation coefficient and protein 
abundance in human lung adenocarcinomas was not observed. 

The results of this study indicate that the level of protein 
abundance in lung adenocarcinomas is associated with the 
corresponding levels of mRNA in 17% (28 proteins) of the 
total 165 protein spots examined. This was substantially 
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higher than the amount predicted to result by chance alone 
(which was 5.1) and suggests that a transcriptional mecha- 
nism likely underlies the abundance of these proteins in lung 
adenocarcinomas. We also demonstrate that the expression 
of individual isoforms of the same protein may or may not 
correlate with the mRNA, indicating that separate and likely 
post-translational mechanisms account for the regulation of 
isoform abundance. These mechanisms may also account for 
the differences in the correlation coefficients observed between 
stage I and stage III tumors, indicating that specific protein 
isoforms show regulatory changes during tumor progression. 
Further studies in lung adenocarcinomas will examine the rela- 
tionship between the expression of individual protein isoforms 
and specific clinical-pathological features of these tumors, such 
as the presence of angiolymphatic invasion, and nodal or pleu- 
ral surface involvement. The potential to identify specific protein 
isoforms associated with biological behavior in lung adenocar- 
cinomas would be of considerable Interest and will add to our 
understanding of the regulation of gene products by transcrip- 
tional, translational, and post-translational mechanisms. 
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An increased high-mobility group A2 expression level is associated with 
malignant phciiotype in pancreatic exocrine tissue. 

Abc_N, Watanabe T, Suzulci V . Matsumoto N Masaki T . Mori T . Sugiyama M 
Cliiappc ttaC, Fusco A , Atomi Y . 

First Department of Surgery, Kyorin University School of Medicine, 6-20-2, Shinkawa 
Mitaka, Tokyo 1 8 1 -86 1 1 , Japan, abenbtg@kyorin-u.ac.jp 

The altered fonn of the high-mobility group A2 (HMGA2) gene is somehow related to 
the generation of human benign and malignant tumours of mesenchymal origin 
However, only a few data on the expression of HMGA2 in malignant tumour originating 
from epithelial tissue are available. In this study, we examined the HMGA2 expression 
level in pancreatic carcinoma, and investigated whether alterations in the HMGA2 
expression level are associated with a malignant phenotype in pancreatic tissue. High- 
mobility group A2 mRNA and protein expression was determined in eight surgically 
resected specimens of non-nepplastic tissue (six specimens of normal pancreatic tissue 
and two of chronic pancreatitis tissue) and 27 pancreatic carcinomas by highly sensitive 
reverse transcriptase-polymerase chain reaction (RT-PCR) techniques and 
immunohistochemical staining, respectively. Reverse transcriptase-polymerase chain 
reaction analysis revealed the expression of tire HMGA2 gene in non-neoplastic 
pancreatic tissue, although its expression level was significantly lower than that in 
carcinoma. Immunohistochemical analysis indicated that die presence of the HMGA2 
gene in non-neoplastic pancreatic tissue observed in RT-PCR reflects its abundant 
expression in islet cells, together with its focal expression in duct epithelial cells Intense 
and multifocal of diffuse HMGA2 immunoreactivity was noted in all the pancreatic 
carcinoma examined, A strong correlation between HMGA2 overexpression and the 
diagnosis of carcinoma was statistically verified. Based on these findings, we propose . 
that an i ncreased expression level of the HMG A2 protein is closely associated with the 
malignant phenotype in the pancreatic exocrine system, and accordingly, HMGA2 could 
serve as a potential diagnostic molecular marker for distinguishing pancreatic malignant 
cells from non-ncoplasfic pancreatic exocrine cells. 
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extracts. If these minor cell proteins differ among cells to the same extent as the 
more abundant proteins, as is commonly assumed, only a small number of pro- 
tein differences (perhaps several hundred) suffice to create very large differences 
in cell morphology and behavior. 

A Cell Can Change the Expression of Its Genes 
in Response to External Signals 3 

Most of the specialized cells in a multicellular organism are capable of altering 
Iheir patterns of gene expression in response to extracellular cues. If a liver cell 
is exposed to a glucocorticoid hormone, for example, ihe production of several 
specific proteins is dramatically increased. Glucocorticoids are released during 
periods of starvation or intense exercise and signal Ihe liver lo increase the 
production of glucose from amino acids and other small molecules; (he set of 
proteins whose production is induced includes enzymes such as tyrosine amino- 
transferase, which helps to convert tyrosine to glucose. When the hormone is no 
longer present, the production of these proteins drops to its normal level. 

Other cell types respond to glucocorticoids in different ways. In fat cells, for 
example, the production of tyrosine aminotransferase is reduced, while some 
other cell types do not respond to glucocorticoids at all. These examples illustrate 
a general feature of cell specialization — different cell types often respond in dif- 
ferent ways to die same extracellular signal. Underlying this specialization are 
features that do not change, which give each cell type its permanently distinc- 
tive character. These features reflect die persistent expression of different sets of 
genes. 

Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNv\ to Protein 1 

If differences between the various cell types of an organism depend on the par- 
ticular genes that the cells express, at what level is the control of gene expression 
exercised? There are many steps in the pathway leading from DNA to protein, and 
all of them tan in principle he regulated. Thus a cell can control the proteins it 
makes by (1) c on t rolling when and how often a given gene is transcribed {tran- 
scriptional control), (2) controlling how the primary ft N A transcript is spliced or 
otherwise processed (RNA processing control), (3) selecting which completed 
rnRNAs in the eel) nucleus are exported to the cytoplasm (UNA transport con- 
trol), t-t) selecting which mRNAs iir the cytoplasm rue translate! I by nhosomes 
(translational control), {$) selectively destabilizing certain mliNA molecules in 
the cytoplasm (mKNA degradation control), or (f>) selectively activating, inacti- 
vating, or compartmentalizing specific protein molecules after they have been 
made (protein activity control) (Figuie 0-2). 

l : or most genes transcript ion a) controls are paramount. This makes sense 
because, of all the possible control points illustrated in Figure 'J-^, only transcrip- 
tional control ensures that no super tluous intermediates are synthesized. In the 
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Summary 

The many types of celts in animals ami plants are treated largely through mecha- 
nisms that cause different genes to be transcribed in different cells. Since many spe- 
cialized animal cells can maintain their nnitpte character when grown in culture, the 
gene regulatory mechanisms involved in creating them mast be stable once estab- 
lished ami fieri table when the celt divides, endowing the cell with a memory of its 
developmental history. Pr oca ryot es mni yeas fs provide tin usually accessible modal 
systems in which to study gene regulatory mechanisms, some of which may be rel- 
evant to the creation ofspe.dtdize.tlce.il types in higher eucaryotes. One such mecha- 
nism involves a competitive interaction between two for more) gene regulatory pro- 
teins, each of which inhibits the synthesis of the other; this can create a jMp flop 
switch that switches a cell between two alternative patterns of gene expression. Di- 
rect or indirect positive feedback loops, which enable gene regulatory proteins to 
perpetuate their own synthesis, provide a general mechanism for cell memory. 

tn enenryotes gene transcription is generally controlled by combinations of gene 
regulatory proteins. It is thought that each type of celt in a higher encoiyotic organism 
contains a specific combination of gene regulatory proteins that ensures the expres- 
sion of only those genes appropriate to that type of celt. A given gene regulatory pro- 
tein may be expressed iti a variety ofcirrnmstances ami typically is involved in the 
regulation of many genes. 

In addition to diffusible gene regulatory proteins, inherited states of chromatin 
condensation are also utilized by ettcaryoiic cells to regulate gene expression. In ver- 
tebrates DNA methylafion also plays a part, mainly as a device to reinforce, decisions 
about gene expression that are made initially by other mechanisms. 
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FROM DNA TO RNA 

Transcription and translation are the means by which cells read out, or.express, 
the genetic instructions in their genes. Because many identical RNA copies can 
be made from the same gene, and each RNA molecule can direct the synthesis 
of many identical protein molecules, cells can synthesize a large amount of 
protein rapidly when necessary. But each gene can also be transcribed and 
translated with a different efficiency allowing the cell to make vast quantities of 
some proteins and tiny quantities of others (Figure 6-3). Moreover, as we see in 
the next chapter, a cell can change (or regulate) the expression of each of its 
genes according to the needs of the moment— most obviously by controlling 
the production of its RNA. 

Portions of DNA Sequence Are Transcribed into RNA 

The first step a cell takes in reading out a needed part of its genetic instructions 
is to copy a particular portion of its DNA nucleotide sequence— a gene— into an 
RNA nucleotide sequence. The information in RNA, although copied into another 
chemical form, is still written in essentially the same language as it is in DNA— 
the language of a nucleotide sequence. Hence the name transcription. 

Like DNA, RNA is a linear polymer made of four different types of nucleotide 
subunits linked together by pbosphodiester bonds (Figure 6-1). It differs from 
DNA chemically in two respects: (I) the nucleotides - in RNA are 
ribonucleotides— that is, they contain the sugar ribose (hence the name ribvnu- 
cleic acid) rather than deoxyribose; (2) although, like DNA, RNA contains the 
bases adenine (A), guanine IG). and cytosine (C), it contains ihe base uracil |U) 
instead of ihe thymine (T) in DNA. Since U, like T, can base-pair by hydrogen- 
bonding with A (Figure 6-5). the complementary base- pairing properties 
described for DNA in Chapters -I and 5 apply also to RNA (in RNA, G paiis with 
C, and A pairs with U). It is not uncommon, however, to find other types of hase 
pairs in RNA: for example, G pairing with U occasionally. 

Despite these small chemical differences, DNA and RNA differ quite dra- 
matically in overall si menu e. Whereas DNA always occurs in cells as a double- 
stranded helix, RNA is single- stranded. RNA chains therefore fold up into a 
variety of shapes, just as a polypeptide chain folds up to form the final shape of 
a protein (Figure 6-6). As nv see later in this chaprer, the ability to fold into com- 
plex three-dimensional shapes allows some RNA molecules to have MnrcimaJ 
and catalytic functions. 



Figure 6—3 Genes can be expressed 
with different efficiencies. GeneArs 
transcribed and translated much more 
efficientty than gene B.This allows ihe 
amount of protein A ir> the celt to be 
much greater than that of protein B. 
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Transcription Produces RNA Complementary to 
One Strand of DNA 

AJI of the RNA in a cell is .made by DNA i miisci iption. a process that has cer- 
tain similarities to the piooss of DNA replication discussed in Chapter S 
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Figure 6-90 The production of a 
protein by a eucaryotic cell. The final 
level of each protein in a eucaryotic cell 
depends upon the efficiency of each step 
depicted 
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ure G-90) cot ild be regulated by the cell for e;uh individual piotein. However, as 
we shall see in (Chapter 7, (he initial ion of tra nsc ii pi km is J he most common 
point foi a cell to regulate the expression of each of its genes". This makes sense, 
inasmuch as the most efficient way to Keep a gene horn being expressed is to 
block the very first step — the transcription of its DNA seijnence into an RNA 
molecule. 

Summary 

The translation of (he nucleotide sequence of an mRNA molecule info protein takes 
place in the cytoplasm on a large ribonncleoprotein assembly called n ribosome. The 
amino /Ida's used for protein synthesis ore first attached to a family of tRNA 
molecules, each of which recognizes, by complement ary hose- pair interactions, par- 
ticular sets of three nucleotides in the mRNA (cottons). The sequence of nucleotides in 
the mRNA is then rend from one end to the other in sets of three according to the 
genetic cotfe. 

To initiate translation, a small ribosomal subunit hintls to the tnliNA molecule 
at a start codon (AUG) that is recognized by a unique initiator tRNA molecule. A 
large ribosomal Sidmnit binds to complete the ribosome ami begin the elongation 
phase t)j pintein synthesis. During this phase, aminoacyl tRNAs — each bearing a 
specific anti no acid bind sequentially to the appropriate ettdon in miii\'A by forming 
complementary base pairs with the tRNA anticodon. lurch amino acid is added to the 
C-terminal end of the growing polypeptide !/)■ menus <>/ // cycle of three sequential 
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Figure 7-5 Six steps at which 
eucaryotic gene expression can be 
controlled. Controls that operate at 
steps I through 5 are discussed in this 
chapter. Step 6, the regulation of proLein 
activity, includes reversible activation or 
inac ovation by protein phosphorylation ' 
(discussed in Chapter 3) as well as 
irreversible tnactrvation by proteolytic 
degradation (discussed in Chapter 6). 



Gene Expression Can Be Regulated at Many of the Steps 
in the Pathway from DNA to RNA to Protein 

If differences among the various celt types of an organism depend on the partic- 
ular genes that the cells express, at what level is the control of gene expression 
exercised? As we saw in the last chapter, there are many steps in "the pathway 
leading from DNA to protein, and all of them can in principle be regulated. Thus 
3 cell can control the proteins it makes by ( 1) controlling when and .how often a 
given gene is transcribed (transcriptional control), (2) controlling how the RNA 
transcript is spliced or otherwise processed (RNA processing control), (3) 
selecting which completed mRNAs in the cell nucleus are exported to the cytosol 
and determining where in I lie cytosol they are localized (RNA transport and 
localisation control), W) selecting which mRNAs in the cytoplasm are translated 
by ribosomes (translations! control), IS) selectively destabilizing certain mRNA 
molecules in the cytoplasm (mRNA degradation control), or (f>) selectively acti- 
vating, inactivating, degrading, or compartmentalizing specific, protein 
molecules after they have been made (protein activity control) (1'ignre 7-.S). 

For most genes transcriptional controls are paramount. This makes sense 
because, of all the possible control points illustrated in Pigure 7-5, only tran- 
scriptional control ensures that the cell will not synthesize su peril nous inter me- 
diates. In the following sections 'we discuss- the ON A and protein components 
■;'iat peiiorm this function by regulating the initiation of gene transcription. We 
-hall return at ihe end of the chapter to the additional ways of regulating gene 
< xpiession. 



nummary 

i he genome of n cell contains in its DNA sequence the information to make many 
thousands of different protein and UNA molecules. A cell typically expresses only a 
traction of its genes, and the different types of cells in mnfticelhdnr organisms tnise 
--.cause different sets of genes are e.\ pressed. Moreover, cells can change the pattern 

genes they express in response to changes in their environment, such as signals 
'•■rin other cells. Although all of the steps involved in expressing a gene can in pnn- 
:: pie he regulated, for most genes the initnttion of UNA transcription is the most 

.port ant point of control. 



■>w does a cell determine which of its thousands of genes io u a ascribe? As 

• miioned briefly in Chapters I and - 1>. the r.ansmption of each gene is con- 
He d by a regulatory region of DNA n --la lively near the bite where hanscripiion 
■Vns. .Some regulatory regions air simple and act as switches that ate thiown 

■ single signal. Many others are comply and a. I as tiny microprocessors, 
oonding to a variety of signals drat thev inter pi ei and integrate to'swiich the 

i- ;:hboringgene on or oil. Whether eornplrv or simple, these swiiching devices 
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Selective apoptosis of natural killer-cell tumours by l-asparaginase. 

Ando M , Sugimoto K, Kitoh T . Sasaki M , Mulcai K , Ando J, E gashira M , Schuster 
SM , OshimiK . 

Department of Haematology, Juntendo University School of Medicine, Tokyo, Japan. 

We examined the effectiveness of various anti-tumour agents to natural killer (NK)-cell 
tumour cell lines and samples, which are generally resistant to chemotherapy, using flow 
cytometric terminal deo xy nucleotidyl trans ferase-mediated dUTP-biotin nick end- 
labelling (TUNEL) assay. Although NK-YS and NK-92 were highly resistant to various 
anti-tumour agents, I-asparaginase induced apoptosis in these two NIC-cell lines. NK-cell 
ieukaemia/lymphoma and acute lymphoblastic leukaemia (ALL) samples, were 
selectively sensitive to l-asparaginase and to doxorubicin (DXR) respectively. Samples of 
chronic NK lymphocytosis, an NK-cell disorder with an indolent clinical course, were 
resistant to both drugs. Our study clearly separated two major categories of NK-cell 
disorders and ALL according to the sensitivity to DXR and ^asparaginase. We examined 
asparagine synthetase levels by real-time quantitative polymerase chain reaction (RQ- 
PCR) and immunostalning in these samples. At least in nasal-type NK-cell lymphoma, 
there was a good correlation among asparagine synthetase expression, in vitro sensitivity 
and clinical response to l-asparaginase. In aggressive NK-cell leukaemia, although 
asparagine synthetase expression was high at both mRNA and protein levels, l- 
asparaginase induced considerable apoptosis. Furthermore, samples of each disease entity 
occupied a distinct area in two-dimensional plotting with asparagine synthetase mRNA 
level (RQ-PCR) and in vitrol-asparaginase sensitivity (TUNEL assay). We confirmed 
rather specific anti-tumour activity of l-asparaginase against NK-cell tumours in vitro, . 
which provides an experimental background to the clinical use of l-asparaginase for NK- 
cell tumours. 
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Human thyroid carcinoma cell lines and normal thyrocytes: expression 
and regulation of matrix metaUoproteinase-1 and tissue matrix 
metalloproteinase inhibitor ! messenger-RNA and protein. 

AustG, Hofmann A, Laue S, Rost A. Kohler T. Scherbaum WA . 

Institut of Anatomy, University of Leipzig, Germany. 

Matrix metalloproteinase- 1 (MMP-1) and tissue matrix metalloproteinase inhibitor 1 
(TIM?- 1) play an important role in remodeling the extracellular matrix in normal and 
pathological processes. The effect of phorbol-myristate acetate (PMA), interleukin-1 (1L- 
. ,1), and tumor necrosis factor-alpha (TNF-aipha) on MMP-1 and TMP-1 expression was 
studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 
1736 thyroid carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA 
(mRNA) levels were monitored by competitive semiquantitative reverse transcriptase 
polymerase chain reaction (RT-PCR) after 24 hours. Culture supernatants were assayed 
for free and/or complex ed MMP-1 and T1MP-1 after 48 hours using enzyme-linked 
immunosorbent assay (ELI S A) systems (detection limit: <2 ng/mL). MMP-1 and T1MP-1 
mRNA were present in all cell types, although thyrocytes showed MMP-1 mRNA levels 
near the detection limit. 8505 C expressed MMP-1 mRNA levels of up to 10(6) times 
those of the other cells analyzed. PMA and IL-1 increased MMP-1 mRNA in most cell . 
types. T1MP-I mRNA increased after treatment with PMA in all cells except 8505 C, 
whereas only slight effects were shown after IL-1 stimulation. MMP-1 protein was 
undetectable in normal thyrocyte cultures, but was secreted spontaneously by all cell 
lines ([ng/mL]; C 643: 15+/-7; HTh 74: 81+A1; SW 1736: 13+/-2; 8505 C: 2097+A320). 
There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99, p 
< .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. 
Fibroblasts ([ng/mL] 517+/-55) and the cell lines (C 643: 142+/-48; HTh 74: 1 15+/-13; 
SW 1736: 202+/-14; 8505C: 120+/-19) secreted T1MP-1 in unstimulated cultures, 
whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 upregulated T1MP-1 secretion after 48 hours in SW 1 736, HTh 74, and C 
643 cells. Our data suggest that in contrast to normal thyrocytes, dedifferentiated thyroid 
carcinoma cell lines are potential producers of MMP-1. as well asTIMP-I. High MMP-1 
or MMP-1/T1MP-I expression may play a role in tissue invasion of undifferentiated 
thyroid cancer cells. 

PM1D: 9349574 [PubMed - indexed for MEDLfNE] 
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Human Thyroid Carcinoma Cell Lines and Normal 
Thyrocytes: Expression and Regulation of Matrix 
Metalloproteinase-1 and Tissue Matrix Metalloproteinase 
Inhibitor- 1 Messenger-RNA and Protein 

G. AUST,» A. HOFMANN, 2 S. LAUE* A. ROST^ T. KOHLER, 3 and ■ W.A. SCHERBAUM 4 



ABSTRACT 

Matrix metalloproteinase-l (MMP-1) and tissue matrix metalloproteinase inhibitor 1 (TTMP-1) play an impor- 
tant role in remodeling the extracellular matrix in normal and pathological processes. The effect of phorbol-myris- 
tate acetate (PMA), mterleukin-1 (HI), and tumor necrosis factor-a (TNF-a) on MMP-1 and TIMP-1 expres- 
sion was studied on highly purified thyrocytes and undifferentiated 8505 C, C 643, HTh 74, SW 1736 thyroid 
carcinoma cells compared with thyroid-derived fibroblasts. Messenger RNA (mRNA) levels were monitored by 
competitive semiquantitative reverse transcriptase polymerase chain reaction (RT-PCR) after 24 hours Culture 
supernatant* were assayed for free and/or complexed MMP-1 and TTMP-1 after 48 hours using erayme-linked 
immunosorbent assay (EL1SA) systems {detection limit; <2 ng/mL). MMP-1 and T1MP-1 mRNA were present in 
all cell types, although thyrocytes showed MMP-1 mRNA levels near the detection limit. 8505 C expressed MMP- 
1 mRNA levels of up to 10 6 times those of the other ceDs analyzed. PMA and EL- 1 increased MMP-1 mRNA in 
most cell types. TIMP-1 mRNA increased after treatment with PMA in all cells except 8505 C, whereas only 
shght effects were shown after IL-1 stimulation. MMP-1 protein was undetectable in normal thyrocyte cultures, 
but was secreted spontaneously by all cell lines ((ng/mlj; C 643: 15 ± 7; HTh 74: 81 ± 1; SW 1736: 13 ± 2; 
8505Q 2097 ± 320). There was a strong correlation between levels of MMP-1 mRNA and protein (r = 0.99* 
p < .0001). PMA and IL-1 increased MMP-1 secretion in all cell types after 48 hours. Fibroblasts {[ng/mLJ 517 ± 
55) and the cell lines {C 643: 142 ± 48; HTh 74: 115 ± 13; SW 1736: 202 ± 14; 8505 C: 120 ± 19) secreted 
TIMP-1 in unstimulated cultures, whereas only a trace amount was detected in thyrocyte cultures, even after PMA 
treatment. IL-1 uprcgulated TIMP-1 secretion after 48 hours in SW 1736, HTh 74, and C 643 cells. Our data 
suggest that in contrast to normal thyrocytes, dedifferentiated thyroid carcinoma cell lines are potential produc- 
ers of MMP-1 as well as TIMP-1. High MMP-1 or MMP-l/TIMP-1 expression may play a role in tissue inva- 
sion of undifferentiated thyroid cancer cells. 



INTRODUCTION 

MATRDC METALLOPROTTINASES, (MMPs) Constitute 3 
family of structurally related proteolytic enzymes re- 
sponsible for the proteolytic degradation of extracellular 
matrix (ECM) components. They are important partici- 
pants m normal tissue remodeling and contribute ro the 
phenorype of several pathological conditions thai are as^ 
sdoared with progressive ECM degradation. MMPs are 
highly regulated at different levels (1). At the transcrip- 
tional level, MMP expression can be directly induced or 



suppressed on external stimulation, ie, with cytokines, 
phorbol 12-myristate 13-acetate (PMA), lipopolvsaccaride 
(LPS), or retinoic acid (2,3). After secretion at post-tran- 
scriptional level, latent MMP proenzymes are regulated by 
proteolytic activation and interaction with tissue inhibitors 
of matrix metalloproteinase (TIMPs), their specific in- 
hibitors. Any imbalance between the proteolytic MMPs ac- 
tivities and the TIMPs that could be influenced and caused 
by cytokines could potentially lead to pathological condi- 
tions (4). 

MMP-1, although known as an interstitial collagenase, 



Mnsnrui pi Anaiomy, ^Department of Internal Medicine III, and ^Institute of Clinical ChemW ami Pa.hobiochcmistry, University of 
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is the only enzyme active at neutral pH that can degrade 
extracellular fibers comprised of collagen rypes I, U, and 
III. With this initial step, MMP-1 provides the cleavage 
products to other collagenase types (5). The major specific 
inhibitor of MMP-1 is TIMP-1, a 28.5-kd glycoprotein, 
which forms 3:1 stoichiometric complexes with the pro- 
tease (6). Cytokines and growth factors have been shown 
to regulate the expression of both MMP-1 "and T1MP-1 
d>7>8). 

Although the participation of MMP-1 as the initial col- 
lagenase in tissue breakdown during tumor development is 
well documented (9-11), only one study has described the 
expression (12) but no study has as yet investigated the reg- 
ulation of this enzyme in different thyroid rumors. Few stud- 
ies have been published investigating rhe role of other 
MMPs in normal and pathological thyroid tissue by in situ 
hybridization and immunohistochemistry (13-16). Fur- 
thermore, tissue remodeling includes both the action of 
MMPs and their inhibitors; thus, these enzymes could be 
involved in autoimmune and other nonautoimmune thyroid 
diseases during morphological changes (1 7,1 8). It is still un- 
known whether or not thyrocytes are able to express MMPs 
and TlMPs. Although type 1Y collagenases (MMP-2 and 
MMP-9) were detected in various human epithelial cells of 
different tissue origin (19,20), only one study described the 
secretion of MMP-1 by epithelial cells (21). 

Highly purified normal thyrocytes and four thyroid car- 
cinoma cell lines were included in this study to investigate 
the involvement of these cells in MMP-1 and T1MP-1 pro- 
duction during thyroid tissue remodeling processes and in 
malignant thyroid neoplasms. MMP-1 and T1MP-1 ex- 
pression were studied at both the rnRNA and protein level 
by semiquantitative RT-PCR and ELJSA measurement, re- 
spectively. 

In unstimulated carcinoma cell lines both MMP-1 and 
T1MP-1 rnRNA were expressed, partly at a high level, fol- 
lowed by the spontaneous secretion of the proteins. The 
various conditions for the stimulation of the different cell 
lines by cytokines and PMA were defined. In contrast to 
the cell lines, normal thyrocytes did not secrete MMP-1 
and only trace amount of T1MP-1, even after stimulation 
with PMA. 



MATERIALS AND METHODS 

Preparation of tissues, 'thyroid-derived cells, and 
cell lines 

Thyrocytes were prepared from surgical thyroid speci- 
mens from 3 patients (1 Graves' disease, 2 nontoxic goi- 
ter; mean age 54.3 ± 5.0 years). Fibroblasts were separated 
from thyroid tissue of 5 other patients (3 Graves' disease, 
2 nontoxic goitci; mean age -13.6 i 6.4 years). Graves 1 dis- 
ease and nontoxic goiter were diagnosed on the strength 
of clinical, biochemical, and immunologic features as well 
as thyroid scintiscans. 

Thyroid tissue was trimmed of fat and connective tissue 
immediately after surgery. Thyroidrderived cells were en- 
riched after giaduaJ enzymatic digestion of tissue and cul- 
tured over a period of 16 hours as desciibed. Thyrocytes 
were obtained from the adherent fi action by incubating 



the cell monolayer with phosphate buffered saline (PBS) 
without Ca 2 ~/Mg 2+ for 45 minutes (22). Residual fibro- 
blasts were removed after subsequent incubation of the 
cells with the hbroblast-specific mab FibASOl (22> and 
goat-anti-mouse IgG-DYNABEADS® M45G (DYNAL, 
Hamburg, Germany) according to the manufacturer's pro- 
tocol.. 

Thyroid-derived fibroblasts were obtained after cuJturing 
small pieces of thyroid tissue in Dulbeccos's Modified Ea- 
gle's Medium (DMEM) with 10% fetal calf serum (FCS) 
and harvested in the 5th to 7th passage. The purity of the 
thyrocytes and fibroblasts was determined by using indirect 
immunofluorescence technique on a FACS-Scan (Bectori 
Dickinson GmbH, Heidelberg, Germany) as described (22). 

The following human anaplastic thyroid carcinoma cell 
lines were cultured in DMEM with 10% FCS: C 643 (23); 
SW 1 736 (23); and HTh 74 (24). The cell line 8505 C (25) 
was purchased from the German Collection of Microorr 
ganisms and Animal Cell Cultures (DSM ACC219). This 
cell line was established from a primary thyroid tumor 
characterized histologically as a undifferentiated carcinoma 
that was partially composed of poorly differentiated par>- 
illary cells (25). This is a feature of a subgroup of anaplas- 
tic carcinoma (26). The majority of these coexisrent better 
differentiated carcinoma foci in anaplastic carcinoma were 
papillary (26): 

In vitro cultures 

Using 24- well plates, 1 X 10 5 cells were cultured for 24 
hours. The medium was aspirated and replaced with 500 
/iL GPTI-MEM (G1BCO BRL, Grand Island, NY) without 
FCS to eliminate possible stimulation of MMP-1 and 
T1MP-1 production by FCS. The medium contained the 
desired concentration of human IL-la (10 U/ml; Pepro 
Tech EC Ltd., London, UK), TNF-or (100 U/mL; Pepro 
Tech EC Ltd.), interferon-? (1TN-7) (500 U/mL; Pepro 
Tech EC Ltd.), or 10 ng/mL PMA (SIGMA)^ 

Triplicate cultures of each stimulator were analyzed af- 
ter 3, 6, and 24 hours at the rnRNA and after 24 and 48 
hours at the protein level. The supematants were removed 
and stored at -80X for further use. First,.a collagenolytic 
assay based on the digestion of type I collagen was per- 
formed. This method showed direct evidence of free pro- 
MMP : 1 en2yme in the cell culture supernatants of un- 
stimulated and 1L-1 a stimulated 8505 C, HTh 74, and 
C634 cells (data not shown). However, the method does 
not allow quantitation of MMP-1 enzyme activity. Thus, 
the cell culture supernatants were assayed for MMP-1, 
TIMP-1, and MMP1/T1MP-I complex by ELISA (Amer- 
sham Life Sciences, Braunschweig, Germany). The MMP- 
1 assay (sensitivity: 1.7 ng/mL) detected only total human 
MMP-1, ie, free MMP-1 and MMP-1 complexed with in- 
hibitors such as TIMP-1. h did not detect MMP-1 bound 
by the nonspecific protease inhibitor a 2 -macrog}ob\ilin. 
The MMP-1/T1MP-1 assay (sensitiviry: 1.5 ng/mL) de- 
tected MMP-l/TlMP-1 complex, te, activated MMP-1 that 
has been subsequently complexed with the specific MMP- 
1 inhibitor TLMP-1. It did not detect free active MMP-1, 
free TIMP-1, or pro-MMP-1. There was no cross-reactiv- 
ity with active MMP-1 bound by the nonspecific protease 
inhibitor a 2 -macrpglobuJin. The T1MP- 1 assay (sensitivity: 
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1.25 ng/mL) detected total human TlMP-1, ie, free TIMP- 
1 and that complexed with MMPs. The assay did fully 
cross-react with TlMP-1 in complexes with other MMP. 
It did not cross-react with TIMP-2. 

RNA extraction and cDNA synthesis 

For gene expression studies, 5 mL RNAzolJ 7 * B (Biotex 
Laboratories Inc., Houston, TX) was added to the cell cul- 
ture wells. The content of three wells of any cell type was 
pooled and then stored frozen for further mRNA analysis 
in liquid nitrogen. Total cellular RNA (cDNA) was iso- 
lated from the probes according to the manufacturer's pro- 
tocol. RNA was fractionated on a denaturing 1.0% agarose 
gel and stained with ethidium bromide to confirm that, 
spectrophotometric measurements were accurate arid that 
the RNA had not been degraded. Five micrograms total 
RNA was reverse-transcribed to cDNA using the First- 
strand cDNA synthesis kit of Pharmacia (Uppsala, Swe- 
den), in a total reaction volume of 15 fiL. 

mRNA analysis by competitive RT-PCR 

To correct for variations across different cDNA prepa- 
rations, all samples were first adjusted to contain equal in- 
put glyceralderyde-3-phosphate dehydrogenase (GAPDH) 
cDNA concentrations. Semi-quantitative GAPDH RT-PCR 
was used with a heterologeous synthetic competitor frag- 
ment. The generation of the specific PGR products from 
the competitor and the cDNA with the GAPDH primers 
were published earlier (22,27). 

We then estimated the MMP-1 and TlMPrl cDNA in 
these adjusted samples. The primers were selected using the 
DNAsis computer program (Hitachi Software Engineering 
Co, Yokohama, japan). The primer pairs span one or more 
introns to allow unambiguous discrimination berween 
cDNA and unwelcome contaminating genomic DNA. In 
quantitating MMP-1 and TlMP-1 cDNA, a rapid one- step 
method was introduced to synthesize an internal horriolo- 
geous competitor (plan diagram of procedure: Fig. 1, ex- 
emplary for MMP-1 [28]. A hybrid primer was synthesized 
(MMP-lhy) that consisted of two segments (segj, seg 2 ). It 



MMP1I 

5* » 3' 



s eq ? .' 

MMPThy*! MMPlr 



5 478 bp 



MMPt internal homologeous competitor 

FIG- 1. General scheme for generating homologous com: 
pciitors used lor quantitative PCR. 



had a length of 40 nucleotides, in which 20 nucleotides (segj) 
at the 3' end corresponded to the opposite strand of the tar- 
get sequence a predetermined distance from primer MMP- 
lf* and 20 nucleotides at the 5' end (seg2 = MMP-lr) that 
corresponded to the target sequence upstream from the seg- 
ment segi. Amplification with the primers MMP-lf and 
MMP-1 hy from the cDNA resulted in a 478-base pair (bp) 
(polymerase chain reaction (PCR) product. It was freed from 
excess primers and deoxynuclepside- triphosphates (dNTPs) 
using the Qiaquick Gel Extraction Kit (Qiageri GmbH, 
Hilden, Germany) and quantified. A known number of 
copies of the competitor was introduced in the GAPDH-ad- 
justed samples and amplified with the primers MMP-lf and 
MMP-lr. With this approach, two products were generated, 
one derived from, the cDNA (560 bp) arid another, 82 bp 
smaller in size derived from the interna) competitor (Fig. 1). 
PCR products were resolved by gel electrophoresis (1.5% 
agarose gel). The relative amounts of sample cDNA and 
competitor were quantified by measuring the intensity of 
ethidium fluorescence with a CCD image sensor and. ana- 
lyzing the data with the EASY program (Herolab, Wieslocb, 
Germany). The initial amounts of sample cDNA and com- 
petitor were assumed to be equal in those reactions where 
the ratio of the two products was judged to be equal. This 
was expressed in arbitrary units (AU) (22,29). One AU was 
defined as the lowest concentration of competitor yielding 
a detectable amplification product when added to PCR 
alone. For example, if equivalence between sample cDNA 
and competitor was reached using a 100-fold concentrated 
competitor the relative sample cDNA concentration was 100 
AU. "Thyrocytes and the cell lines were analyzed for the ex- 
pression of thyroid-specific and cytokine receptor mRNAs 
in a simple RT-PCR; The sequences of the TPO and cy- 
tokine receptor primer pairs have been published by Wat- 
son et al. (30) and Tada et al. (31) and gave the following 
product sizes: TPO: 506 bp; 1L-1R type 1 (p80):30Q bp; LL- 
1R type 0 (p68): 392 bp; TNF-aR (p75): 324 bp; TNF-aR 
(p55): 587 bp and IFN-yR: 899 bp. The thyroglobulin (Tg) 
and thyroid stimulating hormone receptor (TSH-R) primer 
pairs were selected according to the published sequences us- 
ing the DNAsis program (Table 1). 

Each 25-uL amplification reaction contained 2.5 jiL 
10 X concentrated PCR buffer (15 mM MgCb, Boehringer 
Mannheim, Germany), 0.3 U Taq DNA polymerase 
(Boehringer Mannheim, Germany), 100 jjM dNTPs 
(Perkin Elmer, Weiterstadt, Germany), 0.1 /xM of each 
primer (1MB, Jena, Germany), and 1 /iL cDNA and com- 
petitor in adjusted dilution. Furthermore, restriction map- 
ping (restriction enzymes: Boehringer Mannheim GmbH, 
Germany) was carried out to confirm the originality of the 
PCR product (Fig. 1, Table 1). 

Statistics 

Protein levels of thyrocyte or fibroblast cultures from the 
different patients and of the thyroid carcinoma cell lines ob- 
tained from three separate experiments were presented as 
mean ± SEM values. Statistical comparisons berween un- 
stimulated and stimulated cell culrutes were performed by 
the alternate (Welch) t-test. The correlation berween basal 
mRNA levels and the unstimulated protein secretion in all 
cell rypes was calculated according to the Spearman method. 
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. Table K Primers, Length of Amplified Templates, Restriction Mapping and Assay CoNDmoNS for RT-PCR 



Primer 



y- 



Lengthof Length of Annealing . Number of 
~3* cDNA (bp) competitor (bp) temperature cycles 



Tg 


forward 
reverse 


GCAGATCTTACrGAGTGCCT 
TGTCAGCACAGTGGCAATAC 


416 




60 


35 


TSH-R 


forward 


ACTTGCTGCAGCTGGTGCT 


354 




65 


35 


exons 1-4 


reverse 


TGAGGGCATCAGGGTCTATG 










TSH-R 


forward 


GAAATTCGGAATACCAGGAACTTA ACT 


896 




53 




cxons 4-10 


reverse 


AACTCATCGGACTTGGGGGTACA 










MMP-I 


forward 
reverse . 
hybrid 


TGGGAGCAAACACATCTGAC 
ATCACTTCTCCGCGAATCGT * 
ATCACTTCTCCCCGAATCGT 
CCATATATGGGTTGIGATGCC 


560 


478 


64 


33 


TIMP-1 


forward 
reverse 
hybrid 


CTTAGGGGATGCCGCTGACA 
GGCAGGCAGGCAAGGTGACG 
GGCAGGCAGGCAAGGTGACG 
GGATGGATAAACAGGGAAAC 


351 


274 . 


.64. 


30 



Tg indicates thyroglobuline; TSH-R, thyroid stimulating hormone receptor; MMP-1, matrix racialloproieinase-I; TIMP-1, tissue inhibitor of 
metalloproteinase-l; bp, base pair. 



RESULTS 

Thyroid specific and. cytokine receptor 
mRNA expression 

isolated thyrocytes as well as 8505 C cells expressed Tg 
and thyroperoxidase (TPO) mRNA, whereas transcripts of 
the TSH-R {exons 1-4, 354 bp, exons 4-1 0, S96 bp) were 
present only in the thyrocytes. The three anaplastic thy- 
roid carcinoma eel! lines SW 1736* C 634, and HTh 74 
were completely negative for the Tg, TPO, and TSH-R mR- 
NAs (Fig. 2). AJ1 cell lines and thyrocytes expressed 1L-3R 
{type 1 and type II), TNF-oR (p75 and p55) and IFN-yR 
mRNA {Fig. 2). 

Basal MMP-1 and TJMP-1 mRNA and 
protein expression 

In most stimulation experiments, mRNA levels did not 
increase until 24 hours of incubation. The 24-hour mRNA 
levels arc shown in Figures 3 and 4. The 3- and 6-hour 
levels are demonstrated in those experiments where the 
mRNA levels reached their peak before 24 hours of stim- 
ulation. If not otherwise indicated, the MMP-1 levels were 
measured using the ELJSA system, which recognizes 
free/complexed MMPT. 

MMP-1 and TIMP-1 mRNA were found during un- 
stimulated culture in all investigated cell types, although 
the mRNA levels varied over a great range. 8505 C showed 
a basal MMP-I mRNA level 20 times as high as those of 
the HTh 74 cells, 6 X JO 4 times as high as C 643, and 2 X 
10 6 times as high as SW 1736 cells. In thyrocytes, MMP- 
1 mRNA levels were found near the detection limit {Figs. 
3 and 4). 

Generally, when analyzing the noted cell types, the mea- 
sured basal MMP-1 or TIMP-1 mRNA levels correlated 
well with the basal protein expression (MMP-1: r = 0.99, 
p< ,0001;TIMP-l:r= 0.98, p < .002). Corresponding to 
rhe high MMP-1 mRNA level, 8505 C cells secreted ex- 
tremely high levels of MMP- 1 . No MMP- I or T1MP- 1 was 



detected in unstimulated thyrocyte cultures, at any time- 
point examined. All other cell types showed a spontaneous 
MMP-1 and TIMP-1 secretion (Figs. 5 and 6). thyroid- 
derived fibroblasts produced basal TIMP-1 levels of up to 
4 times higher in the four carcinoma cell lines, which se- 
creted nearly the same amounts of basal TIMP-1 protein. 
Nevertheless, TIMP-1 secretion of fibroblasts was found 
at lower levels than expected after TTMP-1 mRNA mea- 
surement in 4 of 5 analyzed patients. The results of the fi- 
broblast cultures from patient five showing a higher TTMP- 
1 expression than those from the 4 other patients (basal 
24 hour: 50 ± 2; PMA 24 hour: 90 ± 6 ng/mL TTMP-1) 
was omitted in Figure 6. 

Comparing the basal amount of free/complexed and 
"IT MP- 1 complexed MMP- 1 after 24 hours of stimulation, 
a significant level of MMP-1 was not complexed with 
TIMP-1 in 8505 C cultures, whereas in fibroblast cultures 
most of the MMP-1 activity was inhibited by TIMP-1. The 
anaplastic carcinoma cell line HTh 74 did not show such 
a great discrepancy between free/complexed and TTMP-1 
complexed MMP-1 level as 8505 C cells (Fig. 7). 

Effects of IL-la on MMP ) and T1MP1 mRNA and 
protein expression 

Experiments were performed to determine whether hu- 
man thyroid epiihelial cells and thyroid carcinoma cell lines 
could produce or increase basal MMP-1 and TIMP-1 se- 
cretion after exposure to various stimuli. The results from 
these stimulation experiments are summarized in Figures 
3 through 5. Generally, there was a delay in protein se- 
cretion level in comparison to the mRNA expression level. 
At the protein level, the cytokine-mediatcd stimulating or 
inhibiting effect is more distinct after 48 hours compared 
with 24 hours, even when the mRNA level had already de^ 
creased after 6 hours. 

1L 1 upregulated MMP-1 mRNA in SW 1736 cells up 
to 100 times and, in rhyroid-derived fibroblasts, up to 12 
times after 24 hours of incubation (Fig. 3). This increased 
mRNA level was accompanied by a significantly enhanced 
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TPO 

Tg 

TSH-R 
Exon 1-4 

11-1 -R1 
11-1 -R2 
IFN-7R 
p75TNFR 
p55TNFR 



FIG. 2. Amplification of thyroid specific and interleukin- receptor mRNA in thyrocytes (1), SW 1736 (2), C 643 (3), HTb 
74 (4) and 8505 C (5) cells using RT-PCR; M f = 100-bp ladder (GIBCO). 



MMP-1 secretion after 48 hours. Furthermore, IL- la in- 
creased MMP-1 mRNA expression in thyrocytes up to 
seven times after 6 hours, but no MiMP-1 protein could be 
detected in thyrocyte cultures. IL-1 had no stimulatory ef- 
fect on MMP-1 mRNA expression in C 643, HTh 74, and 
8505 C cells after 24 hours, although a significant increase 
of MMP-1 secretion was found in HTh-74 and SW 1736 
cells after 48 hours of incubation (Fig. 5). This discrepancy 
may be explained by a possible increase in MMP-1 mRNA 
level after 24 hours of stimulation. The same effect could 
also be observed in the IL-1 stimulated TJMP-1 at the 
mRNA as well as the protein level: the only slight effect 
of IL-1 on T1MP-1 mRNA expression in carcinoma cell 
lines after 24 hours was accompanied by a significant in- 
crease of T1MP-1 secretion in 8505 C and HTh 74 cells 
after 48 hours (Figs. 4 and 6). 

Effects of TNF-a on both MMF-lfTlMP-} mRNA 
and protein expression 

In contrast to IL-1, TNF-a did not stimulate the MMP- 
I and TIMP-1 mRNA and protein levels in all carcinoma 
cell lines and thyrocytes. Only thyroid-derived fibroblasts 
responded wirh a slight upregulation of MMP-1 and T1MP- 
1 mRNA expression after TNF-a stimulation, which was 
not accompanied by an increase of MMP-1 and TIMP-1 
secretion. 



Effects of PMA, and IFN-y on MMP-1 and TIMP-1 
mRNA and protein expression 

PMA was included in our study as a positive control 
because it is known to upregulate or induce both MMP- 

1 and TIMP-1 secretion in various cell types (1,32). In- 
deed, PMA was able to induce or enhance MMP-1 
mRNA levels in all cell types investigated, although the 
detected levels varied to a large extent (Fig. 3). This re- . 
suit is in good correlation with the significantly increased 
MMP-1 protein levels ihat were already detectable after 
24 hours of stimulation (Fig. 5). PMA upregulated T1MP- 

J mRNA levels by up to 20 times in C 643, and up to 

2 times in SW 1736 and HTh 74 cells, fibroblasts and 
thyrocytes, but it did not change the TIMP-1 rnRNA con- 
tent in 8505 cells (Fig. 4). At the protein level, we found 
a significant stimulation of TIMP-1 secretion in C 643 
and HTh 74 cells, as well as in thyroid-derived fibrob- 
lasts (Fig. 6). 

In contrast to PMA, IFN-7 was without effect on stim- 
ulation or downregulation of MMP-1 andTlMP-I mRNA 
or protein in anv of the cell types investigated (Figs 5 and 

6). 

1 he main inhibitor of MMP-1 is TIMP-1, which forms 
1:J stoichiometric complexes with MMP-1, although some 
Other inhibitors can also bind MMP-1. On the other hand, 
TJMP-J can bind other MMP types. 
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FIG. 3. Representative samples of competitive amplified MMP-I mRNA of thyrocytes, thyroid-derived fibroblasts and thy- 
roid carcinoma cell lines without stimulation (control) and after stimulation with 10 U/mL IL-la and 100 U/mL TNF-cr and 
10 ng/mL PMA after 24 hours. Serial dilutions of known amounts of the competitor fragment were coamplified with identi- 
cal aliquots of cDNA. The 560-bp (cDNA) and 478-bp (competitor) PCR products were visualized by agarose gel electrophoresis 
and ethidiumbrorhide staining. The relative concentration of the added competitor was given in arbitrary units (AU) in the 
figure- One AU was defined as the lowest. concentration of the competitor yielding a detectable amplification for MMP-1 
mRNA. The ratio of competitor to cDNA fragments was determined by measuring the intensity of ethidium fluorescence with 
a CCD image sensor and analysis of data. Measured cDNA concentration can expressed in AU. 



DISCUSSION 

Our findings demonstrate for the first time that thyroid 
carcinoma cell lines are able to express MMP-1 and T1MP- 
1 mRNA and protein at significant levels in vitro. The ob- 
servation of spontaneous release of MMP-1 and T1MP-1 
corresponds well with earlier studies covering the secretion 
of these proteins by several carcinoma cell lines (33,34). 

However, in contrast to its clear physiological function 
in extracellular matrix breakdown, the role of MMP-1 in 
tumor growth and metastases is still controversial 
(9-11,35). Recently, Murray et al. (10) demonstrated thai 
MMP-1 is associated with poor prognosis in colorectal can 
cer, and has a prognostic value independent of the Dukes 
stage. Therefore, MMP-'J could be a target for therapeu- 
tic intervention in such tumors. Furthermore, the hypoth- 
esis of whether Or not cancer cells themselves are able io 



produce MMP, or whether cancer cells stimulate the sur- 
rounding stromal celts to secrete MMP in vivo y is disputed. 
MMP-1 mRNA and protein were detected by both in situ 
hybridization and tmmunohistochemistry in stromal as 
well as tumor cells of head, neck, gastric, colorectal, and 
mammary carcinomas (9,10,36,37). In contrast, 
Kameyama (12) demonstrated by in situ hybridization that 
the MMP-1 mRNA was not expressed in the cancer cells 
but in the surrounding fibrous capsules of strongly differ- 
entiated papillary thyroid carcinoma tissue. Flighly differ- 
entiated follicular carcinomas and follicular adenomas 
were depleted for MMP* 1 transcripts. Undifferentiated fol- 
licular)-, papillary, and aggressive anaplastic carcinomas 
that showed poor prognosis and strong tumor invasive and 
metastatic potential and that can be compared in their 
rnotphological, genetic and growth features with undiffer- 
eniiaied thyroid carcinoma cell lines were not included this 
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HO. 4. Competitive TLMP-1 mRNA RT-PCR yielding a 351 -bp (cDNA) and a 274-bp (competitor) PCR product. For far- 
ther details see Figure 3. 



study. However, the missing expression of MMP-1 by nor- 
mal thyrocytes and the spontaneous secretion of this pro- 
tein by highly malignant thyroid carcinoma cell lines, as 
demonstrated in our study, indicate the involvement of 
MMP-1 secretion of transformed thyrocytes in aggressive 
thyroid tumors. 

Although all cell lines analyzed in our study sponta- 
neously secreted MMP-1, we observed marked differences 
in the basal secretion capacity. The highest MMP-1 levels 
were determined in cultures of .8505 C cells. Only 8505 C 
cells expressed TPO and Tg mRNA that may be put down 
to residual differentiated components in the cell line (see 
Materials). However, none of the analyzed cell lines ex- 
pressed TSH-R mRNA. The cell population doubling rimes 
were less than 40 hours. All cell lines had accumulations 
of multiple genetic events. These facts indicate the undif- 
ferentiated pathology of the studied lines. It is well known 
that anaplastic carcinoma cell lines well retain the malig- 
nant characterises of their parental tumors (38^*0). 

Furthermore, we found a distorted proportion berween 
MMP-1 and TIMP-I mRNA/proiein for carcinoma cell 
lines but not for normal thyroid-derived fibroblasts. The 
most disadvantageous constellation berween MMP-1 and 
TfMP-) was found in 8505 C cells. Similar to other stud- 
ies (41), these results suggest Ehe influence of an altered 
MMP/T1MP relation on minor progression. However, it 



should be mentioned that most studies, including the pre- 
sent one, do not take into consideration that a number of 
inhibitors distinct from TIMP-1 may regulate MMP-1 ac- 
tivity. Taking into account that the balance of active en- 
zyme and TlMP-l concentration strongly influence the ex- 
tent of local matrix degradation, a number of studies 
showed unexpectedly high levels of TTMP-1 in malignant 
neoplasms (9,42,43). There is a great discussion as to 
whether the overall expression of MMP-1 and T1MP-1 or 
the amount of noncomplexed MMP-) could be crirical in 
aggressive tumor development. This fact underlines the na- 
ture of tissue breakdown, reflecting the complicated net- 
work of selective and coordinated production of individ- 
ual proteinases and inhibitors under normal and 
pathophysiological conditions. Thus, the invasive and 
metastatic potential of thyroid tumors depends on the lo- 
cal net level of active MlViPs. 

The synthesis of MMP-1 and TIMP-I is influenced by a 
variety ol biochemical stimuli. The recent findings on 
MMP-1 and TIMP-1 gene promoters are useful in under- 
standing the complex mechanisms implied in the regula- 
tion of MMP synthesis modulated by cytokines ar\d tumor 
promoters (34,44,45). The promoter regions contain tu- 
mor promoter responsive elements (TRE) and binding mo- 
tifs for the transcription factor PEA-3, which are recog- 
nized by pi oto- oncogenic transcript ion factors, such as the 
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FIG. 7. Compari between <i) free/complex and (ii) TIMP- 
1 completed MMP-1 levels, and (iii) TIMP-1 levels in super- 

m5T " n ^ m ^ lCd 8505 C 3nd mh 74 ^d thy- 

H KA Cr L f,brob,asts af *' 24 hours using a |i) MMP-1 
xax^!^! rcco S ni2 « ^tal MMP-1 (sec Figure 5). The (ii) 
MMP-1H1MP-1 assay recognizes MMP- IVThMP- 1 complexes 
ie, activated MMP-1 that has subsequently been completed 
with the specific MMP inhibitor TIMP-1. The (iii) TTMP-I 
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fos and /M „ f arni | Y (45-47). IL-1, TNF-«, and PMA up- 
regulate proto-oncogenes like fos and jun, resulting in the 
stimulation of MMP-1 and TIMP-1 (45,48). The action of 
A n ^ iS mcdialed ^ spe C1 fic receptors. In our 
7^L R ^ 1 and We ID, TOF-aR (p75 and p55) 
and IFN-yR m RNAs were demonstrated in all investigated 

and TTMP-1 In nearly all cell types investigated, thus con- 
fining the results of several studies on other epithelial cells 
(reviewed m refs. 1,7,49). In :the majority of experiments, 
we round o concordant expression of MMP ] and T1MP- 
1 aher stimulation, possibly achieved by the coordinated 
r ,0 T^ 0 x nUC,Car transcri P rion 'actors, although MMP- 
1 and TIMP-1 expression can also be independently or even 
reoprocaJJy regulated (1). The effect of TNF-er was not as 
distinct as tn the case of PMA and IL-1, although several 
invest.gators found a pronounced effect of TNF-a partieu- 
arly on TbVlP-J secretion (4,34). In contrast to studies per- 
formed wnh other cell types (363,864,819), IFN-y did not 
mfluence MMP-I and TIMP-1 expression in .hyroid caret 
noma cell ,„«. ) n summary, the .involvement of the in- 
("thyroidal physiological and pathological cytokine m.- 
croenv.ronment m the regulation of MMP- Land TIMP-1 
induction activation and inhibition is strongly suggested 

runhermore, the data demonstrate that regular human 
'hyrocyres did not produce MMP-1, even after powerful 
snmulation with PMA. Investigating other mammahan rp- 
«hrl.al cells, only one study revealed the production of 
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MMP-1 by rabbit corneal cells (21). It is yet not clear 
whether the MMP-1 mRNA detected in thyrocytes is due 
to a low level of constitutive transcription of the MMP-1 

f^^'l^nxr 3 ^ transcri P tion >> a " listing pool of stable 
MMP-1 mRNA, or. in vitro induction of MMP-1 mRNA 
But n seems more likely that residuaWibroblasts contained 
in the purified thyrocyre preparation (< 0.2%) are respon- 
sible for the slightly positive RT-PCR results. Another ex- 
planation could be that thyrocytes are indeed MMP-1 oro- 
ducers, but the ELISA detection system used was not 
sensitive-enough to measure extremely low MMP-1 secre 

Xi CV i ClS */^ h t Crm , 0re ; l , hc discrc P anc y ^tween elevated 
^Mp'| mRNA ,CyC,S pf th ^« and the extremely low 

■ -1 pr0tCm sccret >°n by these cells is difficult to ex- 
plain ^transcriptional regulatory events may be re- 
sponsible for this confounding result. 

Taken together, the present study suggests that the in- 
trathyroidal cytokine microenvironment is involved in the 
regulation of MMP-1 and its inhibitor TIMP-1 in the thy- 
roid, amTrhat both proteins may be secreted by dediffer- 
entiated thyroid carcinoma cells and involved in aggressive 
thyroid tumors in vivo. «t- 
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Expression of embryouic fibroucctin isoform EHIA parallels alpha-smooth 
muscle actin in maturing and diseased kidney. 

Barnes VL . Musa J . Mitchell RJ . Barnes JL . 

Department of Medicine, Division of Nephrology, University of Texas Health Science 
Center, San Antonio, Texas, USA. 

In this study we examined if an association exists between expression of an alternatively 
spliced "embryonic" fibronectin isoform EIHA (Fn-EHIA) and alpha-smooth muscle 
actin (alpha-SMA) in the maturing and adult rat kidney and in two unrelated models of 
glomerular disease, passive accelerated anti-glomerular basement membrane (GBM) 
nephritis and Habu venom (HV)-induced proliferative glomerulonephritis, using 
immunohistochemistry and in situ hybridization. Fn-EHIA and alpha-SMA proteins were 
abundantly expressed in mesangium and. in peri glomerular and peritubular interstitium of 
20-day embryonic and 7-day (D-7) postnatal kidneys in regions of tubule and glomerular 
development Staining was markedly reduced in these structures in maturing juvenile (D- 
14) kidney and was largely lost in adult kidney. Expression of Fn-EIIIA and alpha-SMA 
was reinitiated in the mesangium and the peri glomerular and peritubular interstitium in 
both models and was also observed in glomerular crescents in anti-GBM nephritis. 
Increased expression of Fn-EHIA mRNA by in si tu hybridization corresponded to the 
localization of protein staining. Dual labeling experiments verified co-localization of Fn- 
EHIA and alpha-SMA, showing a strong correlation of staining between location and 
staining intensity during kidney development, maturation, and disease. Expression of 
EIHA mRNA corresponded to protein expression in developing and diseased kidneys and 
was lost in adult kidney. These studies show a recapitulation of the co-expression of Fn- 
EHIA and alpha-SMA in anti-GBM disease and suggest a functional link for these two 
proteins. 
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BMI-1 gene amplification and ovei expression in hematological 
malignancies occur mainly in mantle cell lymphomas. 

£caS, !2rL£, Pinyot M, Puig X , Hernandez L . Hernandez S . Fernandez PL. van 
Lohuizen M ; Colomer D . Campo E . 

The Hernatopathology Section, Laboratory of Anatomic Pathology, Hospital Clinic, 
Institut d'Investigacions Biomediques August Pi i Sunyer (IDIBAPS), University of 
Barcelona, Spain. . • 

The BMI-l gene is a putative oncogene belonging to the Polycornb group family that 
cooperates with c-myc in the generation of mouse lymphomas and seems to participate in 
cell cycle regulation and senescence by acting as a transcriptional repressor of the 
rNJC4a/ARF locus. The BMl-1 gene has been located on chromosome I0pl3, a region 
involved in chromosomal translocations in infant leukemias, and amplified in occasional 
non-Hodgkin's lymphomas (NHLs) and solid tumors. To determine the possible 
alterations of this gene in human malignancies, we have examined 160 
lymphoproliferative disorders, 13 myeloid leukemias, and 89 carcinomas by Southern 
blot analysis and detected BMI-l gene amplification (3- to 7-fold) in 4 of 36 (1 1%) 
mantle cell lymphomas (MCLs) with no alterations in the INK4a/ARF locus. BMI-l and 
pl6iNfC4a mRNA and protein expression were also studied by real-time quantitative 
reverse tTanscription-PCR and Western blot, respectively, in a subset of NHLs. BMI-l 
expression was significantly higher in chronic lymphocytic leukemia and MCL than in 
follicular lymphoma and large B cell lymphoma. The four rumors with gene 
amplification showed significantly higher mRNA levels than other MCLs and NHLs with 
the BMI-l gene in germline configuration. Five additional MCLs also showed very high 
mJINA levels without gene amplification. A good correlation between BMI-l mRNA 
levels and protein expression was observed in all types of lymphomas. No relationship 
was detected between BMI-l and pl6INK4a mRNA levels. These findings suggest that 
BMl-1 gene alterations in human neoplasms are uncommon, but they may contribute to 
tlie pathogenesis in a subset of malignant lymphomas, particularly of mantle cell type. 
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BMI-1 Gene Amplification and Overexpression in Hematological Malignancies 
Occur Mainly in Mantle Cell Lymphomas 1 

Silvia Bea, Frederic Tort, Magda Pinyo), Xavier Puig, Luis Hernandez, Silvia Hernandez, Pedro L. Fernandez, 
Maarten van Lohnizen, Dolors Co)omcr,'and Elias Campo 2 . 

J066 GX Amsterdam. Netherlands /M v. LJ 1 of Molecular Carafes*, The Netherlands Cancer Institute, 



Abstract 

The BMF1 gene is a putalive oncogene belonging to the Polycomb 
group family that cooperates with c-myc in the generation of mouse 
lymphomas and seems lo participate in eel) cycle regulation and senes- 
cence by acting as a transcriptional repressor of the INK4a/ARF Jocos. 
The BMUI gene has been located on chromosome 10p)3, a region involved 
in chromosomal translocations in infant leukemias, and amplified in. 
occasional non-Hodgkin's lymphomas (NHLs) and solid tumors. To de- 
termine the possible alterations of . this gene in human malignancies, we 
have examined J60 lymphoproliferalive disorders, J 3. myeloid leukemias, 
and 89 carcinomas by Southern blot analysis and detected BMJ-I gene 
amplification (3- to 7-fold) in 4 of 36 (11%) mantle cell lymphomas 
(MCLs) with no alterations in (he WX4a/ARF locus. BMI-I and pJ6 WK4 * 
mRNA and protein expression were also studied by reaMime quantitative 
reverse transcript ion-P.CR and Western blot, respectively, in a subset of 
NHLs. BMJ- J expression was significantly higher in chronic lymphocytic 
leukemia and MCL than in follicular lymphoma and large B eel) lym- 
phoma. The four tumors with gene amplification showed significantly 
higher mRNA levels than other MCLs and NHLs with the BMI-1 gene in 
gcrmline configuration. Five additional MCLs also showed very high 
mRNA levels >vitboiit gene amplification. A good correlation between 
BMM mRNA levels and protein expression was observed in al) types of 
lymphomas. No relationship was detected between BMM and p!6 ,KK< * 
mRNA levels. These findings suggest that BMJ-I gene alterations in 
human neoplasms are uncommon, but they may contribute to the patho- 
genesis in a subset of malignant lymphomas, particularly of mantle cell 
type. 



INK4a/ARF locus by acting as an upstream negative regulator of 
pJ6 ,NK4a and P 14/pJ9 ARF gene expression (5). The human BMI-I 
gene has been mapped to chromosome 1 Op 1 3 (6), a region inyolved.in 
chromosomal translocations in infant leukemias (7) and rearrange- 
ments in malignant T cell lymphomas (8, 9). More recently, higb-Ieve, 
DNA amplifications of this region have been found by comparative 
genomic hybridization in NHLs and solid tumors (10, 11). However, 
the possible implication of the BMI-J gene in these alterations and its 
role in the pathogenesis of human tumors is not known. The aim of 
this study was to analyze- the possible BMJ-I gene alterations, and 
expression in a large series of human neoplasms and to determine the ' 
relationship with JNK4a/ARF locus aberrations. 

Materials and Methods 



Introduction 

The BMM 3 gene is a putative oncogene of the PoJycomb group 
originally identified by retroviral insertiona! mutagenesis in Eji-c- 
myc transgenic mice infected with the Moloney murine leukemia 
virus (1, 2). These animals had a rapid development of pre- B cell 
lymphomas showing frequent proviral insertions near the BMJ-I gene. 
This integration resulted in BMM overexpression suggesting n coop- 
erative effect between C-MYCmd BMI-J genes in the development of 
these rumors (3, 4). Recent studies have indicated that the BMI-I gene 
may also participate in cell cycle control and senescence through the 
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Case Selection. A series of 262 human tumors, including 173 hematolog- 
ical malignancies and 89 carcinomas (Table 1), matched normal tissues from 
all carcinomas, 1J samples of normal peripheral mononuclear cells, and 5 - 
reactive lymph nodes and tonsils, were selected based on tbe availability of 
fro2en samples -for molecular analysis. 

DNA Extraction and Southern Blot Analysis. Genomic DNA was ob- 
tained using Proteinase K/RNnse treatment. 15 /ig were digested with fcoRl 
and UindUi restriction eh2ymes (Life Technologies, Inc., Gaiihersburg, MD), 
for Southern blot analysis and hybridized with a 1.5-kb Pst\ fragment of the 
partial BMI-J cDNA (6). 

RNA Extraction and Real-time Quantitative RT-TCU. Total RNA was 
obtained from 67 lymphoid neoplasms (10 CLLs, 27 MCLs, 8 FLs„ and 22 
LCLs) using guanidine/isothiocyanate extraction and cesium/chloride gradient 
centritugation. One u.g of total RNA was transcribed into cDNA using 
MMLV-reverse transcriptase (Life Technologies, Inc.) and random hexamers, 
following manufacturer's directions. Sequences of the BMJ-J and the p?6 
detection probes and primers were designed using the Primer Express program 
(Applied Biosystems, Foster City) as follows: BMI-J sense, y-CfGGTTGC: 
CCATTGACAGC-3'; BMJ-J antisense. 5-CAGAAAATGAATGCGAG- 
CCA-3'; P J6 sense, 5'-CAACGCACCGAATAGTTACGG-3'; pJ6 antisense. 
5'-AACTTCGTCCTCCAGAGTCGC-3'. The probes BMJ-I, y-CAGCTC- 
GCTTCAAGATGGCCGC-3', and pJ6, 5' -CGGAGGCCGATCCACOTGG- 
GTA-3', were labeled with 6-caiboxy-iluorescein as the reporter dye. The 
TaqMan-GAPDH Control Reagents (Applied Biosystcnrs) were used lo am- 
plify and detect the GAPDJ1 gene, as recommended by (he manufacturer. The 
quantitative assay amplified I pm) of cDNA in two to four replicates using the 
primers and probes described above and the standard master mix (Applied 
Biosystems). All reactions were performed in an AB1 PRISM 7700 Sequence 
Detector System (Applied Biosystems). GAPDH. BMl-l, and P 16 ,NK4 " ex- 
pression was related lo a standard curve derived from serial dilutions of Raji 
cDNA. The RUs of BMJ-I and pJ6 ,NX4a expression WC re defmed as the 
mRNA levels of these genes »ormali?xd lo the GADPH expression level in 
each case. 

Protein Analysis. Whole-cell protein extracts were obtained from addi- 
tional fro7.m tissue available in 31 cases (7 CLLs. \2 MCLs, 8 FLs, and 4 
LCLs), loaded onto n 10% SDS-pnJyacrylamide gel. and cJectrobloMed to a 
nitrocellulose membrane (Amersham). Blocked membranes were incubated 
sequentially with the monoclonal antibody BM1-F6 (I?), aniimouse coriju- 
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Table I Hematological malignancies and solid tumor samples analyzed for BMI-I 
gene alterations 



Tissue samples 


No. of cases 


Hematological malignancies 




Hodgkin's disease 


2 


B cell lymphoprolifcralive disorders 




B-Acurc lymphoblastic leukemia 


14 


CLL 


29 


Hairy cell leukemia 


. A 


FL 


15 


MCL 


36 


LCL 


40 


T cell rymphoproliferalive disorders 




T-Acutc lymphoblastic leukemia 


8 


large granular cell leukemia 


4 


Peripheral T-cell lymphoma 


8 


Myeloproliferative disorders 




Acute myeloid leukemia 


7 


Chronic myeloid leukemia 




Solid tumors 




Colon carcinoma 


26 


Breast carcinoma 


29 


Laryngeal squamous cell carcinoma 


34' 


Total 


262 



gated to horseradish peroxidase (Amersham), and delected by enhanced cherm- 
luminescence (Amersham) according to the manufacturer's recommendations. 
Statistics) Analysis. Because of the non-normal distribution of the samples 
. and the small. size of some subsets of tumors, the slatistical evaluation was 
. performed using nonparamelric lests (SPSS, version 9.0). Comparison between 
mRNA expression levels in the different groups of NHLs was performed using 
the Kniskal-Wallis Test, with a P for significance set at 0.05. For differences 
between particular groups, the conservative Bonferroni procedure was per- 
formed, and the P was set at 0.005. The remaining statistical analyses were 
carried out using the Mann- Whitney nonparametric U lest (significance, P 
<0.05). The comparison between BMM and p!6 ,NK4 * quantitative mRNA 
levels was also performed using the Pearson's correlation coefficient. 

KesuJts 

BMJ-J Gene Amplification. The BMJ-J gene was examined by 
Southern Wot in a large series of human tumors and normal samples 
(Table 1). The cDNA probe used in the study delected three £roRl 
fragments of 7.3, 3:8, and 2.6 kb and three HtivS\\\ fragments of 6.2, 
4, and 3.5 kb. BMJ-J gene amplification (3- lo 7- fold) was detected in 
4 of 36 (1 1%) MCLs (Fig. 1). The amplifications were confirmed with 
both restriction enzymes. The amplified MCLs were two Mastoid and 
two typical variants. No amplifications were observed in any of the 
solid tumors when compared with their respective matched non- 
neoplastic mucosa. No BMl-l gene rearrangements were observed in 
any of the samples examined. 

BMM mRNA Expression. To determine Ihe BMJ-1 expression 
pattern in NHL we analyzed BMI-1 mRNA levels by real-lime quan- 
titative RT-PCR in 67 lymphomas ( 1 0 CLLs, 27 MCLs, 8 FLs, and 22 
LCLs), including the four tumors with gene amplification. A distinct 
BMM mRNA expression pattern was observed in the different types 
of lymphomas (Fig. 2; Kruskal- WalJis Test; P < 0.001). The DM1 
mRNA levels in CLLs (mean, 2.2 RU; SD, 1.3) and MCLs with no 
BMJ-J gene amplification (mean, 2.5 RU; SD, 2.3) were significantly 
higher than in FLs (mean, 0.9 RU; SD, 0.8) and LCLs (mean, 0.6 RU; 
SD, 0.-1; Mann- Whitney nonparametric U lest; P < 0.01). The -1 
MCLs with BMJ-J gene amplification showed significantly higher 
levels of expression than all other groups of tumors (mean, 5. J RU; 
SD, J. 6; P < 0.005). In addition, five typical MCLs with no structural 
alterations of ihe gene also showed very high levels of BMl-l mRNA 
expression ranging from 4 lo 9.8 RU, similar lo cases with gene 
amplification (Fig. 2.4). 

BMM Protein Expression. BMl-l protein expression was exam- 
ined by Western b/ot in 3 I tumors (7 CLLs; 12 MCLs. including two 
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tases with BMJ-J gene amplification and 4 cases with mRNA over- 
expression and no structural alteration of the gene; 8 FLs, and 4 LCLs) 
in which additional frozen tissue was available. The monoclonal 
antibody against BMM detected three closely migrating proteins of 
M r 45.000-48,000 (2). The two more slowly migrating bands prob- 
ably represent phosphorylated isofonns of the protein (12). The two 
MCLs wjth gene amplification and three of four cases with mRNA 
overexpression without amplification of the gene showed very high 
levels of protein expression. The remaining MCLs and CLLs showed . 
intermediate levels of protein expression, whereas low- or no-expres- 
sion signals were detected in the LCLs and FLs included in the study 
(Fig. 3). These results indicate that. BMM protein expression in NHL 
is concordant with the mRNA levels observed by real-time quantita- 
tive RT-PCR. 

Relationship between BMJ-J and pJ6 JNK4a Gene Alterations. 

The JNK4a/ARF locus has been recently identified as a downstream 
target of the transcriptional repressing activity of the BMJ-J gene, 
suggesting that this gene may contribute to human neoplasias with 
wild type JNK4/AJIF (5). Most of the lymph ©proliferative disorders 
analyzed in the present study, including the four cases with BMJ-J 
gene amplification, had been previously examined for p53 gene mu- 
tations and JNK4a/ARF locus alterations, including gene deletions, 
mutations, bypermethyjation, and expression (13, 14). The four MCLs 
with BMJ-J gene amplification and mRNA overexpression and the. 
five tumors with BMM mRNA overexpression with no structural 
alterations of the gene showed a wild- type configuration of the 
JNK4a/ARF locus (13). However, one case vrixh BMJ-J gene ampli- 
fication and one case with mRNA overexpression with no alteration of 
Ihe gene showed p53 gene mutations associated with allelic deletions. 

To determine the possible relationship between BMM and 
pl6 1NK4a mRNA expression, pl6 iNK " a mRNA levels were evaluated 
by real-time quantitative RT-PCR in 50 tumors (10 CLLs, 27 MCLs, 
and 13 LCLs), including 6 cases with alterations in the JNK4a/ARF 
locus (2 MCLs and I LCL with pJ6^ 9 gene deletion, 2 LCLs with 
pJ6 promoter hypcrmethylation, and 1 CLL with pJ6 i>3K4s> gene 
mutation), and the 4 lymphomas with BMJ-J amplification. Negative 
or negligible levels of p!6 J>,K4:> were observed in Ihe 6 tumors with 
JNK4a/ARF locus alterations. These cases were not included in the 
comparisons between BMM and pl^* 4 * mRNA expression. The 
pl6 ,NK4a expression levels were relatively similar in the different 
types of tumors. Only LCLs tended to have lower levels of expression, 
but the differences did not reach slatistical significance (Fig. 2B). No 
differences were observed in the p\6 WK ** mRNA levels between 
tumors with BMJ-J gene amplification and overexpression and lym- 
phomas with germ line configuration of the gene. 
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Fig. I. Southern bloi □nalysii of BMl-l C cnr. Four MCLs (MCL*) showed ft Ml- J jjene 
■ amplication (3- fo 7- fold) compared wlih nonneoplastic tissues (/V) 3nd uthci NHLs] "No 
amplifications oi ecne rearrangement were detected in ihe remaining NHLs and raiti- 
nomas included in the itudv. 
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Fig. 2. A, quantitative DM1- 1 mRNA transcript analysis (median and range) using 
real-time RT-PCR in a series of NHLs. MCLs with BMI-J gene amplification (MCL*) 
revealed significantly higher overall DM]- 1 mRNA levels than all other types of NHLs. 
including MCLs with no structural alterations of the gene {P < 0.005). MCLs and CLLs 
expressed significantly higher levels than FLs and LCLs (P < 0.001). Results ore depicted 
as the ratio of absolute BMM:GADPH mRNA ttanscripl numbers (RU). Bars, SD. tf. 
tjuanriutiyc pl6 ,h * K4 ' mRNA transcript analysis (median and range) using real-tiroc 
RT-PCR in a series of NHLs. Expression levels weie relatively similar in the different 
types of tumors. Results arc depicted as the ratio of absolute plG'^'-'GADPH mRNA 
transcript numbers (RU). Bars, SD. 



Discussion . 

In the present study, we have examined a large series of human 
tumors for ihe presence of gene alterations and mRNA expression of 
the BMJ-I gene. Gene amplification was identified in four MCLs. 
These tumors showed significantly higher levels of mRNA and pro- 
lein expression compared with other lymphomas with BMI-I in germ- 
fine configuration. BMI-J expression levels were also highly up- 
rcgulated in a subset of MCLs with no appareni structural alterations 
of the gene. No alterations were detected in any of the different types 
of carcinomas included in the study. BMI-J is considered an oncogene 
belonging to the PoJycomb. group family of genes. These proteins 
mainly act as transcriptional regulator, controlling specific target 
genes involved in development, cell differentiation, proliferation, and 
senescence. Different studies have shown the implication of BMI- i 
overexpression in the development of lymphomas in murine and 
feline animal models (3, A). The findings of the present study indicate 



for the first time that BMI-J gene alterations in human neoplasms are 
an uncommon phenomenon, but they seem to occur mainly in a subset 
of NHLs, particularly of mantle cell type.: 

The human BMJ-I gene has been mapped lo chromosome J Op 13. 
High-level DNA amplifications and gains in this region have been 
identified by comparative genomic hybridization in occasional solid 
rumors and NHLs (10. 1 )). Different chromosomal translocations 
involving the 10pl3 region have also been identified in infant leuke- 
mias and T cell lymph oproliferative disorders (7, 8, 15). Most acute 
Jeukemias with this chromosomal alteration occur in children <I2 
months of age, whereas it seems to be extremely rare in adults. J Op 
translocations in T-cell lymphoproJiferative disorders have been ob- 
served mainly in adult T cell leukemia/lymphomas and occasional 
cutaneous T cell lymphomas. In our study, we did not observe BMf-J 
. rearrangements or amplifications in any of the acute Jeukemias or T 
cell lymphomas. However, all of the acute leukemias in this study 
were diagnosed in patients over 16 years, and no adult T cell leuke- 
mia/lymphomas or cutaneous lymphomas could be included in the 
series. Similarly, high-level DNA amplifications at the 10pl3 region 
have been detected in head and neck carcinomas and other solid 
rumors. Although we found no. evidence for BMI-J gene rearrange- 
ments or amplifications in a substantial set of carcinomas, this does 
not exclude the possibility of increased gene expression or protein 
levels in these tumors. Additional studies are required to elucidate the 
possible involvement of BMJ-J in these particular groups of human 
neoplasms. 

In human hematopoietic cells, BMLI is preferentially expressed in 
primitive CD34+ bone marrow ceJIs, whereas it is negative or very 
low in more mature CD34- cells (16). In peripheral Jymphocyles, and 
particularly in follicular B cells, BM1-1 protein expression has been 
delected in resting cells of the mantle zone, whereas it is down- . 
regulated in proliferating germinal center cells (17, 18). These obser- 
vations indicate that BMM expression in normal hematopoietic cells 
is tightly regulated in relation with cell differentiation in bone marrow 
and antigen-specific response jn peripheral lymphocytes. BM1-1 ex- 
pression in human rumors has not been examined previously. In this 
study, we have demonstrated that BMJ- 1 mRNA and protein expres- 
sion show a distinct pattern in different types of iymphomas. Thus, 
BMM levels were low in LCLs and FLs and significantly higher in 
MCLs and CLLs. These findings suggest lhat BM1-1 expression 
patterns in B cell lymphomas maintain in part the expression profile 
of their normal cell counterparts; because FLs and at least a subgroup . 
of LCLs are considered lymphomas derived from follicular germinal 
center cells, whereas MCLs and CLLs are tumors mainly derived from . 
naive pregerminal center cells. However, the four MCLs with BMI-J 
gene amplificaiion expressed significantly higher mRNA levels than 
all other tumors. In addition, five MCLs with no structural alterations 
of the gene showed high mRNA levels similar to those observed in 
tumors with BMI-J gene amplification, suggesting that other mecha- . 
nisms may be involved in np- regulation of the gene in these lympho- 
mas. Different studies using animal models have shown a dose- 
dependent effect of BMI-I gene expression on skeleton development 
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Fit. 3. We&itrm Mo* analysis of BMM piorcin inN)lL.v The amplified MCL (I7b2-1) 
showed ihr hi E hcji BMI- 1 p.oicin levels, whereas oihcr MCLs and CLLs had intermedin 
levels of opicssinn. Very ton or negative signal was observed in L'Ls and LCLs 
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and lymphomagenesis (J. 3). These observations suggest that the high 
mRNA and protein levels detected in a subset of MCLs may play a 
role in the pathogenesis of these neoplasms. 

Recent studies have identified the JNK4/ARF locus as a down- 
stream target of the BMI-1 transcriptional repressor activity, suggest- 
ing that BM1-1 overexpression may contribute to human neoplasias 
that retain the wild-type JNK4a/ARF locus (5). Interestingly, in our 
study, BMJ-J amplification and overexpression appeared in rumors 
with no alterations in p)6™ Vi4 * and pJ4 ARF genes. However, we could 
not detect differences in the expression levels of pl6 I ^ K4i> in tumors 
with arid without BMJ-J gene alterations. The reasons for this apparent 
discrepancy with experimental observations are not clear. One possi- 
bility may be that genes other than JNK4a/ARFare the main targets of 
BJVIM repressor activity in these tumors. Particularly, different genes 
of the HOX family are regulated by BMM and may also be involved 
in lymphomagenesis (19, 20). 

In conclusion, the findings of this study indicate that BMJ-J gene 
, expression is differentially regulated in B cell lymphomas. Alterations 
. of the gene seem to be ah uncommon phenomenon in human neo- 
plasms, but they may contribute to the pathogenesis in a subset of 
MCLs. Although, BMJ-J gene alterations occurred in tumors with 
wild-type WK4a/ARF locus, the possible cooperation between these 
genes and the oncogenic mechanisms of BMj-1 in human neoplasms 
require additional analysis. 
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Histopathology is insufficient to predict disease progression and clinical outcome in lung adeno- 
carcinoma. Here we show that gene-expression profiles based on microarray analysts can be 
used to predict patient survival in early-stage lung adenocarcinomas. Genes most related to sur- 
vival were identified with univariate Cox analysis. Using either two equivalent but independent 
training and testing sets, or 'leave- one- out' cross-validation analysis with all tumors, a risk index 
based on the top SO genes identified low-risk and high-risk stage I lung adenocarcinomas which 
differed significantly with respect to survival. This risk index was then validated using an inde- 
pendent sample of lung adenocarcinomas that predicted high- and low-risk group*. This index 
included genes not previously associated with survival. The identification of a set of genes that 
predict survival in early-stage lung adenocarcinoma allows delineation of a high-risk group that 
may benefit from adjuvant therapy. 



Lung cancer remains the leading cause of cancer death in indus- 
trialized countries. Most patients with non-small cell lung can- 
cer (NSCLC) present with advanced disease, and despite recent 
advances in multi-modality therapy, the overall 10-year survival 
rate remains a dismal 8-10%'. However, a significant minority of 
patients (-25-30%) with NSCLC have stage I disease and receive 
surgical intervention alone. Although 35-50% of patients with 
stage I disease will relapse within 5 years'**, it is not currently 
possible to identify specific high-risk patients. 

Adenocarcinoma is currently the predominant histological 
subtype of NSCLC (rels. 1,5,6). Although morphological assess- 
ment of lung carcinomas can roughly stratify patients, there is a 
need to identify patients at high risk for recurrent or metastatic 
disease. Preoperative variables' that aflect survival of patients 
with NSCLC have bren identified 7 " ,0 Tumor si2e, vascular inva- 
sion, poor differentiation, high tumor-proliferative index and 
several genetic alterations, including K-ras (refs. 11,12) and p53 
(refs. 10,13) mutations, have prognostic significance. Multiple 
independently assessed genes or gene products have also been 
investigated to better predict patient prognosis in lung can- 
cer'*"'*. Teehnologiei that simultaneously analyze the expression 
of thousands of genes 1 ' can be used to correlate gene-expression 
patterns with numerous clinical patameteis— -including patient 
outcome— to better predict tumor behavior in individual pa- 
tients™. Analyses of lung cancers using array technologies have 
identified subgroups of tumors that differ according to tumor 
type and histological subclasses and, to a lesser extent, survival 
among adenocarcinoma patients' 1 - 77 . Here we correlated gene- 
expression profiles wiih clinical outcome in a cohort of patients 
with lung adenocarcinoma and identified specific genes that 



predict survival among patients with stage I disease. For further 
validation, we also show that the risk index predicted survival in 
3n independent cohort of stage 1 lung adenocarcinomas. 

Hierarchical profile clustering yields three tumor subsets 
Using'oligonucleolide arrays, we generated gene-expression pro- 
files for 86 primary lung adenocarcinomas, including 67 stage I 
and 19 stage 111 tumors, as well as 10 non-neoplastic lung sam- 
ples. Selected sample replicates showed high correlation among 
coefiicients and reliable reproducibility. We determined tran- 
script abundance using a custom algorithm and the data set was 
trimmed of genes expressed at extremely low levels, that is, 
genes were excluded if the measure of their 75th percentile value 
was less than 100. Although potentially resulting in the loss of 
some information, trimming in this manner decreased the possi- 
bility that the clustering algorithm would be strongly influenced 
by genes with little or no expression' in these samples. 
Hierarchical clustering with the resulting 4,966 genes yielded 3 
dusters of tumors (Fig. 1). All 10 non neoplastic samples clus- 
tered tightly together within Cluster 1 (data not shown). We ex- 
amined the relationships between cluster and patient and tumor 
characteristics (f : ig. ] and Supplementary Figure A online). There 
were associations between cluster and stage {P = 0.030) and be- 
tween cluster and differentiation (P= 0.01). Cluster J contained 
the greatest percentage (42.8%) of well differentiated tumors, 
followed by Cluster 2 (27%) and Cluster 3 (4.7%). Cluster 3 con- 
tained the highest percentage of both poorly differentiated 
(4 /.(>%) and stage 111 turnois (4 2. 8%), yet contained 3 (14.3%) 
moderately differentiated ami } (5%) well differential ed stage 1 
tumor. Notably, I J stage I tumors were present in Cluster 3, sug- 



816 



f/arilfc l MtmCINl - VOIUMI R . N'i;Mf:( v f> . ,-.ucui7 ?oo? 



gesting a common gene-expression profile for 
this subset of stage I and stage III tumors. . 

For patients with stage 1 and stage III tumors, 
the average ages were 68.1 and 64.5 years and 
the percentage of smokers was 88.9% and 
89.5%, respectively. Marginally significant as- 
sociations between cluster and smoking his- 
tory were observed (P = 0.06). A significant 
relationship between histopathological classifi- 
cation and cluster was only discern able for 
bronchioloalveolai adenocarcinomas (BAs), 
which were only present in Clusters 1 and 2 
(P = 0.0055) and comprised 35.7% and 12.3% 
of tumors for Clusters 1 and 2, respectively. 

We examined the heterogeneity in gene-ex- 
pression profiles based on the trimmed data. set among normal 
lung samples and stage 1 and stage 111 adenocarcinomas by calcu- 
lating correlation coefficients between all pairs of samples. In 
contrast to norma! lung samples that displayed highly similar 
gene-expression profiles (median correlation 0.9), both stage I 
and III lung tumors demonstrated much greater heterogeneity in 
their expression profiles with lower correlation coefficients (me- 
dian values, 0.82 and 0.79, respectively). 

Northern-blot and immunohiitochenrmtry analyses 
Of the 4,966 genes examined, 967 differed significantly between 
stage 1 and 10 adenocarcinomas, a number in excess of that ex- 
pected by chance alone (248 at alpha level (a) = 0.05). Three 
genes were arbitrarily selected to verify the microarray expression 
data. The mRNA from 20 of the normal lung and tumor samples 
was examined by northein-blot hybridization with probes for in- 
sulin-like growth factor-binding protein 3 (JGFBP'3), cystatin C 
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Fig. 1 Unsupervised classification analysis of lung adenocarcinomas. 3 classes of tumors identi- 
fied by aggtomeratrve hierarchical clustering of gene- expression profiles using the 4,966 expressed 
genes. Patient and histopathological information lor each lung adenocarcinoma case by cluster 
designation and methods tor K-ros 1 2/1 3th-codon mutational status and nuclear p53 protein acr 
cumulation are provided (Supplementary Figure A online). 7N classification denotes information 
regarding patient tumor st2e and nodal involvement. Associations between cluster membership 
and patient or histopathological variables are indicated at significance level (P< 0.05). 



and lactate dehydrogenase A (LDH-A) (Fig. 2a). Two gene probes 
not represented on the microairays were used as controls, includ- 
ing histone H4, a potential index of overall cell proliferation, and 
28S ribosomal RNA, a control for sample loading and transfer. 
The relative amounts of IGFBP3, cystatin C and LDH-A mRNA 
strongly correlated with micr oar ray- based measurements (Fig. 
2b). In both assays, IGFBP3 and LDH-A mRNA levels increased 
from stage I to stage 111 adenocarcinomas and were higher than 
those in normal lung. Cystatin C mRNA levels were more variable 
but relatively greater in normal lung than tumors. These results 
suggest that the oligonucleotide miooarrays provided reliable 
measures of gene expression. The tumors showed slightly greater 
histone H4 expression than the normal lung, likely reflecting in- 
creased proliferation of tumor cells. 

Immunobistochemistry was performed for 1GIBP3, cystatin C 
and HSP-70 to determine whether mRNA ovcrexpiession was re- 
flected by an increase of their corresponding proteins in tumors. 
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Cystatin C 



Cystatin C 



with prominent apical staining (blue react ant staining, anew, upper left). 
Diffuse cytoplasmic HSP-70 immunor eac I ivily (tumor L?7), yet Hrom;>l el- 
ements show no reactivity (upper right). Normal lung parenchyma (lower 
tell) shows cytoplasmic cystatin C immunoreac tivity in alveolar pneumo- 
< ytes (arrow) and intra- alveolar mac rophages but turner (L°0) shows dih 
tuse (ytoplasmic cystatin C immunor eattiviiy with pi eminent apkal 
staining (lower light). Magnification, >?00 



ARTICLES 



b , 



0 20 40 60 80 tOO 



0 2D 40 u eo 100 



1 



■a 



X) 

e 9 



0 20 - 40 60 60 100 



— to* ink 



^3 r 
> 

to 



2D ID 60 60 tOO 




20 10 60 00 100 




■ dublm t 



8 



0 20 40 60 eo too 




v. 



0 20 *0 M BO 100 




- high iifX 



0 20 40 60 60 100 



Time to death (months) " 

Immunoieactivity for both 1GFBP-3 and HSP-70 (Fig. 2 r) was de- 
tected in the cytoplasm of the adenocarcinomas, with little de- 
tectable reactivity in the stromal ox inflammatory cells. Cystatin 
C was delected in alveolar pneumocytes and intra- alveolar 
macrophages in nonneoplastic lung parenchyma and also con- 
sistently in the cytoplasm of neoplastic cells. 

Gene-expression profiles predict survival 

As expected, Kaplan-Meier survival curves (Fig. 3a) and log-iank 
tests indicated poorer survival among stage 111 compared with 
Mage I adenocarcinomas (P = <0.0001). Two statistical ap- 
proaches were used to determine whether gene-expression pio- 
files could ptedict survival using the data set of 4,966 genes. In 
one approach, equal numbeis of randomly assigned singe I and 
Mage 1)1 tumors constituted training (n = 43) and testing (n = 43) 
sets. In the training set, the top 10, 20, 60 or 75 genes were used 
to orate risk indices that were evaluated loi thru association 
with survival using the 50th, 60th 01 70th percentile euloif 
points to categorize patients into high 01 low gioups. The insults 
ive; C similar across cutoff points but the 50 gene risk index had 
the best overall association with suivival in the training set. 



Fig. 3 Gene- expression profiles and patient survival, o. Relationship be- 
tween tumor stage and patient survival (stage 1 and stage 3 differ signifi- 
cantly, P< 0.0001). b, Relationship between the survival. in the 43 test 
samples and their risk assignments based on the 50-gene risk index esti- 
mated in the 43 training samples. The high- and low-risk groups differ sig- 
nificantly (P = 0.024). c, Relationship between patient survival and the risk 
assignments in test samples (in b) conditional for tumor stage. The higb- 
and low-risk stage I groups differ significantly (P ~ 0.028), whereas stage III 
low- and high-risk groups did not (P= 0.634). rf. Relationship between sur- 
vival in the test cases and their risk assignments based on the 86 Teave-one- 
out' cross-validation of the 50-gene risk index. The high- and low-risk 
groups differ significantly (P= 0.0006). e, Relationship between test case f s 
risk assignment and survival (in d) conditional on tumor stage. The high- 
and low- risk stage I lung adenocarcinoma groups differ significantly from 
each other (P= 0.003), whereas low- and high-risk stage III tumors do not. 
f t Relationship between tumor class identified by hierarchical clustering and 
patient survival. Survival for patients in Cluster 3 differed relative to the tu- 
mors in Cluster 2{P= 0.037) and approached significance for Cluster ) and 
2 combined (/>= 0.06). 9 , Analysis of the Michigan- based risk index using 
top cross- validated survival genes identify a low- and high-risk group in an 
independent cohort of 84 Massachusetts- based lung adenocarcinomas that 
are significantly different (P= 0.003). h, Among the 62 stage I lung adeno- 
carcinomas in the Massachusetts sample, the high- and low-risk groups dif- 
fered significantly (P= 0.006). 

Alter conservatively choosing the 60th percentile cutoff point 
from the training set, we then applied this risk index and cutoff 
point to the testing set. The risk index of the top 50 genes cor- 
rectly identified low- and high-risk individuals within the inde- 
pendent testing set (P = 0.024) (Fig. 3b and Supplementary 
Methods online). Notably, )] stage J tumors were included in 
the high-risk subgroup. When this risk assignment was then 
conditionally examined for stage progression (Fig. 3c), low- and 
high-risk groups among stage 1 tumors were found lo differ (P = 
0.028) in their survival. 

Identification of a robust set of survival genes 
Although predictive of patient survival, a single training-testing 
set may not provide the most robust set of genes due to random 
sampling issues. Therefore, a 'leave- one-out' cross-validation ap- 
proach was used to identify genes associated with survival from 
all 86-tumor samples. We first developed a 50-gene risk index in 
each training set, and then applied the risk index to the test case 
held out from the hill set of tumors and assigned the held out 
tumor to the high- or low-risk groups (Fig. 3d). The high and 
low-risk subgroups determined in the test cases differed signifi- 
cantly in their overall survival (p = 0.0006). Among the larger 
group of stage 1 lung adenocarcinomas, the low- risk (r» = 46) and 
high-risk (n = 21) groups had markedly different survival {p = 
0.003) {Fig. 3e). Table 1 lists selected examples of the cumulative 
top 100 genes derived from this cross-validation procedure 
(complete list in Supplementary Table A online). 

It was also noted that many ol the stage I patients in the high- 
risk subgroup (Fig. 3e) were present in Cluster 3 (Fig. 1). 
Kaplan- Meier analysis (fig. 3f) demonstrated a significantly 
worse survival (/■» = 0.037) for patients in Cluster 3 relative to pa- 
tients in Cluster 2 and approaching significance for Cluster ] 
and 2 combined {P = 0.06) This further indicates the important 
relationship between gene- expression profiles and patient sur- 
vival, independent ol disease stage. 

Consistent with previous analyses ol lung adenocarcinomas 73 , 
40% ol stage I ami 57.8% ol stage III tumors had 1 2th or J 3th 
rodon K-r.j5 gene mutations. Those patients with tumors con- 
taining K-w mutations .showed a trend of poorer survival, but 
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Table 1 Selected examples of the top 100 genes from cross- validation 



uene name 


p 


Of. 


r 


% Change in 


Coefficient 


Unigene comment 




\iiurmiii veiiui 


Change in tumor 


(stage 1 versus 


rt-,<,p III 

stage in 


o ' 
P 






tumor t-test) 




stage 1)1 Mest) 




















Apopt osij- relat ed 


CASP4 


0.56 ■ 


—6% 


0.02 


57% 


0.0022 


Caspase 4, apoptosis- 














related cysteine protease 


P63 


9.73E-04 


37% 


0.03 


43% 


0.0010 


Transmembrane protein (63 rVD), 














endoplasmic reticulum/ 














Colgi intermediate compartment 














Cell adhesion and structure 




O.U/t-UO 


126% 


0.11 


55% 


0.0003 


Keratin 7 


LAMB1 


0.14 


-20% 


0.01 


60% 


0.0027 


Laminin, pi 














Cell cycle and growth regulators 
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Growth factor receptor-bound protein 7 
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Vascular endothelial growth factor 


WIN » IUt> 


0.05 


31%* 


0.48 


20%. 


0.0022 


Wingless- type MMTV integration site famiry. 
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Heat-shock 70 kD protein B 
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v-erb-b2 avian erythroblastic leukemia viral 














oncogene homolog 2 


FXYD3 


0.10 


111% 


0.31 


73% 


0.0046 


FXYD domain-containing ion transport 
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Cytochrome P450, subfamily XXIV 
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Steroidogenic acute regulatory protein related 
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Phosphodiesterase ?A 
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Solute carrier famiry 1 (high- affinity aspartate/ 
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Transcription and translation 
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-33% 
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Core promoter element binding protein 
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0.0098 


v-crk avian sarcoma virus CI 1 0 oncogene 
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"table 1 Selected examples of Die cumulative top 100 genes identified using 
training-testing,- cross- validation of all 8 ft lung tumor samples. The percent 
c hange, as well as the direc lion, loi the average values of the 1 0 nonneoplastic 
lung to all tumors, and lor the 67 stage I to the 19 stage III tumors are shown. A 
positive ( oelfictenl [} value is indicative ol a relationship of gene expression to a 



poorer patient outcome. The genes are listed in potential functional categories. 
Ones that were also present in the top 50 survival genes using the 43-lumoi 
1 raining jp! (fig. 3b) are indicated in bold type. Complete listing of the gene 
probe sets and annotated gene and unigene identifiers can be found in the 
.Supplementary Methods. 
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Fig. 4 Gene e*p» fusion patterns ol top survival genes o, Gene- expression patterns de- 
termined using agglomerative hierarchical c luMer ir>y ol the 86 lung adenot an inomas 
against the 100 survival- related genes fTable 1) identified by Ibe training, testing, tross- 
validation analysis, iubst ami ally elevated (fed) or dec reaped (gree.n) e:>pression ol the 
genes is observed in individual tumors. 5orne tumors (blail aiiow and e>panded oiea) 
show extremely elevated e>prpssion ot specilk genes, b. An oudier genee>pies>ion pal- 
tern (>i limes the inlet guar tile range among all samples) is observed lor the cibW'2 and 
fteg~i A genes (top left and right, tespec lively). T he 5 7 PQV and <rk genes (bottom trll and 
light, respectively) ihow a graded pattern ol expression related lo patient survival. O, 
alive; dead (also in r), <, The number ol outliers prr penon identified in the top 100 
grnes plotted by survival distribution. 
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this difference did not reach statistical significance among all 
patients (P - 0.25), between patients within tumor clusters {P = 
0.41) or when analyzed separately among stage I [P= 0.22) and 
stage 1JI.(?= 0.53) patients. Nuclear accumulation of p53 was de- 
tected in ] 7.9% stage I and in 22.2% stage 1)1 tumors. No signifi- 
cant, relationship was observed for p53 staining and patient 
survival, cluster or tumor stage. 

Confirmation using an independent set of adenocarcinomas 
The robustness of our 50-gene risJt index in predicting survival in 
lung adenocarcinomas was tested using oligonucleotide gene-ex- 
pression data obtained from a completely independent 
(Massachusetts-based) sample of 84 lung adenocarcinomas (62 
stage 1, 14 stage II and 8 stage 111; ref. 21, and datasel A at 
www.genome.wi.mit.edu/MPR/Iung). To ensure equivalent 
power for testing and comparability of samples, the criteria for 
including tumors in the analysis were 40% or greater tumor cellu- 
larity, no mixed histology (that is, adenosquamous) and patient 
survival information. To obtain comparative gene- expression 
measures between the two data sets, gene sequences present on 
the U95A and HuGeneFL array were examined, and expression 
data for our top 50 cross-validation genes for all 84 Massachusetts 
samples were obtained and processed" (see. also Supplementary 
Methods online ). When we examined the risk assignment of 
these 84 samples, employing the identical cutoff point used for 
the 86 Michigan-based lung samples, we observed low- and high- 
rislc groups (Fig. 3g;P= 0.003). Notably, among the 62 stage I tu- 
mors, high- and low- risk groups were observed that differed 
significantly {P= 0.006) in their survival (Fig. 3/i).. 

Survival genes hod graded and outlier expression patterns 

A statistical and graphical analysis of the 1 00 survival-related 



genes (table 1) clustered against all 86 tumors revealed individ- 
ual rumors with substantially elevated expression in both a lim- 
ited and larger number of genes (Fig. 4a). Among these genes, we 
observed two distinct patterns of expression related to patient 
survival. One pattern, designated 'outlier', included genes show- 
ing substantially elevated expression (greater than five times the 
interquartile range among al) samples), whereas the other pat- . 
tem, designated 'graded', was characterized by continuously dis- 
tributed expression with patient survival (Fig. 4b). The abB2 and 
ReglA genes are examples of outlier expression patterns and 
SI OOP and ak genes of graded patterns. The number of outliers 
per person in the top 100 genes was identified and plotted ac- 
cording to survival times and events (Fig. 4c). Both stage I and . 
stage 1J1 lung adenocarcinomas showed outlier gene patterns 
and 10 tumors contained 3 or more outlier genes. 

Because gene amplification may result in increased gene ex- 
pression, the nine genes with outlier expression patterns (erfcB2, 
SLC1A6, Wnt 1, MGB1, ReglA, AKAP\2, PACE, CYPZ4, KYNV) 
and one gene with a graded expression pattern {KRT\S) were ex- 
amined using quantitative genomic PCR to evaluate genomic 
copy number (Fig. So). Gene amplification of erbB2 (17ql2) was 
detected in tumor L94, which had the highest erbB2 mRNA ex- 
pression (Fig. 4a). Gene amplification was not detected for any 
of the other seven tested genes in tumor L94, as well as in other 
tumors. The two genes most frequently demonstrating the out- 
lier pattern in these lung adenocarcinomas were KWU and 
CYP24, and were present in 10 and 9 tumors, respectively. 
CYP24 has been described as a gene amplified and overexpressed 
in breast cancer", and these results indicate elevated expression 
in lung adenocarcinoma. 

To determine whether the graded or outlier gene-expiession 
patterns also occur at the protein-expression level, 10 of the 100 
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lig, 5 Gene amplification and protein e>.presiion of survival- related genes. 
o. Analysis ot potential gene ampliiic alion for 9 genei showing outlier eapies- 
ilon patterns in the lung tumors (e/oB?, 5/C/A6, Wnt 1, MC81, ffrglA, 
AKAPM, PACl. OP24 and KYNU) and e>aminecl using quantitative genomic 
PCR. A gene showing graded expression pattern (KK718), and one gene 
(PACf4) with a similar chromosome location as PACl, were used as controls. 
Only frbR? and Reg} A are shown. An esophageal adenocarcinoma with 
known high level genomic ampliiic ation ol abft? was used as a positive corv 
hoi and normal esophagus DNA was used as a negative control (Ctl). PC R 
fragments s>?es woe 343 bp tor GAPDH, 166 bp lor e/bB? and 1?6 hp lor 



iJealA. DNA is from noimal long (N) and tumorfT) Irorn each patient (loi ex- 
ample 137). b, Irnrnonohistoc hemic at analysis ol survival related genes with 
lung adenocarcinoma mic roan ays using 1he tumors horn IhiV study. I he 
transmembrane etbB? protein (lop left) e>pression is substantially increased 
in tumor I 9 4 containing the amplified etbR2 gene (f ig. Aa and b). [> press ion 
ol VtCI (top right) and M00P (bottom lett) was located within the neoplas- 
tic cells and the pattern ol imrnunoreactivity was consistent with the graded 
e>piession pattern demonstrated by their mRNA pioliles. l> pression ol the 
oncogene cik (bottom right) was abundantly e> pressed in neoplastic lung 
(flls. Magnification, x-100 (eibB?); >?00 (Vf CI , 51 OOP and crd). 
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top survival genes (Table ]) for which specific antibodies were 
available were chosen foi immunohistochemical analysis using 
lung-tumor arrays from this study (Fig. 5b), Expression of mem- 
brane eibB2 protein was substantially increased in the erfrB2-am- 
plified tumor 194 and very low levels of expression were present 
in other tumors, consistent with mRNA-expression measure- 
ments (Fig. 4a and b). CDC6 protein expression was also sub- 
stantially higher in tumor L94, consistent with mRNA levels 
(data not shown). Expression of vascular endothelial growth fac- 
tor (VEGF) and Si OOP (Fig. Sb), as well as cytokeratin 1 8 (KRT18), 
cytokeratin 7 (KRT7) and fas-associated death domain (FADD) 
protein (data not shown), xvas located within the lung tumor 
cells and consistent with the graded expression pattern of the 
mRNA profiles. The oncogene crk showed both graded mRNA as 
well as a graded protein-expression pattern with survival, and 
was abundantly expressed in the tumor ceils (Fig. 5b). These re- 
sults indicate that many survival-associated genes are expressed 
at the protein level and demonstrate similar mRNA and protein- 
expression patterns. 

Discussion 

We used several approaches for the analysis of gene- expression 
data related to clinicopalhological variables and patient sur- 
vival. One approach, hierarchical clustering, was used to exam- 
ine similarities among lung adenocarcinomas in their patterns 
of gene expression. Previous studies of lung tumors 31,22 have also 
used this method to describe subclasses. of lung tumors. Here, 
we found three clusters that showed significant differences with 
respect to tumor stage and tumor differentiation. This suggests, 
as expected, that tumors with similar histological features of 
differentiation demonstrate similarities in gene expression. 
This feature also partly underlies the observed statistical associ- 
ation of tumor stage and cluster, as many of.the higher-stage tu- 
mors, often poorly differentiated and previously associated 
with a reduced survival'-' 0 , were located in Cluster 3. Although 
this cluster contained the highest percentage of stage 111 tu- 
mors, it also contained a nearly equal mixture of stage I and 
stage 1)1 tumors and not all tumors were poorly differentiated. 
This indicates that a subset of stage I lung adenocarcinomas 
share gene-expression profiles with higher-stage tumors. 
Notably, 10 of the 1 1 stage I tumors found in Cluster 3 were the 
high-risk stage 1 tumors identified using the risk index in the 
'leave-one- out" cross-validation. 

In contrast to previous analyses of Jung adenocarcinomas 71 - 27 , 
we validated the expression data from the arrays. The strong cor- 
relation of northern-blot analysis and oligonucleoticle-nrray data 
for gene expression in the same samples (Fig. 2b) indicates that 
these studies provide robust gene- expression estimates. 
Immnnohistocbemistry using the S3me tumor samples in tissue 
arrays demonstrates protein expression within the lung tumor 
cells. Together, these studies indicate that many of the genes 
identified using gene-expression profiles are likely relevant to 
lung adenocarcinoma. For example, JGFBPZ gene expression is 
increased in lung adenocarcinoma* (Fig. 2c). IGFBP3 protein 
modulates the autocrine or paracrine effects of insulin-like 
growth /act oi s. elevated IGFBP3 expression is observed in colon 
cancer 6 , and increased serum IGFBP3 is associated with progres- 
sion in breast cancer 7 . Heat-shock protein 70 lHSP-70) is in- 
creased in lung adenocarcinomas ol smokers" and is associated 
with increased metastatic potential in breast cancer 7 *. Increased" 
serum lactate dehydrogenase is correlated with tumor stage and 
tumor burden 11 ", and cyslatin C, a cysteine protease inhibitor ex- 



pressed in human lung cancers 31 , is prognostic in some cancers 32 . 
The decreased expression of this protease inhibitor may affect 
the invasive properties of the tumor cell. 

The cross-validation analytical strategy we used is particularly 
informative for these types of gene-expression analyses for dis- 
ease outcome 33 - 34 , and identification of cross-validated genes with 
a larger tumor cohort may help refine this risk index for use in a 
clinical setting. The gene- expression data also provide opportune 
ties to observe overarching patterns that advance our under- 
standing of associations between genes and disease. For example, 
the top 100 survival genes include those involved in signaling, 
cell cycle and growth, transcription, translation and metabolism. 
Expression of many of these genes is likely a function of increased 
proliferation and metabolism in the more aggressive tumors. 
Some genes, such as erbB2 and ReglA (Kg. 4a and b\ were highly 
overexpressed in a few patients having poor survival. In one 
tumor, the erbB2 gene was amplified (Fig. 5a), demonstrating that 
genomic changes may underlie the overexpression of a subset of 
these outlier genes. Immun ©histochemistry confirmed protein 
overexpression in this patient's tumor (Fig. 5b), Notably, seven of 
the eight outlier genes were not amplified, indicating that other 
, mechanisms underlie the increased mRNA expression of these 
survival-related genes. 

Most genes showed a graded relationship between expression 
and patient survival. Genes such as that encoding VEGF, known 
to be strongly associated with survival in lung cancer 34 - 3 * were 
identified as related to patient survival in oui study. VEGF 
demonstrated a graded expression pattern, as did the S100P and 
ok oncogene (Fig. 5b). S100P is a calcium-regulated protein not 
previously reported in lung cancer. The crk gene, the cellular ho- 
molog of the v crk oncogene, is a member of a family of adaptor 
proteins involved in signal transduction and interacts directly 
with c-jun N-terminal kinase 1 (JNK1) 37 . Although ok has not 
been shown to have a role lung cancer, its role in the rMAP-ki- 
nase pathway, which leads to activation of matrix metallopro- 
teinase secretion and cell invasion 38 , indicates potential 
involvement in the the tumor cell invasion or metastasis of 
some lung adenocarcinomas. Among the many genes identified 
in this study, like crk, that may be causally involved in lung can- 
cer progression (Table 1), some were related to survival in many 
patients, and others in only smaller subsets of patients. This re- 
sult is consistent with the complex molecular architecture of tu- 
mors in general, the heterogeneity of lung adenocarcinomas in 
particular and the multiple mechanisms underlying tumor-cell 
survival, invasion and metastasis 39 . 

Our results demonstrate that a gene-expression risk profile- 
based on the genes most associated with patient survival— can 
distinguish stage I lung adenocarcinomas and differentiate prog- 
noses. The particular genes that define the dusters, or are associ- 
ated with survival, likely reflect the characteristics of the 
particular tumors included in the analysis. Current therapy for 
patients with stage 1 disease usually consists of surgical resection 
without adjuvant treatment, 1 - 3 . Clearly, the identification ol a 
high-risk group among patients with stage 1 disease would lead 
to consideration of additional therapeutic intervention lor this 
group, possibly leading to improved survival of these patients. 

Methods 

Patient population. Sequential patients seen al (he Unk-nsity of Mkhiyan 
Hospital between May t994 and July 2000 lot Mage I oi stage lit tuny ade- 
noouinoma were evaluated lor this study. Consent was received and the 
piojeit wai approved by the local Institutional Review Board. Primary tu- 
mors and adjacent non- neoplastic lung tissue wefe obtained at the time o( 
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surgery. Peripheral portions of resected lung carcinomas were sectioned, 
evaluated by a study pathologist and compared with routine H&E sections 
of the same tumors, and utilized for mRNA isolation. Regions chosen for 
. analysis contained a tumor, cellular ity greater than 70%, no mixed histol- 
ogy, potential metastatic origin, extensive lymphocytic infiltration or fibro- 
sis. Tumors were histopatbologicatly divided into two categories based on 
their growth pattern: bronchial-derived, if they exhibited invasive features 
with architectural destruction, and bronchiploarveplar, if they exhibited 
preservation of the lung architecture. AH stage I patients received only sur- 
gical resection with intrathoracic nodal sampling and no other treatments. 
Stage. Ill patients received surgical resection plus chemotherapy and iadio- 
© therapy. 

u Gene-expression profiling and K-/os mutation analysis. RNA isolation, 
E cRNA synthesis and gene-expression profiling were performed as de- . 
£ scribed 1 *. Details of gene annotation and K-ros mutation analysis are pro 

% vided rn supplementary information. 
,c 

0 Northern-blot analysis. Total cellular RNA (10 ug) was separated in 1 .2% 
^ agarose-formaldehyde gels and vacuum- transferred to Gene Screen Plus 
a (NEN Life Science Products, Boston, Massachusetts): Hybridization condi- 
ro tions and probe labeling were as described 40 . Individual sequence-validated 
> cDNA image clones for human IGfBPH (clone 1407750), tDH-A (clone 

1 2420241), cystatin C (CT53; clone 949938) were from Research Genetics 
5= . (Huntsvitle, Alabama). The human bis tone H4 cDNA and the 75 S ribosomal 
£ RNA 26-mer oligonucleotide probe were prepaied and labeled as de- 

scribed* 0 . 

a 

O Gene-amplification analysis. 1 1 genes were selected lor the analysis of ge- 
O nomk alterations. Primers were designed using PrimerSelect 4.05 Windows 
cr> 32 software (DNASTAR, Madison, Wisconsin), avoiding pseudogenes or po- 
— tential homologous regions. Forward and reverse primers for the genes are 
. .2 provided (Supplementary Methods online). Quantitative genomic- PC R was 
•° then applied and analyzed as described". 
0_ 

o> tmmunohistochemtcal staining. The -H&E-stained slides of oil primary 
3- lung tumors were used to identify the most representative regions ol each 
^ tumor and a tissue microarray (TMA) block was constructed as described 0 . 
t\j Irnmunohislochemistry (IHC) was performed using both rouiine and sec- 
o tions Irom the TMA block as described 7 *. Detailed methods and the con- 
M centralions used for all antibodies are provided in the Supplementary 
® Mel hods. 

Statistical methods. Mests were used to identify differences in mean gene- 
expression levels between comparison groups. Agglomeraiive hierarchical 
~ clustering^ was applied using the average linkage method to investigate 
whether there was evidence for natural groupings of tumor samples based 
on correlations between gene- expression profiles. 7o investigate the ro- 
bustness of the clustering infeience, gene-e>press»on values were per- 
turbed by adding random Gaussian error of magnitude obtained from a 
duplicate sample to each data point and then rec luster ed to determine con- 
cordance in the tumor's class membership. Pearson, and fiber's e*act 
tests were used to assess whether cluster membership was associated with 
physical and genetic characteristics ol the turnois. 

lo determine whether gene-expression profiles weie associated with 
variability in survival times, 2 separate but complementary approaches 
were used. In the first approach, the 86 turnou were landomly assigned to 
equivalent training and testing sets consisting ol equal numbers of stage I 
and 111 tumors in order to validate a novel risk- index tunc lion that captured 
the effect ol many genes at once. In the second approac h, c ross- validation" 
was used to more robustly identify the genes associated with survival. 
Briefly, a 'leave- one- out' cross-validation procedure in which 85 of the 86 
tumors (the training set) was used to identify genes thai were univariate!)' 
associated with survival. The rijk index was defined as a linear combination 
of the gene- expression values for the top genes identified by univariate Cox 
pi oportional- hazard regression modeling* 1 , weighted by then estimated re- 
gression coefficients. Kaplan- Meier survival plots ami ion- rank tests were 
Ihrn used to assess whether the risk-inde> aisinnmenl to high/ low cate- 
gories was validated inihe test set. A more detailed riesc liption is provided 
(i> tippler nentary Methods online). 



Note: Supplementary information is ovoitobte on the Nature Medicine website. 
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Rapid quantitation of proinflammatory and chemoattractant cytokine 
expression in small tissue samples and monocyte-derived dendritic cells: 
validation of a new real-time RT-PCR technology. 

Blaschke V, Reich K , Blaschke S , Zipprich S , Neumann C . 

Department of Dermatology, von-Siebold-Str. 3, D-37075, Goettingen, Germany, 
vb 1 asch@gwdg! d e 

The analysis of cytokine profiles plays a central part in the characterization of disease- 
related inflammatory pathways and the identification of functional properties of immune 
cell subpopulations. Because tissue biopsy samples are too small to allow the detection of 
• cytokine protein, the detection of rnRNA by RT : PCR analysis is often used to investigate 
the cytokine milieu in inflammatory lesions. RT-PCR itselfis a qualitative method, 
indicating the presence or absence of specific transcripts. With the use of internal or 
external standards it may also serve as a quantitative method. The most widely accepted 
method is quantitative competitive RT-PCR, based on internal shortened standards. 
Recently, online real-time PCR has been introduced (LightCycler), which allows 
quantitation in less than 30 min. Here, we have tested its use for the analysis of cytokine, 
gene expression in different experimental in vitro and ex vivo settings. First, we 
compared quantitative competitive RT-PCR with real-time RT-PCR in the quantitation of 
transcription levels of the CD4(+) cell-specific chemoattractant Interleuldn- 16 during the 
maturation of monocyte-derived dendritic cells, and found a good correlation between 
both methods. Second, differences in the amounts of IL-16 mRNA in synovial tissue 
from patients with rheumatoid arthritis and osteoarthritis as assessed by real-time RT- 
PCR paralleled differences in the level of IL-16 protein in the synovial fluid. Finally, we 
employed real-time RT-PCR to study the cutaneous expression of several cytokines 
during experimental immunomodulatory therapy of psoriasis by Interleukin-10, and 
demonstrate that the technique is suitable for pharmaco genomic monitoring. In summary, 
real-time RT-PCR is a sensitive and rapid tool for quantifying mRM A expression even 
.with small quantities of tissue. The results obtained do not differ from those generated by 
quantitative competitive RT-PCR. 

Publication Types: 

• Evaluation Studies 



PMID: 1 1121 549 [PubMed - indexed for MEDLINE] 



74: Apoptosis. 1997;2(6):5 18-28. Related Articles. Links 

^ SpringerLink 

Hmsjisei 



Butyrate-induced reversal of dexamethasone resistance in autonomous rat 
Nb2 lymphoma cells. 

Buckley AR , Krumenacker JS, Buckley DJ LeffMA . Magnusou NS . R cc d JC 
Miyashita T. de Jong G , Gout PW . ' 

Department of Pharmacology and Toxicology, University of North Dakota School of 
Medicine and Health Sciences, Grand Forks 58202-9037, USA. 
abuckley@mail.med.und.nodak.edu 

The parental rat Nb2 lymphoma is a prolactin (PRL)-dependent T cell line. Exposure of a 
PRL-independent subline, Nb2-SFJCD 1 , to sodium butyrate (NaBT) causes transient 
reversal of their growth factor-independent proliferation in association with constitutive 
expression of protooncogenes pim-1 and c-myc. In the present study, we investigated the 
effect of NaBT treatment on the sensitivity of Nb2-SFJCD1 cells to dexamethasone 
(DEX)-induced apoptosis. Pretreatment with NaBT (2 raM, 72 h) partially reversed 
resistance to apoptosis in Nb2-SFJCD1 cells exposed to DEX (100 nM) for 12 h 
assessed by flow cytometric analyses of DNA fragmentation. However, the cytolytic 
effect of DEX was abrogated by PRL in a time- and concentration-dependent manner. 
Evaluation of apoptosis-associated gene expression in NaBT-pre-treated cultures 
incubated with DEX or DEX+PRL indicated that the apoptosis resistance did not stem 
from altered bcl-2 or bax expression. However, there was a strong correlation between 
■ the resistance to DEX-activaled apoptosis and their enhanced expression of pim-1 mRNA 
and protein. The results show that it is possible to reverse DEX-induced apoptosis of Nb2 
pre-T cells and suggest the pim-1 gene product has an important role as a suppressor of 
this process, perhaps functioning as a mediator of PRL action. 
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Alterations iu neuropeptide Y levels and Yl binding sites in the Flinders 
Sensitive Line rats, a genetic animal model of depression. 

Caberlotto L , Jimenez P. Overstreet PH . Hurd YL, Ma the AA , Fuxe K . 

Department of Neuroscience, Karolinska Institute, Stockholm, Sweden. 

Previously, we observed specific alterations of neuropeptide Y (NPY) and Yl receptor 
mRNA expression in discrete regions of the Flinders Sensitive Line rats (FS L), an animal 
model of depression. In order to clarify the correlation between mRNA expression and 
protein content, radioimmunoassay and receptor autoradiography were currently 
performed. In the FSL rats, NPY-like immunorcactivity (NPY-LI) was decreased in the 
. liippocampal CA region, while Yl binding sites were increased; NPY-LI was increased 
in the arcuate nucleus. Fluoxetine treatment elevated NPY-LI in the arcuate and anterior 
cingulate cortex and increased Yl binding sites in the medial amygdala and occipital 
cortex in both strains. No differences were found regarding the Y2 binding sites. The 
results demonstrate a good correlation between NPY peptide and mRNA expression, and 
sustain the possible involvement of NPY and Yl receptors in depression. 
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Neurokinin 1 receptor and relative abundance of the short and long 
isoforrns in the human brain. 

Cabcrtotto L , Hui d YL , Murdock P , Wahlin JP , Mclotto S T Corsi M , CnrtcUi R . 

Department of Biology, Psychiatry CEDD, GlaxoSmitliKIine Medicine Research Centre, 
Verona, Italy. Laura.L.CaberIotto@gskxom 

Substance P exerts its various biochemical effects mainly via interactions through 
neurokinin- 1 receptors (NKl). Recently, the NKl receptor has attracted considerable 
interest for its possible role in a variety of psychiatric disorders including depression and 
anxiety. However, little is known regarding the anatomical distribution of NKl in the 
human central nervous system (CNS). Riboprobe in situ hybridization, quantitative PCR 
and in vitro autoradiography were performed. Highest NKl mRNA levels were localized 
in the locus coeruieus and ventral striatum, while moderate hybridization signals were 
observed in the cerebral cortex (most abundant in the visual cortex), hippocampus and 
different amygdaloid nuclei. Very low levels of the NKl mRNA were detected in the 
cerebellum and thalamus. In view of the existence of a long and short isoform of the NKl 
receptor, it was of interest to assess whether there was a differential distribution of the 
two splice variants in the human CNS and peripheral tissues. A quantitative TaqMan 
PCR analysis showed that the long NKl isoform was the most prevalent throughout the 
human brain, while in peripheral tissues the truncated form was the most represented. 
3H-Substance P autoradiography revealed a good correlation between receptor binding 
sites and NKl mRNA expression throughout the brain, with the highest levels of binding 
in the locus coeruieus. These results provide the anatomical evidence that the NKl 
receptors have a strong association with neuronal systems relevant to mood regulation 
and stress in the human brain, but do not suggest a region-specific role of the two 
isoforrns in tire CNS. 
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Characterization of cycliri D2 expression in human endometrium. 

ChoH), YoonS, LeeE, Hjyajigj), gongjj, Kim J, Yoon BH Lec JK 

. Department of Obstetrics and Gynecology, Samsung Medical Center, Sungkyunkwan 
University School of Medicine, Seoul, South.Korea. dschoi@smc.samsung.co.kr • 

OBJECTIVE: This study was undertaken to investigate cyclin D2 mRNA and protein 
expression in human endometrium during the menstrual cycle. METHODS: Endometrial 
samples were obtained from 15 premenopausal nonpregnant women who had 
hysterectomies for benign gynecologic reasons. They were divided into the following five 
groups according to histologic dating: early proliferative (n = 3), mid to late proliferative 
(n = 3), early secretory (n = 3), mid secretory (n = 3), and late secretory (n = 3). Cyclin 
D2 mRNA and protein expression were analyzed using reverse transcriptase-polymerase 
chain reaction, Western blotting, and immunolustochemistry. RESULTS: Cyclin D2 
mRNA and protein were expressed in human endometrial tissue throughout the menstrual 
cycle. Cyclin D2 mRNA and protein expression of proliferative phase endometrium were 
significantly higher than those of secretory phase endometrium (P <.05). The staining 
intensity of cyclin D2 in proliferative phase endometrium was higher than that in 
secretory phase (P <Q5). Cyclin D2 mRNA level showed good correlation with cyclin 
D2 protein level (R = 0.579, P <.03), and cyclin D2 protein also showed good correlation 
with immunohistochemical staining intensity (R = 0.562, P < 03). CONCLUSION: 
Cyclin D2 was expressed in human endometrium tiiroughout the menstrual cycle. Cyclin 
D2 mRNA and protein were expressed at high levels in proliferative phase endometrium, 
especially in the early proliferative phase, and then decreased in die secretory phase. 
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Human chorionic gonadotrophs beta expression in malignant Barrett's 
oesophagus. 

Couvclard A , Paraf F , Vidaud D , Dubois S , Vidaud M Flcjou JF , Pcgott C 

Service d* Anatomic Pathologique, Hopital Beaujon, 921 18 Clichy cedex, France. 
anne.couvelmd@bjn.ap-hop-paris.fr 

BACKGROUND: Human chorionic gonadotropin beta (hCGbeta) is expressed in 
several non-trophoblastic tumours, and this is usually associated with aggressive 
behaviour. Little is known about hCGbeta expression in Barrett's adenocarcinoma. 
MATERIALS AND METHODS: We determined the hCGbeta profile in a large series of 
surgically resected Barrett's adenocarcinoma (a) at mRN A level using real-time 
quantitative reverse-transcription polymerase chain reaction analysis and (b) at protein 
level using iinmunohistochemistry with a polyclonal antibody and with a monoclonal 
antibody specific for free hCGbeta. We then sought links between the hCGbeta protein 
expression pattern and clinical and pathological parameters, including patient outcome as 
well as vascular endothelial growth factor (VEGF) expression. RESULTS: hCGbeta 
protein expression was observed in 43 of 76 (57%) Barrett's adenocarcinomas. We 
showed a strong correlation between hCGbeta protein abundance and CGB mRNA level. 
We observed a statistical link between hCGbeta protein expression and infiltrative 
tumour type ( P=0.023), perineural neoplastic invasion ( P-0.007) and VEGF protein 
expression ( P=0.016). hCGbeta expression tended to be associated with a poor outcome 
(16% versus 36% survival 8 years after resection). CONCLUSION: Expression of 
hCGbeta correlates with specific infiltrative characteristics and is associated with higher 
VEGF expression. Both molecules may play a co-ordinated role in the development of 
Barrett's adenocarcinomas. 
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Downregulation of ENaC activity and expression by TNF-alpha in alveolar 
epithelial cells. 

Dagcaais A, Frechette R, Yamagata Y . Yamagata T . CarmelJP , Clermont ME 
Brochiero E , Masse C. Bertliiaumc Y . 

Centre de recherche, CHUM-Hotel-Dieu, 3850 St-Urbain, Montreal, Quebec, Canada 
H2W lT7.andre.dagenais.chum@ssss.gouv.qcxa 

Sodium absorption by an amiloride-sensitive channel is the main driving force of lung 
liquid clearance at birth and lung edema clearance in adulthood. In this study, we tested 
whether tumor necrosis factor-alpha (TNF-alpha), a proinflammatory cytokine involved 
in several lung pathologies, could modulate sodium absorption in cultured alveolar 
epithelial cells. We found that TNF-alpha decreased the expression of the alpha-, beta-, 
and gamma-subunits of epithelial sodium channel (ENaC) mRNA to 36, 43, and 16% of 
the controls after 24-h treatment and reduced to 50% the amount of alpha-ENaC protein 
in these cells. There was no impact, however, on alpha(l) and beta(l) Na(+)-K(+)- 
ATPase mRNA expression. Amiloride-sensitive current and ouabain-sensitive Rb(+) 
uptake were reduced, respectively, to 28 and 39% of the controls. A strong correlation 
was found at different TNF-alpha concentrations between the decrease of amiloride- 
sensttive current and alpha-ENaC mRNA expression. All these data show that TNF- 
alpha, a proinflammatory cytokine present during lung infection, has a profound 
influence on tire capacity of alveolar epithelial cells to transport sodium. 
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Involvement of the CCNDI gene in hairy cell leukemia. 

dc Boer CJ , Kluin-Nelcmaris JC Dreef E , Kester MG , Kluin PM, Schmiring E , van 
Krieken Jit . 

Department of Pathology, University of Leiden, The Netherlands. 

BACKGROUND: Previous results suggested increased mRNA expression of CCNDI in 
hairy cell leukemia (HCL). The CCNDI gene is involved in the t(l I;l4)(ql3;q32) 
chromosomal rearrangement, a characteristic abnormality in mantle cell lymphoma 
(MCL). We and others reported that, in contrast to other B-cell lymphomas, almost all 
MCL have over-expression of the CCNDI gene with a good correlation between RNA 
and protein analysis. Recent studies showed that overexpression of the cyclin Dl protein 
can be easily detected by immunohistochemistry (IHC) on formalin-fixed, paraffin 
embedded tissues. PATIENTS AND METHODS: To investigate whether the CCNDI 
gene is involved in HCL, we performed IHC on a series of 22 cases using formalin-fixed 
paraffin embedded splenectomy specimens. For IHC the sections were boiled in citrate 
buffer. The presence of rearrangements within the BCL-1 locus and the CCNDI gene 
was analyzed in 13 of 22 cases by Southern blot analysis using all available break-point 
probes. Expression of CCNDI was analyzed al the mRNA level (Northern blot) and 
protein level (IHC). RESULTS: Overexpression of the cyclin Dl protein using IHC was 
observed in all cases, with strong expression in 5. cases. Pre-existing B- and T-cell areas 
of the spleen did not express significant levels of the cyclin Dl protein. Seven of 9 cases 
analyzed by both IHC and Northern blotting showed overexpression of the CCNDI gene 
with both methods. No genomic abnormalities were observed in any of the 13 cases 
studied by Southern blot analysis. Additionally, no I lql3 abnormalities were detected by 
banding analysis of 19 of 22 cases. CONCLUSIONS: The elevated levels of CCNDI 
mRNA and protein in conjunction with the absence of overt rearrangements within the 
BCL-1 locus distinguish HCL from MCL and other B-cell malignancies. This suggests 
that activation of the CCNDI gene in HCL is due to mechanisms other than chromosomal 
rearrangement. 
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Inhibin aud activiu production and subimit expression in human placental 
cells cultured in vitro. 

Debicve F ; Pampfer S , Thomas KL 

Department of Obstetrics and Gynecological Endocrinology, UniversUe Catholiquede 
Louvain, 1200 Brussels, Belgium. 

Inhibins and activins are dimeric proteins, with each subunit being one of three related 
protein subunits (alpha, betaA or betaB). Tlie mRNA levels of these subunits were 
studied quantitatively during in-vitro differentiation of human cytotrophoblast cells into 
syncytium, using Northern blot analysis and semi-quantitative reverse transcription- 
polymerase chain reaction (RT-PCR) analysis. The corresponding protein concentrations 
were determined by specific enzyme-linked immunosorbent assays for inhibin A, B, pro 
alphaC and activin A in cellular protein extracts and culture medium (n = 5). 
Immunofluorescence studies showed syncytium formation after 48 h. The alpha subunit 
was present before plating and increased at 48 h (P<0.001) while the betaA subunit was 
weak before plating and increased at 24 h. The betaB subunit was not detected. With 
respect to corresponding protein synthesis, inhibin A (alpha + betaA) had risen after 48 h 
in cellular protein extract and after 72 h in culture medium, while activin A (betaA + 
betaB) was detected after 24 h, with no significant variations in culture medium. There 
was a good correlation between inhibin A and alpha subunit expression (r = 0.736, 
PO.001), as well as between activin A and betaA subunit expression (r = 0.755, 
PO.001). This study showed that mRNA expression parallels protein synthesis of inhibin 
and activin in trophoblast cells. Inhibin A synthesis appears to be dependent on alpha 
subunit mRNA expression, rather than on the betaA subunit which controls activin A 
synthesis. This study has also shown that isolated cytotrophoblast cells do not produce 
dimeric inhibin. However, during the transformation of cytotrophoblast cells into 
syncytium, betaA subunit mRNA expression may be an indicator of cell aggregation, 
while alpha subunit mRNA expression may be an indicator of cell fusion. 
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Expression of membranc-type matrix metalloproteinascs 4, 5, and 6 in 
mouse corneas infected with P, aeruginosa. 

Dong Z , Katar M, Alousi S , Berk RS . 

Department of Tmmunology and Microbiology, Wayne State University School of 
Medicine, 540 E. Canfield, Detroit, MI48201, USA, 

PURPOSE: To investigate the expression and regulation of membrane-type matrix 
metalloproteinases (MT-MMPs) 4, 5, and 6 in the mouse corneas infected with 
Pseudomonas aeruginosa. METHODS: C57BL/6J mice were intracorneally infected with 
P. aeruginosa. The expression of MT4-, MT5-, and MT6-MMP was detected at both the 
mRNA arid protein levels by RT-PCR and immunobiot analysis. Immunohistochemical 
staining was performed to. localize the expression of MT4- and MT5-MMP in the mouse 
corneas. RESULTS:. Expression. of MT4- and MT5-MMP was detected in the normal 
(uninfected) cornea by RT-PCR and immunobiot analysis. When infected with P. 
aeruginosa, the corneas showed significant induction of each MT-MMP. Localization of 
MT4- and MT5-MMP revealed that the expression of MT5-MMP was restricted to the 
epithelial tissue in the normal cornea, whereas the induced expression of MT4- and MT5- 
MMP was predominantly in the substantia propria, which contained most of the 
infiltrating cells. MT6 MMP expression was not detected in the uninfected cornea but 
was upregulated in the infected corneas. CONCLUSIONS: Expression of MT4-, MT5-, 
and MT6-MMP was induced in corneas infected with P. aeruginosa. 
Immunohistochemistry showed predominant immunoreactivity of MT4- and MT5-MMP 
in the substantia propria. Previous histologic studies have revealed different patterns of 
inflammatory cell infiltration with an increased number of polymorphonuclear 
neutrophils (PMNs) during the early stage of inflammation and increased macrophages 
during the late stage. These results indicate a good correlation between the 
overexpression of the MT-MMPs in the infected corneas and the inflammatory response- 
that is, leukocyte infiltration-indicating that inflammatory cells such as macrophages and 
PMNs may play a role in the upregulation of MT-MMPs during corneal infection, which 
in turn can cause the destruction of corneal tissue. 
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Assessment of proliferative activity in colorectal carcinomas by 
quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). 

Duchrow M , Hasenievcr S T Broil R , Brucli Ht\ Wind hovel U . 

Surgical Research Laboratory, Surgical Clinic, Medical University of Lubeck, 
Ratzeburger Atlee 160, D-23538 Lubeck, Germany. 

The monoclonal antibody Ki-67 and the isospecific monoclonal antibody MIB-1 are 
routinely used in oncology to assess the proliferation index of tumor cells. A more 
objective and sensitive method is the determination of the of Ki-67 protein-specific 
mRNA by quantitative reverse transcriptase-polymerase chain reaction (RT-PCR). In 25 
resected colorectal adenocarcinomas of different stages and grades we determined 
between 0.2 and 4.4 amol (10(-18) mol) Ki-67 protein-specific mRNA per microgram 
total RNA (median = 0.88 amol). The corresponding Ki-67 indices (expressing the 
percentage of Ki-67/MIB-l positive tumor cells) ranged from 41 to 81% (median = 61%). 
We found a good correlation between Ki-67 index and mRNA expression (r = 0.75), a 
significant correlation between both data and tumor stage (primary tumor, regional nodes, 
metastasis [pTNM] staging classification) (p < 0.001), but not between both data and 
tumor grade. Both Ki-67 indices (p = 0.05) and mRNA levels (p = 0.014) correlated 
significantly to the patients' survival. These results demonstrate that the Ki-67 protein- 
specific quantitative RT-PCR is a useful method for the characterization of tumor cell 
proliferation. 
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Molecular characterisation of carbohydrate digestion and absorption in 
equiue small intestine. 

Dyer J, Fernandcz-Castano Merediz E , Salmon ICS , Proudman CJ , Edwards GB , 
Shirazi-Becchcy SP . 

Department of Veterinary Preclinical Sciences, University of Liverpool, UK. 

Dietary carbohydrates, when digested and absorbed in the small intestine of the horse, ' 
provide a substantial fraction of metabolisable energy. However, if levels in diets exceed 
the capacity of the equine small intestine to digest and absorb them, they reach the 
hindgut, cause alterations in microbial populations and the metabolite products and 
predispose the horse to gastrointestinal diseases. We set out to determine, at the 
molecular level, the mechanisms, properties and the site of expression of carbohydrate 
digestive and absorptive functions of the equine small intestinal brush-border membrane. 
We have demonstrated that the disa.ccharidases sucrase, lactase and maltase are expressed 
diversely along the length of the intestine and D-glucose is transported across the equine 
intestinal brush-border membrane by a high affinity, low capacity, Na+/glucose 
cotransporter type 1 isoform (SGLT1). The highest rate of transport is in duodenum > 
jejunum > ileum. We have cloned and sequenced the cDNA encoding equine SGLT1 and 
alignment with SGLT1 of other species indicates 85-89% homology at the nucleotide and 
84-87% identity at die amino acid levels. We have shown that there is a good correlation 
between levels of functional SGLT1 protein and SGLT1 mRNA abundance along the 
length of the small intestine. This indicates that the major site of glucose absorption in 
horses maintained on conventional grass-based diets is in the proximal intestine, and the 
expression of equine intestinal SGLTl along the proximal to distal axis of the intestine is 
regulated at the level of mRNA abundance. The data presented in this paper are the first 
to provide information on the capacity of the equine intestine to digest and absorb soluble 
carbohydrates and has implications for a better feed management, pharmaceutical 
intervention and for dietary supplementation in horses following intestinal resection. 



PMID: 121 17106 [PubMed indexed for MEDLfNE] 



r 

"". 159jJJminunol : 2002 Apr 1;168(7):3 181-7. Related Articles. Links 



: #.fiREEjfuJj rext3orf Weijt ;V 



Suppressors of cytokine signaling proteins are differentially expressed in 
TU1 and Th2 cells: implications for Th cell lineage commitment and 
maintenance, 

Egwuagu CE , Yu CR, Zhang M IWahdi RM , Kim SJ. GcryJ. 

Laboratory o f Immunology, National Eye Institute, National Institutes of Health, 
Bethesda, MD 20892, USA. emeka@heiix.niJi.gov 

Positive regulatory factors induced by IL-12/STAT4 and IL-4/STAT6 signaling during T 
cell development contribute to polarized patterns of cytokine expression manifested by 
differentiated Th cells. These two critical and antagonistic signaling pathways are under 
negative feedback regulation by a multimember family of intracellular proteins called 
suppressor of cytokine signaling (SOCS). However, it is not known whether these 
negative regulatory factors also modulate Thl/Th2 lineage commitment and maintenance. 
We show here that CD4(+) naive T cells constitutive^ express low levels of SOCS1 
SOCS2, and SOCS3 mRNAs. These mRNAs and their proteins increase significantly in 
nonpolarized Th cells after activation by TCR signaling.. We further show that 
differentiation into Thl or Th2 phenotype is accompanied by preferential expression of 
distinct SOCS rnRNA transcripts and proteins. SOCS1 expression is 5-fold higher in Thl 
than in Th2 cells, whereas Th2 cells contain 23-fold higher levels of SOCS3. We also 
demonstrate that IL-12-induced STAT4 activation is inhibited inTh2 cells that express 
high levels of SOCS3 whereas IL-4/STAT6 signaling is constitutively activated in Th2 
cells, but not Till cells, with high SOCS1 expression. These results suggest that mutually 
exclusive use of STAT4 and STAT6 signaling pathways by differentiated Th cells may 
derive in part, from SOCS3- or SOCS I -mediated repression of IL-12/STAT4- or IL- 
4/STAT6 signaling in Th2 and Thl cells, respectively. Given the strong correlation 
between distinct patterns of SOCS expression and differentiation into the Thl or Th2 
phenotype, SOCS 1 and SOCS 3 proteins are therefore Th lineage markers that can serve 
as therapeutic targets for immune modulation therapy! 
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Intravitreal invading cells contribute to vitreal cytokine milieu in 
proliferative vitreorelinopathy. 

El-Ghrably IA , Dua HS , Orr GM, Fischer D , Tigbe PJ . 

Larry A Donoso Laboratory for Eye Research, Department of Ophthalmology, University 
of Nottingham, UK. 

AIM: To examine the contribution of infiltrating cells in the local production of cytokines 
within the vitreous of patients with proliferative vitreoretinopathy (PVR), METHODS: 
The presence of mRNA coding for IL-6, IL-8, IL-lbeta, IL-1 alpha, TNFalpha, 
IFNgamma, IL-12, and HPRT was investigated in 25 vitreous samples from patients with 
PVR, 1 1 vitreous samples from patients with retinal detachment (RD) not complicated by 
PVR, and 10 vitreous samples from patients with macular hole (MH). A quantitative 
reverse transcriptase polymerase chain reaction (RT-PCR) using an internal competitor 
was used to investigate these samples. From these samples, 15 PVR, & RD, and 8 MH 
were analysed for the' protein levels of the same cytokines using eniyme linked 
immunosorbent assay (ELISA). Spearman correlation was used to test any association 
between mRNA and cytokine protein levelspas an indicator of the contribution these cells 
make to the intravitreal cytokine milieu. RESULTS: A strong correlation was found 
between mRNA and their respective cytokine levels ;(protein products) for IL-6, IL-8, 1L- 
1 beta, IL-1 alpha, TNFalpha, IFNgamma (Spearman r = 0:83, 0.73, 0.67, 0.91 , 0.73, and 
0.73 respectively), but not for IL-12. The median levels of IL-6, IL-8, IL-lbeta, and 
IFNgamma mRNA and their respective cytokines were significantly higher (p <0.05) in 
patients with PVR than in those with macular' hole. There was no statistically significant 
difference in the median levels of IL-1 alpha mRNA between PVR and MH but the 
cytokine IL-1 alpha was detected at a significantly higher level in PVR compared with 
MH patients. Between PVR and RD patients, there was no statistically significant 
difference in mRNA levels for all the investigated cytokines (p >0.05) except for IL-6 
where there was a statistical significance (p= 0.038). In contrast, the median levels of IL- 
6, IL-8, and IL-lbeta cytokines were significantly higher (p <0.05) in patients with PVR 
than in those with RD, whereas for IL- 1 alpha and IFNgamma no significant statistical 
difference was detected between PVR and RD patients (p >0.05). When results of RD 
and MH patients were compared, a statistical difference was only detected in mRNA 
levels of INFgamma (p - 0.008). However, no difference was detected for INFgamma 
(protein product) or for any of the other cytokines between RD and MH patients.^ 
CONCLUSION: Levels of both protein and mRNA encoding IL-6, IL-8, IL-lbeta, and 
IFNgamma is significantly increased in vitreous samples from patients with PVR. The 
strong correlation between ELISA detectable cytokines (protein products) and their 
respective mRNA levels suggest that intravitreal, invasive cells are the major source of 
these cytokines, with the exception of 1L-I2. Cells invading the vitreous do not appear to 
locally produce IL-12 mRNA. This would appear to implicate cells peripheral to the 



vitreal mass as the major source of this cytokine. 
PMID: 1 1264138 [PubMed - indexed for MEDLINE] 
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Modulation of the glutamatergic receptors (AMPA and NMD A) and of glutamate 
vesicular transporter 2 in the rat facial nucleus after axotomy. 

BleoreJL, VassiasJ, Vidal PP . de Wacle C 

P. NRS 7 ( 2S S p" P ^^ E L A I 060 ' CentrC Universitaire de * Saints-Peres, 45 rue des Saints- 
Peres, 75270 Paris Cedex 06, France. 

Facia! nerve axotomy is a good model for studying neuronal plasticity and regeneration in the 
penpheral nervous system. We investigated in the rat the effect of axotomy on the different 
subumts of excitatory glutamatergic AMPA (GLuRl-4), NMDA (NR1, NR2A-D) receptors post- 
synaptic density 95, vesicular gmtamate transporter 2, beta catenin andcadherin rnRNA levels 
Tt, u^T"^ ?° n ^^y^ U 3, 8, 3Q and 60 days after facial nerve axotomy by in 
situ hybridization and immunohistofluorescence. mRNAs coding for the GLuR2-4 NR1 NR2A 
fl v^ U rf f g,Utam k ater e ic rece P tors ^d for post-synaptic density 95, were les's abundant after 
axotomy The decrease began as early as 1 or 3 days after axotomy; the mRNAs levels were 
bwest 8 days post-lesion, and returned to. normal or near normal 60 days after the lesion. The 
NR2C subunit mRNAs were not detected in either lesioned or intact facial nuclei 
Inun^unohistochemistry using specific antibodies against GLuR2-3 submits and against NR1 
confirmed this down-regulation. There was also a large decrease in vesicular glutamate transporter 
2 immunostaining ,n the^otomized facial nuclei at early stages following facial nerve section In 
contrast^ no decrease of NR2A subunit and of post-synaptic density 95 could be detected at any 
time following the ]es,on."beta Catenin and cadherin immunoreactivity pattern changed around the 
i S ' ? aC !?l rao i oneuron ^ da y 3 after axotomy, and then, tends to recover atday post- 
lesion 60 days. Therefore, our results suggest a high correlation between restoration of 
nerve/muscle synaptic contact, synaptic structure and function in facial nuclei. To investigate the 
mechanisms involved ,n the change of expression of these proteins following axotomy, the facial 
nerve was^edwjj tetrodotoxin for 8 days. The blockade of action potential significantly 
decreased GLuR2-3, NRland NR2A mRNAs in the ipsilateral facial nuclei. Thus axotomy- 
induced changes in rnRNA abundance seemed to depend partly on disruption of activity. 

PMID: 16182453 [PubMed - in process] 



I 



372:FreeRadicRes.l996Jun;24(6):451-9. Related****.' Links 

Altered levels of scavenging enzymes in embryos subjected to a diabetic 
environment. J uul,ul 

EorsbergH, BorgLA, CagHer^/EnkssMlM 

Department of Medical Cell Biology, University of Uppsala, Sweden. 

Maternal diabetes during pregnancy is associated with an increased rate of congenital 
maJformafons m the offspring. The exact molecular etiology of the disfcroeT 
embryogencs.s „ unknown, but an involvement of radical oxygen specS ihthe 

otZcoUul^ r &ee f 7 gen rad ' Cals 3nd the antioxWa " t defence mechanisms 
m vivo, or high glucose in vitro alters the expression of the free oxygen radical 
scaveng lng enzymes superoxide dismutase (CuZnSOD and MnSOD) catahS and 

nZTZ Sit ^7 ^ ^ dUring ,3te agenesis, wi^dtd Cng of 
normal and d.abet.c rats on gestational days 1 1 and 12, and also evaluated day-1 1 ' 
embryos after a 48 hour culture period in 10 mM or 50 mM glucose concentration Both 

7^T^r ^ 8,U T CU,tUrC CWBcd re ^ dati0n i-^ed r e 

of congenital malformat.ons ,n the embryos. The .CuZnSOD and MnSOD enzymes were 

exprSe 2 Xv ^ ' ' ^ CuZ " S ° D > MnSOD and -ta.ase wT 

IZZlti bv Z f T T ""S MafcnS of MnSOD transcripts when 

chal enged by a d ia bet,c miheu. There was a good correlation between mRNA protein 

^SS^S^f?^ regU ' ati0n ° f ^ — primS a 'the 
fn" ZsXn^ ^ " ^ ^ ^ ^ <™tratu>n in Vitro 

TLZT a expression, concomitant with increased total SOD activity 

and a tentative decrease ,n catalase expression and activity in the embryos These 
find ing5 sup rt the notion of enhanced oxidative stfess y ^ ^se 

agent in diabetic teratogenesis. euoiogic 
PMID: 8804988 [PubMed - indexed for MEDLINE] 
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Induction of the estrogen receptor by growth hormone and glucocorticoid 
substitution in primary cultures of rat hepatocytes. 

Frey schuss B , Stavrcus Evers A , Sahtin L Eriksson H . 

Department of Reproductive Endocrinology, Karolinska Hospital, Stockholm, Sweden. 

Hepatic estrogen receptors (ER) mediate estrogenic effects on mammalian liver 
metabolism and are thereby involved in the regulation of important 
physiological/pathological processes, such as coagulation, atherosclerosis and 
hypertension. The regulation of the formation of the ER in primary cultures of rat 
hepatocytes was studied by assaying ER and ER mRNA under different endocrine 
conditions. The ER concentration was measured using two different methods, a ligand- 
binding technique and an ER enzyme immunoassay. The results obtained by the two 
methods showed good correlation, and linear regression analysis gave a correlation 
coefficient of 0.95. ER concentrations fell to low steady state levels within 16 h after 
establishing the cell culture and remained low in the absence of hormonal substitution 
IJpon medium supplementation with pituitary GH and the glucocorticoid dexamethasone 
(UHX) in combination, the ER concentration increased 6-fold from 4 2 +/- 1 0 to 25 8 +/ 
7.0 fmol/mg cytosolic protein. ER mRNA was measured by solution hybridization ' 
Substitution with GH and DEX in combination increased ER mRNA to 210 +/- 14% of 
control levels. No effect on ER mRNA stability was seen after hormone treatment It is 
concluded that the regulatory effects of GH and DEX on the hepatic ER in this in vitro 
system are very similar- to the effects of these hormones under in vivo conditions The 
inducible expression of the ER has never before, to our knowledge, been demonstrated in 
any mammalian liver cell culture system. 

PM1D: 8404593 [PubMed - indexed for MEDLiNE] 
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Cyclin Dl-ncgative mantle cell lymphoma: a clinicopathologic study based on 
gene expression profiling. s y on 

cta.cal pMhoIopc, and genelic featnres of 6 cases of cycli ^Dl- Mu ™» e MCL AllT/« 

S speS r Lrr .1 °T ■ US, ° n ; ° f CyCHn D, - ne g alive MCL do exist and are part of 
Pathol of S: Up - r ^ lat '°" ° f C > cl '» D2 or D3. m ay substitute for cyclin Dl in the" 
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Oxytocin receptors ia boviue cervix: distribution and gene expression 
during the estrous cycle. 

EtjchjAR, IvelTR, FieldsPA, Chang SM. Fielrkjvrj 

^"^^ 

Oxytocia (OT) receptor (OTR) concentrations were determined in the cervix of 
nonpregnant cows on cycle Days 0,3, 7-8, 17,and 19 (n = 3-4 cows each day)- HmOT 
was used as the labeled ligand. Mucosal and muscle layers of the cervix werlal o 
analyzed separately for both ligand binding and expression of the OTRgene „ ng a 
newly developed RNase protection assay (RAP). Cellular loca.izat.on of OTR protein 
was determined by .rnmunohistochemistry. All regions of cervix from cows at LLs had 
ugh conccntraaons of OTR; in the luteal phase, all were sharply down-regula 5^ 

0^ S iT C ° S ,a ?f *? ab ° Ut 3 °- f ° Id -ncenu-ationl than the muscle layer. 

OTR mRNA was read.ly detected by RAP in the mucosa from estrous cows while much 
weaker s.gna.s were found in the muscle. On Days 7- 1 7, the OTR mRNA sisals Too* 
mucosa and musclewere very faint or undetectable. Thus, there was a goolcorreTation 

S vSr^rl^ eXPreSSi9n ' WhiGh SUggeStS that OTR concentrations 
are ma-nly regulated at the transcnpfonal level. The epithelial cells at the luminal surface 

feScS? T Pri . nC l Pal ° f immUn ° r ea " ive ° TR = ™ scle showed 
E2 ^ f I 1 ® 1 * 1 *' PreVi ° USly ' ° T W3S f ° Und t0 Stimu,ate P-staglandin (PC) 
E2 output m vitro ,„ bovine cervical tissues. Since PGE2 is capable of softening the 

0^1ToSeT gS SUggeSt °7 ^ haVC 3 n ° Vd P^otogM function to cause 
sottening of the bovine cervix mediated by the release of PGE2. 

PMID: 8835394 [PubMed - indexed for MEDLINE] 



26:r l el , a ™ Res " 2005 ^b; 1 5(1): 15-20. Related Articles, Links 

Silencing of the thrombomodulin gene in human malignant melanoma. . 

fWaJ, Kaucda A, Umcbavashi V 7 OtsukaF, Sugimura T . UshijjmaT 

Carcinogenesis Division, National Cancer Center Research Institute, Tokyo, Japan. 

The loss of thrombomodulin (TM) expression is associated with rumour growth 
inliltration and lymph node metastasis in human tumours. In melanoma ceil lines TM is 
reported to mediate cell adhesion, and its introduction into TM-negative melanoma cell 
lines suppresses their growth. In this study, we analysed TM expression in surgical 
melanoma specunens arid the role of its promoter mediation in the loss of its 
expression. In 1 5 (75%) of the 20 specimens (five from a primary site and 1 5 from 
metastatic sites), melanoma cells lacked TM immunoreactivity. Methylation of the TM 
promoter region was detected in 10 (67%) of the 15 TM-negative specimens by 

nf7hT C xK7° ,ymeraSe Chai " rCaCti0n ' whereas "Ration was detected in two 
(40/.) of the five TM-pos.tive specimens. In cell lines, complete methylation of the TM 
promoter CpG island was detected in six (46%) of 13 melanoma cell lines, whereas no 
methylat.pn was detected in two cultured normal melanocytes. There was a K ood 
correlation between the methylated status of the CpG island and the loss of TM 
messenger RNA (mRNA) expression. Treatment of melanoma cell lines with a 
demethylatmg agent, 5-aza-2'-deoxycvtidine, induced demethylation of the promoter 
CpG island and the restoration of mRNA and protein expression. These findings suggest 
*at most human melanomas lack TM expression, and that methylation of the promoter 
CpO island is one of the mechanisms responsible. 
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A sampling of the yeast protcome. 

FutcfacrB, Latter Gl, Monardo P . McLaughlin at : Gari els JT . 

Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 1 1724 USA 
futcher@cshl.org ' ' 

In this study, we examined yeast proteins by two-dimensional (2D) gel electrophoresis 
and gathered qualitative information from about 1,400 spots. We found that there is an 
enormous range of protein abundance and, for identified spots, a good correlation 
between protein abundance, mRNA abundance, and codon bias. For each molecule of 
well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethanol media. Protein turnover was 
examined and found to be insignificant for abundant proteins. Some phosphoproteins 
were ident.fied. The behavior of proteins in differential centrifugation experiments was 
examined. Such experiments with 2D gels can give a global view of the yeast proteome. 
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In tbis study, >ve examined yeast proteins by two-dimensional (2D) gel electrophoresis and gathered quan- 
titative information from about 1,400 spots. We found that there is an enormous range of protein abundance 
and, for identified spots, a good correlation between protein abundance, mRNA abundance, and codon bias. 
For each molecule of well-translated mRNA, there were about 4,000 molecules of protein. The relative 
abundance of proteins was measured in glucose and ethano) media. Protein turnover was examined and found 
to be insignificant for abundant proteins. Some phosphoproteins were identified. The behavior of proteins in 
differentia) centnfugation experiments was examined. Such experiments with 2D gels can give a global view or 
the yeast proteome. 



The sequence of the yeast genome has been determined (9). 
More recently, the number of mRNA molecules for each ex- 
pressed gene has been measured (27, 30). The next logical level 
of analysis is that of the expressed set of proteins. We have 
begun to analyze the yeast proteome bv using two-dimensional 
(2D) gels. 

2D gel electrophoresis separates proteins according 16 iso- 
electric point in one. dimension and molecular weight in the 
other dimension (21 \ allowing resolution of thousands of pro- 
teins on a single gel. Although modern imaging 3nd computing 
techniques can extract quantitative data for each of the spots in 
a 2D gel, there are only a few cases in which quantitative data 
have been gathered from 2D gels. 2D gel electrophoresis is 
almost unique in its ability to examine biological responses 
over thousands of proteins simultaneously and should there- 
fore allow us a relatively comprehensive view of cellular me- 
tabolism. 

We and others have worked toward assembling a yeast pro- 
tein database consisting of a collection of identified spots in 2D 
gels and of data on each of these spots under various condi- 
tions (2, 7, 8, 10, 23, 25). These data could then be used in 
analyzing a protein or a metabolic process. Saccharomyces 
certvisiae is a good organism for this approach since it has a 
well-understood physiology as well as a large number of mu- 
tants, and its genome has been sequenced. Given the sequence 
and the relative lack of in irons in S. cerevisiae, it is easy to 
predict the sequence of the primary protein product of most 
genes. This aids tremendously in identifying these proteins on 
2D gels. 

There are. three pillars on which such a database rests: (i) 
visualization of many protein spots simultaneously, (ii) quan- 
tification of the protein in each spot, and (iii) identification of 
the gene product for each spot. Our first efforts at visualization 
and identification for S. cerevisiae have been described else- 
where (7, 8). Heic we describe quantitative data for these 
proteins under a variety of experimental conditions. 

MATERIALS AND METHODS 
Strains ami inrJia. S. (ereviwte W303 \MATz* <i<ie2*l his3-J!,15 lcu?-3, 1)2 
irph] ura3-i atnl-JOO) was used (26). - Mel YNO (yca>i nilrotren base) medium 
was 1.7 g of YNB (Dilco) per liter. 5 e of ammonium sulfate per liter, and 
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adenine, uracil, and all amino acids except methionine; -Mel.-Cys YNB me- 
dium was the same but without methionine or cysteine. Medium was supple- 
mented with 27c glucose (for most experiments) or with 2% elhanol (fOT ethano) 
experiments). Low-phosphate YHPD was described by Warner (28). 

Isoloptc labeling of yeas! and preparation of cell extracts.- Yeast strains wctc 
labeled and proteins weje extracted as described by Garrets et al. (7, 8). Briefly, 
cells were grown to 5 X 10* cells per ml. at 3ft D C; 1 ml of culture was transferred 
to a fresh tube, and 03 mCi of ["^methionine (e.g.. Express protein labeling 
mix; New England Nuclear) was added to this J -ml culture. The cells were 
incubated for a further 10 to J 5 min and then transferred to a 1.5- mi microcen- 
trifuge tube, chilled on ice, and harvested by ccnirifugation. The supernatant was 
removed, and the cell pellet was resuspended in ]00 uJ of lysis butler (20 mM 
. Tm-HCI |pH 7.6], 10 mM NaF, 10 mM sodium pyrophosphate, 0.5 mM EDTA, 
0.3% dcoxycholate: just before use, phenyl me thy Isulfonyl fluoride was added to 
1 mM, leupeplin was added to 1 pg/ml, pepstalin was added to J jig/ml. tosyl- 
sutfonyl phenylalanyl chloromethyl ketone was added to 10 ng/ml, and soybean 
trypsin inhibitor was added to 10 jig/ml). 

The resuspended cells were transferred to a screw-cap J.5-ml polypropylene 
lube containing 0.28 g of glass beads (0.5- mm diameter; Biospec Products) or 
0.40 g ofzirconia beads (Oi-mm diameter; Biospec Products). After the cap was 
secured, the lube was inserted into a MiniBcadbeater 3 (Biospec Products) and 
shaken a I medium high speed at 4"C for I min. Breakage was typically 75%. 
Tubes were then spun in a microcentrifuge Jot 10 s at 5,000 X g at 4"C. 

With a very fine pipe lie tip, liquid was withdrawn from the beads and trans- 
ferred to a piechilled 1 5-m] tube containing 7 u,l of DNase I (0J>.mg/ro); Cooper 
pipduct no. 6330)-RNase A {0.25 mg/ml; Cooper product no.5679)-Mg (50 mM 
MgCK) mix. Typically 70 u.1 of liquid was recovered. The mixture was incubated 
on ice for 10 min to allow the RNase and DNase to work. 

Nest, 75 u.1 of 2X dSDS (2X dSDS is 0,6% sodium dodecyt sulfate |SDSJ, 2% 
mercapioethanol, and 0.1 M Tris-HCl |pH RJ) was added. The tube was plunged 
into boiling water, incubated for 1 min, and Ihcn plunged Into ice. After cooling, 
the tube was ccnlrifnged ot 4°C for 3 min at 1-1.000 X g. The supernatant was 
transferred lo a ftcsh tube and dozen at -70°C. About 5 u.1 of this supernatant 
was used for each 2D gcj. 

2D pDtyacrvlamide gels. 2D gels were made and run as described elsewhere 
(6-8). 

Image analysis of the ptls. The Quest II software system was used for quan- 
titaiive image analysis (20, 22). Two techniques were used to collect quantitative 
data (or analysis by Quest J| software. Firsl. before the advent of phosphor im- 
agers, gets were diied and ftuorogiaphed. Each gel was exposed to b'lm for three 
di lie j en t times (typically 1 day, 2 weeks, and 6 weeks) to increase the dynamic 
range of the data. The films were scanned along whb calibration strips to relate 
him optical density to disintegrations per minute in the eels and analyzed by the 
soltwaie to obtain a linear relationship between disintegrations per minuie in the 
spots and optical densities of ibe him images. The quantitative data are ex- 
pressed as parts per million of the total cellular protein. This value is calculated 
from the disintegrations per minute of l he sample loaded onto the gel and by 
comparing the film density of each data spot with density of the film over the 
calibration sirips o( known radioactivity exposed to the same film. This yields the 
disintegrations per minute per millimeter /or each spol on the gel and I hence its 
paits-per-minuic value. 

Aftei the advent of phosphor imaging, frel*, bearing labeled proteins weje 
exposed lo phosp hoi imager screens and scanned by a Fuji phosphor imager, 
typically for two expos tncs pc» gel. Calibration Mrips of known radicacuviry weie 
o posed simultaneously. Se;tn data from ihr phosphor imager was assimilated by 
Quest II software, and quantitative data *nt recorded (oi the i-pots on (he pels. 
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JViKwurements of protein turnover. Cells in esponcnlial phase were pulse- 
labeled with [ J3 S)meihionine, excess cold Met and Cys were added, and samples 
of equal volume were taken from the culture at intervals up to 90 min (in one 
experiment) or up to J 60 roin (in a second experiment). Incorporation of 35 S into 
protein was essentially 100% by the first sample (JO min). Extracts were made, 
and equal fractions of the samples were loaded on 2D gels (i.e., the different 
samples had different amounts of protein but equal amounts of 35 S). Spots were 
quant itated with a phosphor imaging and Quest software. 

The software was queried for spots whose radioactivity decreased through the 
time course. The algorithm examined all data points for all spots, drew a best-fit 
Imc through the data points, and looked for spots where this line had a statis- 
tically significant negative slope. In one of the experiments, there was one such 
spot. To the eye, this was a minor, unidentified spot seen onlv in the first two 
samples (10 and 20 mm). In the other experiment, the Quest software found no 
spots meeting the criteria. Therefore, we concluded thai none of the identified 
spots (and all but one of the visible spots) represented proteins with Ion* 
half-lives. " 

Centrifugal fractional ion. Cells were labeled, harvested, and broken with glass 
beads by the standard method described above excepl that no detergent (i.e., no 
deoxycholatc) was present in the lysis buffer. The crude rysate was cleared of 
. unbroken cells and large debris by centrifugal ion al 300 X g for 30 s. The 
supernatant of this centrifugal ion was then spun at 1 6,000 X £,for 10 roin to give 
the pellet used for Fig. 6B. The supernatant of the 16,000 X g r JO-min spin was 
then spun at J 00,000 X g for 30 min to give the supernatant used for Fig. 6A. 

Protein abundance calculations. A haploid yeast cell contains nbout A X 10"' 2 
g of protein {J, 15). Assuming a mean protein mass of 50 kDa, there are about 
50 X 10 molecules of protein per cell. There are about 1.8 methionines per JO 
kDa of protein mass, which implies 4.5 X If) 8 molecules of methionine per cell 
(neglecting the small pool of free Met). We measured (i) the counts per minute 
in each spot on the 2D gels, (ii) the total number of counts on each gel (by 
integrating counts over the entire gel), and (iii) the total number of counts 
loaded on the gel (by scintillation counting of the original sample) Thus, we 
know what fraction of the total incorporated radioactivity is present in each spot. 
After correcting lor the methionine (and cysteine [see below)) content of each 
protcm, we calculated an absolute number of protein molecules based on the 
fraction of radioactivity in each spot and on 50 X )0 6 total molecules per cell 
The labeling mixture used contained about one-fifth as much radioactive 
cysteine as radioactive methionine. Therefore, the number of cysteine molecules 
per proton was also taken into account in calculating the number of molecules ■ 
or proton, but Cys molecules were weighted one-fifth as heavilv as Met mole- 
cules. 

mRNA abundance calculations. For estimation of mRNA abundance we used 
1>AGE (scnal analysis of gene expression) data (27) and Aifymetiix chip hybrid- 
rtahon data (2% ; 30). The mRNA column in Table I >how S mRNA abundance 
calculated horn SAGE data alone. However, (he SAGE data came from cells 
growing ,n YEPD medium, whereas our protein measurements were from cells 
grow (m > ,„ YNB medium. In addition, SAGE data for low- abundance mRNAs 
S» "nw" 1 5la ' ,S,ic: ' 1 va " a,i »»- Therefore, we also used chip hybridization data 
{29* >0) for mRNA horn ceils grown in YNB. These hybridization data also had 
disadvantages. Fust, (he amounts of hieh- abundance mRNAs were systemati- 
cally underestimated, probably because of saturation in (he hybridi 2 ations. which 
used 10 M g of cRNA. For example, the abundance of ADW mRNA was 197 
cop,« per cell by SAGE but only 32 copies per cell by hybridization, and the 
abundance of ENQ2 mRNA was 248 copies per cell by SAGE but only 4) by 
hybridization. When the amount of cRNA used in the hybridization w as reduced 
V c!£i ,™ PP ^' ? m ° UntS ° f mRNA w " c similar to the amounts determined 
by SAGE (.9a, 29b). However, experiments using I M g of cRNA have been done 
oi only some genes (29a). Because amounts of mRNA were normalized to 
J5 : 000 per cell, and because the amounts of abundant mRNAs were underesti- 
m oK^' ,b ^ C * 3 ? - M ° ld ovcrcMi ™' e of the abundance of nonabundanl 
m n^A S " y' " ,C " ,J,,cd ,his Utaoj of 2 -2 by adding together the number of 
Tl AT T^™'" fr ° m 3 ,ar £ e numbtT of genes expressed M a low level for both 
SAGE data and hybridization data. The sum for the same genes from hybrid- 
nation data is 2Mold greater ihan that from SAGE data. 

To take into account ihese difficulties, wc compiled a list of "adjusted" mRNA 
abundance as follow. For all high-abundance mRNAs of our identified proteins 
we used SAGE data. For nil of these particular mRNAs, chip hybridization 
JuggCMcd that mRNA abundance was the same in YEPD and YNB media For 
mcdmm-abund.oce mRNAs, SAGE dura w CIC used. bu. when hybridizaiion 
data showed a significant difference between YEPD and YNB then the SAGE 
data were adjusted by the app.opriatc factor. Finally, tor' low. Sundance 
mRNAs, we used data from chip hybridizations from YNB medium but divided 
by 2.2 to normalise to the SAGE results. These calculations were completed 
without reference to protein abundance. 

. ?H' ™'™^ n ^P ,3, »> n <CA1) was taken from the yeaM protcome 

d, taW O PD) ( ]3) : loi which calculates were ma.lt acccm-ini! «o Sharp and 
U M). Biie/ly, ,he mde* uses a reference set ot hij>Juv express! *cnts to a^i K n 
a value ro each codon. ;,nd rhen a ..core for a is cnlcubteo from ihe 

frequency ot use of ihc vjjious codons in that gene (M) 

Statis.i^t :>nalvMs. The IMP p.og.a.n was used wi,h the ;„d of T Tullv The 
JMP piogram Mwwrd du. ncMher ndiNA no, protein :,rnmd;„H^ nor- 
mally disiirbt.led: thcn-lote, Spea.man ianli cmielaiion okIIVk'.m.s (, ) wctK - 



calculated. The mRNA (adjusted and unadjusted) and protein, data were also 
transformed so thai Pearson product-moment correlation coefficients (r ) could 
be calculated. First, this was done by a Box-Cox transformation of log-trans- 
formed data. This transformation produced normal distributions, and an r of 
0.76 was achieved. However, because the Box-Cox itansformation is coroplexf we 
also did a simpler logariihmic transformation. This produced a normal distribu- 
tion for the protein data. However, the distribution for the mRNA and adjusted 
mRNA data was close to, but not quite, normal. Nevertheless, w C calculated tire 
r p and found thai it was 0.76, identical to the coefficient from the Box-Cox 
transformed data. We therefore believe thai this correlation coefficient is not 
misleading, despite (he fact that the log(mRNA) distribution is not quite normal 



RESULTS 

Visualization of 1,400 spots on three gel systems. Yeast 
proteins have isoelectric poinis ranging from 3.1 to 12.8, and 
masses ranging from Jess than 10 kDa to 470kDa.lt is difficult 
to examine all proteins on a single kind of gel, because a gel 
with the needed range in pi and mass would give poor resohr- 
tion of the thousands of spots in the central region of the geJ. 
Therefore, we have used three ge) systems: (i) pH "4 to 8" with 
10% poiyacrylamide; (ii) pH "3 to 10" with 10% poryacryJ- 
amide; and (iii) nonequilibrium with 15% poiyacrylamide (7, 
8). Each gel system allows good resolution of a subset of yeast 
proteins. 

Figure 1 shows a pH 4-8, 10% poiyacrylamide gel. the pH 
at the basic end of ihe isoelectric focusing gel cannot be main- 
tained throughout focusing, and so the proteins resolved on 
such gels have isoelectric points between pH 4 and pH 6.7. For 
these pH 4-8 gels, we see 600 to 900 spots on the best gels after 
multiple exposures. 

The pH 3-10 gels (not shown) extend the pi range somewhat 
beyond pH 7.5, allowing detection of several hundred addi- 
tional spots. Finally, we use nonequilibrium gels with 15% 
acrylamide in the second dimension. These allow visualization 
of about 100 very basic proteins and about 170 small proteins 
(less than 20 kDa). In total, using all three gel systems, about 
1,400 spots can be seen. These represent about 1,200 different 
proteins, which is about one-quarter to one-third of the pro- 
teins expressed under these conditions (27, 30). Here, we focus 
on the proteins seen on the pH 4-8 gels. 

Although nearly all expressed proteins are presenl on these 
gels, the number seen is limited by a problem we call coverage. 
Since there are thousands of proteins on each gel, many pro- 
teins comigrate or nearly comigrate. When rwo proteins are 
resolved, but are close together, and one protein spot is much 
more intense than the other, a problem arises in visualizing the 
weaker spot: at long exposures when Ihe weak signal is strong 
enough for detection, the signal from the strong spot spreads 
and covers the signal from the weaker spot. Thus, weak spots 
can be seen only when they are well separated from strong 
spots. 

For a given gel, the number of delectable spots initially rises 
wnh exposure time. However, beyond an optimal exposure, the 
number of distinguishable spots begins to decrease, because 
signals from strong spots cover signals from nearby weak spots. 
At long exposures, the whole autoradrogram lurns black. Thus, 
there is an optimum exposuie yielding the maximum number 
of spots, and at this exposure the weakest spots are not seen. 

Largely because of ihe problem of coverage, the proteins 
seen are strongly biased toward abundant proleins. All identi- 
fied proteins have a CA1 of 0.18 or more, and we have iden- 
nfted no transcription factors or protein kinases, which are 
nonabundanl proteins. Thus, this leclmology is useful for ex- 
amining protein synthesis, amino acid metabolism, and glyco- 
lysis but no! for examining transcription, DNA replication, or 
the cell cvcle. 
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Spot i den (i Mention. The identification of various spots has 
been described elsewhere (7, 8), At present, 169 different spots 
representing 148 proteins have been identified. Many of these 
spots have been independently identified (2, 10, 23, 25). The 
main methods used in spot identification have been analysis of 
amino acid composition, gene overexpression, peptide se- 
quencing, and mass spectrometry. 

PuJse-cbase experiments and protein turnover. Pulse-chase 
experiments were done to measure protein half-Jives (Materi- 
als and Methods). Cells were labeled with [ 35 S]methionine for 
10m in. and then an excess of unlabeled methionine was added. 
Samples were taken at 0, 10, 20, 30, 60 ; and 90 min after the 
beginning of the chase. Equal amounts of 35 S were loaded from 
each sample; 2D gels were mn, and spots were quantitaled. 
Surprisingly, almost every spot was nearly constant in amount 
of radioactivity over the entire time course (not shown). A few 
spots shifted from one position to another because of post- 
translational modifications (e.g., phosphorylation of RpaO and 
Efbl). Thus, the proteins being visualized are all or nearly all 
very stable proteins, with half-lives of more than 90 min. Gygi 
et al. (10) have come to a similar conclusion by using the N-end 
rule to predict protein half-lives. This result does not imply 
that all yeast proteins are stable. The proteins being visualized 
are abundant proteins; this is partly because they are stable 
proteins. 

Protein quantitation. Because all of the proteins seen had 
effectively the same halMife, the abundance of each protein 
was directly proportional to the amount of radioactivity incor- 
porated during labeling. Thus, after taking into account the 
total number of protein molecules per cell, the average content 
of methionine and cysteine, and the methionine and cysteine 
content of each identified protein, we could calculate the abun- 
dance of each identified proiein (Tables 1 and 2; Materials and 
Methods). About 1,000 unidentified proteins were also quan- 
tified, assuming an average content of Met and.Cys. 

Many proteins give multiple spots (7, 8). The contribution 
from each spot was summed to give the total protein amount. 
However, many proteins probably have minor spots that we are 
not aware of, causing the amount of protein to be underesti- 
mated. 

When the proteins on a pH 4-8 gel were ordered by abun- 
dance, the most abundant protein had S,904 ppm, the 30th 
most abundant had 2,842 ppm, the 100th most abundant bad 
314 ppm, the 500th most abundant had 57 ppm, and the 
1,000th most abundant (visualized at greater than optimum 
exposure) had 23 ppm. Thus : there is more than a 300- fold 
range in abundance among the visualized proteins. The most 
abundant 10 proteins account for about 25% of the total pro- 
tein on the pH 4-8 gel. the most abundant 60 proteins account 
for 50%, and the most abundant 500 proteins account for 80%. 
Since it seems likely that the pH 4-8 gels give a representative 
sampling of all proteins; we estimate that half of the total 
cellular protein is accounted for by fewer than 100 different 
gene products, principally glycolytic enzymes and proteins in- 
volved in protein synthesis. 

Correlalion of protein abundance with mRNA abundance. 
Estimates of mRNA abundance for each gene have been made 
by SAGE (27) and by hybridization of cRNA to oligonucleo- 
tide arrays (30). These two methods give broadly similar re- 
sults, yet each method has strengths and weaknesses (Materials 
and Methods). Table J lists the number of molecules of mRNA 
per cell for each gene studied. One measurement (mRNA) 
uses data from SAGE analysis alone (27); a second incorpo- 
rates data from both SAGE and hybridization (30) (adjusted 
mRNA) (Table J; Materials and 'Methods). .We conelated 
pioiein abundance with mRNA abundance (Fig. 2). Tor ad- 



justed mRNA versus protein, the Spearman rank correlation 
coeificient, r,, was 0.74 (P < 0.0001), and the Pearson corre- 
lation coefficient, r py on log. transformed data (Materials and 
. Methods) was 0.76 {P < 0.00001). We obtained similar corre- 
lations for mRNA versus protein and also for other data trans- 
formations (Materials and Methods). Thus, several statistical 
methods show a strong and significant correlation between 
mRNA abundance and protein abundance. Of course, the cor- 
relation is far from perfect; for mRNAs of a given abundance, 
there is al least a 10-fold range of protein abundance (Fig. 2). 
Some of this scatter is probably due to post transcriptional 
regulation, and some is due to errors in the mRNA or proiein 
data. For example, the protein Yef3 runs poorly on our gels, 
giving multiple smeared spots. Its abundance has probably 
been underestimated, partly explaining the Jow protein/mRNA 
ratio of Yei3. It is Ihe most extreme outlier in Fig. 2. 

These data on mRNA (27, 30) and protein abundance (Ta- 
ble 1) suggest that for each mRNA molecule, there are on 
average 4,000 molecules of the cognate protein. For instance, 
for Actl (actin) there are about 54 molecules of mRNA per 
cell and about 205,000 molecules of protein. Assuming an 
mRNA halMife of 30 min (12) and a cell doubling time of 120 
min, this suggests that an individual molecule of mRNA might 
be translated roughly 1,000 times. These calculations are lim- 
ited to mRNAs for abundant proteins, which are likely to be 
the mRNAs that are translated best. 

A full complement of cell proiein is synthesized in about 120 
min under these conditions. Thus, 4,000 molecules of protein 
per molecule of mRNA implies that translation initiates on an 
mRNA about once every 2 s. This is a remarkably. high rate; it 
implies that if an average mRNA bears 10 ribosomes engaged 
in translation, then each ribosome completes translation in 
20 s; if an average protein has 450 residues; Ihis in turn implies 
translation of over 20 amino acids per s, a rale considerably 
higher than estimated for mammalians (3 to.8 amino acids per 
s) (18). These estimates depend on the amount of mRNA per 
cell (11, 27); 

The large number of protein molecules that can be made 
from a single mRNA raises the issue of how abundance is 
con I rolled for less abundant proteins. Many non abundant pro- 
teins may be unstable, and this would reduce the protein/ 
mRNA ratio. In addition, many nonabundant proteins may be 
translated at suboptimal rates. We have found that mRNAs for 
nonabundant proteins usually have suboptimal contexts for 
translational initiation. For example, there arc over 600 yeast 
genes which probably have short open reading frames in the 
mKT^A upstream of the main open reading frame (17a). These 
may be devices for reducing the amount of protein made from 
a molecule of mRNA. 

Correlation of codon bias with protein abundance. The 
mRNAs for highly expressed proteins preferentially use some 
codons rather than others specifying the same amino acid (14). 
This preference is called codon bias. The codons preferred are 
those for which the tRNAs are present in the greatest amounts. 
Use of these codons may make translation faster or more 
efficient and may decrease misincorporation. These effects are 
most important for the cell for abundant proteins, and so 
codon bias is most extreme for abundant proteins. The effect 
can be dramatic— highly biased mRNAs may use only 25 of the 
61 codons. 

We asked whether the correlation of codon bias with abun- 
dance continues for medium- abundance proteins. There are 
various mathematical expressions quantifying codon bias; here, 
we have used the CA.l (2-1) (Materials and Methods) because 
it gives a result between 0 and 1. The r 3 for CA1 versus protein 
abundance is O.SO [P < 0.0001), similar to the mRNA- protein 



TABLE I. Quantitative data" 



Function 



Name 



CA1 



Carbohydrate metabolism 



Protein synthesis 



Heat shock 



Amino acid synthesis 



Miscellaneous 



Adhl. 

Adh2 

Cil2 

Eriol 

Eno2 

Fbal 

Hxkl,2 

Jell 

Pdbl 

Pdcl 

mi . 

Pgil 

Pycl 

Tall 

Tdb2 

Tdh3 

Tpil 



roRNA Adjusted mRNA Prorcin (Giu) (It) 3 ) Protein (Eth) (10>) E/G ratio 



0.830 

0:504 

0.185 

0.870 

0.892 

0.868 

0,500 

0.251 

0.342 

0.903 

0.465 

0.681 

0.260 

0-579 

0.904 

0.924 

0.817 



197 
0 
1 

No Ma 
248 
179 
13 
0 
5 

226 
5 
14 
1 
5 
f>3 
460 

NoAVa 



Efbl 


0.762 


33 


Eftl,2 


0.801 


26 


Prtl 


0.303 


4 


RpaO 


0.793 


246 


Tifl,2 


0.752 


" 29 


Yei3 


0.777 


36 


HscS2 


0.581 


2 


Hsp60 


0.381 


9 


Hsp82 


0.517 


2 


Hspl04 


0.304 


7 


Kar2 


0.439 


5 • 


Ssal 


0.709 


2 


Ssa2 


0.802, 


10 


Ssbl? 


0.850 


50 


SscJ 


0.521 


2 


Ssel 


0.521 


8 


SnM 


0.247 


1 


Adel 


0.229 


4 


Ade3 


0.276 


2 


Ade5,7 


0.257 


2 


Arg4 


0.229 


1 


Gdh3 


0.585 


10 


GJnl 


0.524 


11 


His4 


0.267 


3 


Ilv5 


0.801 


6- 


Lys9 


0.332 


4 


Met6 


0.657 


NoNla 


Pro2 


0.248 


3 


Serl 


0.258 


2 


Trp5 


0.3J9 


5 



197 

2.8 

248 
179 . 
10.5 

5 

226 

5 
14 

0.7 

5 
63 
460 



16.5 
26 
0.7 
246 
29 
36 

2.9 
23 
1.3 
7 

. 10.1 
4.3 
5 

50 
2.6 
8 
1.1 

4 * 

1.7 

3.4 

8.1 
27 
11 
3 
6 
4 
22 
3 

1.2 

5 



Actl 


0.710 


54 


54 


AdkJ 


0.531 


No Nta 


Ald6 


0.520 


3 


3 


Atp2 


0.424 


1 


4.1 


Bmhl 


0.322 


46 


46 


Bmh2 


0.384 


1 


1.4 


Cdc4S 


0.306 


2 


2.4 


Cdc60 


0.299 


2 


0.86 


Erg20 


0.373 


5 


5 


Gppl 


0.603 


16 


5 


Gspl 


0.623 


3 


3 


IppJ 


0.620 


4 


4 


LcbJ 


0.173 


0.3 


0.8 


Mot) 


0.423 


0 


0.45 


P;»bl 


0.488 


3 


3 


Psnl 


0.600. 


15 


15. 


Rnr4 


0.497 


6 


6 


Sam J 


0.494 


5 


5 


Sam2 


0.497 


-\ 


35 


Sod) 


0.376 


36 


36 


Ubal 


0.212 


2 


2 


YKL056 


0.731 


62 


■62 


YLRJ09 


0.549 


2) 


21 


YMRH6 


0.777 


41 


41 



1,230 
0 
23 
430 
650 
640 
62 
0 
41 
280 
75 
360 
37 
110 
430 
1,670 
No Met 

358 
99' 
12 
277 
233 
34 

312 

35 

52 

70 

43 
303 
213 
270 

68 

96 

25 

14 
12 
14 
43 
148 
. 77 
35 
.152 
32 
190 
30 
35 
28 

205 
47 

181 
76 

191 

334 

32 
6 

92 
234 

115 
254 

19 

20 

41 
J48 
■ 44 

59 

63 
631 

34 
253 
930 
184 



972 
963 
288 
974 
215 
608 
46 
671 
33 
205 
53 
120 
34 
35 
876 
1,927 
No Mel 

362 
54 
6 
300 
106 
ND 

75 

82 
135 
161 
102 
421 
324 

85 

80 
48 
44 

27 
9 

4 ■ 

41 

55 
104 

23 
109 

17 

80 

12 

8 

12 

164 
43 

159 

109 

137 

147 

26 
2 

39 

158 

39 
147 

40 

J6 

19 

56 

37 

21 

20 
618 

20 
312 

40 



0.79 
>20 
12 
2.4 
0.33 
0.95 

>20 

0.73 . 

0.71 

0.75 



NR 
NR 



0.55 

036 
0.46 



0.67 

2.3 

2.6 

2.3 

2.4 

1.4 

1.5 

1.2 

1.7 



1.3 

3.5 

0.7 

0.52 

0.42 



0.78 



1.4 

0.72 



034 
0.58 



0.47 



0.44 
0.20 
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TABLE 2. Functions of proteins listed in Table J 



Name" 



YPD title lines* 



Adhl 
Adh2 
Cit2 
EnoJ 
Eno2 
Fbal 
.Hxkl ' 
Hxk2 
JcI3 
Pdbl 
PdcJ 
Pfltl 
Peil 
Pycl 
Tall 
Tdh2 
Tdh3 
TpiJ 

Efbl 
Eftl 
Eft2 
Prll 

RpoO (RPPO) 
TifJ 



Alcohol dehydrogenase I; cytoplasmic isozyme reducing acetaldehyde to etbanol, regenerating NAD* 
Alcohol dehydrogenase II; ox>d ( zes etbanol to acetaldehyde, glucose repressed & 
Citrate synthase, peroxisomal (nonmilochondriai); converts acetyl-CoA and oxaloacelate to citrate dIus CoA 
Ino LI 2 \l^^ Cei T 5 e S y S ra,1SC f C ° nVCrtS 2-P^pb^-D-grycerate to ^S^^^J^A 
F^^^ » Phos^oeno^rovate i„ «is 

HexokJnase I; converts bexoscs lo hexose phosphates in glycolysis; repressed by glucose 

w^T? ; COnVe - ,S hCX .° SeS '-° bCXOSC P hos P hatcs i^co'rysis and plays a re^Xory role in Wucose repression ■ 
Isoctrate lyase peroxisomal; carries out part of the glyoxylate cycle; required for gIuconerien«is repression 
Pyruvale dehydrogenase complex, El beta subunit giucuneogenesis 
P}Tuvate decarboxylase isozyme 1 

r&wr C K OW rf - ' Pha SUb " nit ' paH ° f 3 COmp)ex wi,h Pft2 P which out a key regulatory step in Rrycolvsis 

Glucose-6-phosphate (somerase, converts glucose-6- phosphate lc! Tfruclose-^phosphale «> '^ory siep in grycotyss 
Pyruvate carboxylase J: converts pyruvate to oxaloacelate for gl ^oncogenesis 
Transaldolase; component of nonoxidalive pan of pentose phosphate pathway 

r k-^ ia m e M e 1" P J OSp t! afe i e M r °£ enase 2 i inverts r>glycer aldehyde 3-phosphale lo U-dephosphoRrycerale 
Glyceraldehyde-3 : phosph a ie dehydrogenase 3; converts D-glyceraldehyde 3-phosphale to J^S^lSS^ 
Tr ) oseph W phate I someras e ;m.erconveMsglyceraldehvd e -3-phosphateanddihyd^ - 
Translation elongation factor EF-]p; GDP/GTP exchange factor for Teflp/ref2p 

Transition elongation factor EF-2; contains diphthamide which is not essential for activity- identical lo Ef l2r> 
Trans a , on eJongat.on factor EF-2; contains diphthamide which is no. essential for ac v £ dent S o Ef 1 S 

J^£Z^J^ m * n subu,lit (p90); has an RNA rccofini,ion doLn 

Translation initiation factor 4A (eIF4A) of the DEAD box family 
Translation mmation factor A A <elF4A) of the DEAD box family 
Translation elongation factor EF-3A; member of ATP-binding cassette superfamily 

Chaperonin homologous to E. coti HtpG and mammalian HSPiX) 

Mitochondrial chaperonin ihai cooperates with HspJOp; homolog of £ coti GroEL 

Heat-.nduc.ble chaperonin homologous to £ cofi HtpG and mammalian HSP90 

^hWc^^ ,ndUCed ,hermo,oIe -"« ™« '<* resolubihzing aggregates of denatured proteins; important for [psPf 

" andlo^nK P™* "v****™ -oss the endop.asmic reticuJum membrane 

Cytoplasmic chapeione; heal shock piolein of ihe rlSr>70 family 

Cytoplasmic chaperone; member of Ihe HSP70 family 

Heal .shock protein of HSP70 family involved in ihe translalional apparatus 

Heal shock prolem of HSP70 family, cytoplasmic 

^TtapSffi shoe k IXZZjfigjS T " Mp *"* * Ch ^ 0nin of pro.ein chains 

— protein 

^T&X^^ P>™phosphohyd,o.ase/his,idino. dehydrogenase; 2nd. 3rd. and ,0,„ steps of 

K "n„ r , ,^h'e^Sir' ;,0hy,),UXy - a,:i,, "-^misomerase) (alpl.a-ke.o.p.hyd.oxyiacy.) reducloisomer.sc); second s.ep i» Va. 

cobalamin independent ' ■"LU.yllelrjhvdroplciovl Ir.glulamate homOTOIeine niclhylnansferase), melhionine synthase, 

J-Glulamyl pn.«phate .educlase (phosphoglulamale dehydrogenase), proline bio™,.helic en7vme 
Phospboserme l.ansam.nase; .nvolved in synthesis of sen'ne ftom 3-phosphoelSe " 
Tryptophan synthase, last (5lh) step in .ryplophan biosynlhesis pathway ^ 

Adenilaletn^trTpP tpt' l"^ 1 ^ o'he, cy.oskele.al f onc ,io„s 
AOen)ljle kinase (G IP.AMP phosphotransferase ). cytoplasmic 
Cylosohc acetaldehrde dehydrocenase 

Deta subunit of Fj.ATP synthase; 3 copies »,e found in each FJ oligomer 
Homo og of mammalian 14-3-3 protein; has strong similarity lo Bmh2p 
P^^^T^!'"', 43 ; 3 &' ore " ,; has 5,,on S similfi'y to Bmhjp 

'« «» "«> "omotvpic m,mb,a„e fusion 

"""^ in ™'" J •»«'^ .Hiongh nuclea, pores 

P ^»r,r ° f J,U) " W ,C - P»» " f 3 ' - J HNA-pioccxsing comple, (cleavage lacot ,); has 4 RNA 

R&«S^ a SSSr : GRI> — Py-Phosphorylase 

5-Adenosylmelhionine ivnlhcrase 1 

5-Adenosy!meihionine synthetase 2 

Copper zinc superoxide dismulase 

Ubiqui tin- :ictiva ling (EJ) enzvme 
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Adj. mRNA 

FIG. 2. Correlation of protein abundance with adjusted mRNA abundance. 
The number of molecules per cell of each protein is plotted against the number 
of molecules per cell of the cognate mRNA, with an r p . of .0.76. Note the 
logarithmic axes. Data for mRNA were taken from references 27 and 30 and 
combined as described in Materials and Methods. 



correlation, confirming a strong correlation between CAI and 
protein abundance (Fig. 3). The relationship between CAI and 
protein abundance is log iinear from about 1,000,000 to about 
10,000 molecules per cell. We have no data for rarer proteins. 

It is not clear whether CAJ reflects maximum or average 
levels of protein expression. The proteins used for the CAl- 
protein. correlation included some proteins which were not 
expressed a I maximum levels under I he condition of the ex- 
periment (Hsc82, Hspl04, Ssal, Adel, Arg4, His4, and others). 
When these proteins were removed from consideration and 
the correlation between CAI and the remaining (presumably 
constitutive) proteins^ was recalculated, the r s was essentially 
unchanged (not shown). 

The equation describing the graph in Fig. 3 is log (protein 
moJecuJes/cell) = (2.3 X CAI) + 3.7. Thus, under certain 
conditions (a CAI of 0.3 or greaier; a constitutively expressed 
gene), a very rough estimate of proiein abundance can be 
made by raising 10 to the power of [(2.3 X CAI) + 3.7]. 

The distribution of CAJ over the genome (Fig. 4) consists of 
a lower, bcJl-shaped distribution, possibly indicating a region 
where there is no selection for codon bias, and an upper, flat 
distribution, starting at a CAJ of about 0.3, possibly indicating 
a region where there is selection for codon bias. Almost all of 
the proteins whose abundance we have measured are in the 
upper, flat portion of the distribution. In ihe lower, beJJ-shaped 
region, we do not know whether there is a correlation beiween 
CAI and protein abundance. 

Changes in protein abundance in glucose and ethauol. A 
comparison of ceils grown in glucose (Fig. 1 A) with cells grown 
in ethanol (Fig. IB) is shown in Table ]. As is well known, 
some proteins are induced tremendously during growth on 
ethanol. Two striking examples aTe the peroxisomal enzymes 
Jell (isocitrate lyase) and Cit2 (citrate synthase), which are 
induced in ethanol by more lhan 100- and 12- fold, respectively 
(Fig. 1; Table J). These enzymes are key components of the 
glyoxylate shunt, which diverts some ncetyl coenzyme A 
(aceryJ-CoA) horn ihe iricaiboxylic acid cycle lo glnconeogen- 
esis. S. cerevisioc requires large amounts o/ carbohydrate for its 
cell wall; in ethanol medium, this carbohydrate comes from 
gluconeogenesis. which depends on ihe glyoxylate shunt and 
on ihe glycolytic pathway miming in reverse. The need for 



gluconeogenesis also explains why glycolytic enzymes are 
abundant even in ethanol medium. Thus, 2D gel analysis shows 
the prominence of the glycolytic and glyoxylate shunt enzymes 
. in cells grown on ethanol, emphasizing that gluconeogenesis, 
presumably largely for production of the cell wall, is a major 
metabolic activity under these conditions. 

During gluconeogenesis, substrate-product relationships are. . 
reversed for the glycolytic enzymes. One might expect that not 
all glycolytic enzymes would be well adapted to the reverse . 
. reaction. Indeed, 2D gels show that in ethanol, Adh2 (alcohol 
dehydrogenase 2) is strongly induced (16), while its isozyme 
Adhl is not greatly affected. Adhl and Adh2 each interconvert 
acetaldehyde and ethanol. Adhl has a relatively high K m for 
ethanol (17 mM), while Adh2 has a lower K m (0.8 mM) (5). 
Thus, it is thought that Adhl is specialized for glycolysis (ac- 
etaldehyde to ethanol), while Adh2 is specialized for respira- 
tion (ethanol to acetaldehyde) (5, 29). Similarly, Enol (enolase 
1) is induced in ethanol, while its isozyme Eno2 (enolase 2) 
decreases in abundance (Table 1) (4, 19). Enol is inhibited by . 
2-phosphoglycerate (the glycolytic substrate), while Eno2 is 
inhibited by phosphoenolpynivate (the gluconeogenic sub- 
strate) (4). Perhaps Enol has a lower K m for phosphoenol- 
pynivate than does Eno2, though to our knowledge this has not 
been tested. Thus, the 2D gels distinguish isozymes specialized 
for growth on glucose (Adhl and Eno2) from isozymes spe- 
cialized for ethanol (Adh2 and Enol). 

Many heat shock proteins (e.g., Hsp60, Hsp82, Hspl04, and 
Kar2) were about twofold more abundant in ethanol medium 
than in glucose medium. This is consistent with the increased 
heat resistance of cells grown in ethanol (3). 

Enzymes involved in protein synthesis (Eftl, RpaO, and Tifl) 
were about twice as abundant in glucose medium as in ethanol 
medium. This may reflect the higher growth rate of the cells in 
glucose; 

Phosphorylation of proteins. To examine protein phosphor- 
ylation, we labeled cells with 32 P and ran 2D gels to examine 
phosphoproleins. About 300 distinct spots, probably represent- 
ing 350 to 200 proteins, could be seen on pH 4-8 gels (Fig. 5B). 
We then aligned autoradiograms of three gels, each with a 
different kind of labeled protein ( 32 P only [Fig. 5B], 32 P phis 
35 S [Fig. 5A), and 35 S only [not shown, but see Fig. 1 for . 
example]). Jn this way, we made provisional identification of 
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some of the 32 P-labeJed spots as particular 35 S-labeIed spots. 
AJ1 such identifications are somewhat uncertain, since precise 
alignments are difficult, and of course multiple spots may ex- 
actly comigrate. Nevertheless,, we believe that most of the 
provisional identifications are probably correct. Among the 
major 32 P- labeled proteins are the hexokinases Pfxkf and 
HxJcZ the acidic ribosome-associated protein RpaO, the trans- 
lation factors Yef3 and Efbl, and probably Hsp70 heat shock 
proteins of the Ssn and Ssb families. RpaO and EfbJ are quan- 
titatively monophosphorylaled. 

Many yeast proteins resolve into multiple spots on these 2D 
gels (7). Yef3 has live or more spots, at least four of which 
comigrate with 32 P. Tpil has a major spot showing no 32 P 
labeling and a minor, more acidic spot which overlaps with 
some 32 P label. TO has at least seven spots (7); two of these 
overlap with some 32 P label, but five do not (Fig. 5). Eftl has 
at least three spots (7), and none of these overlap with 32 P, 
although there are three nearby, unidentified 32 P-labeJed spots 
(a, c, and d in Fig. 5). Spots thai seem to be extra forms of 
Mei6, Pdc], Eno2, and Fbal can be seen in Fig. 6A, but there 
is bttJe *P at these positions in Fig. 5. Thus, phosphorylation 
explains some but not all of the different protein isoforms seen. 

The celUvde is regulated in part by phosphorylation. We 
compared 32 P-lnbeled proteins from cells synchronized in G, 
with o -factor, in cells synchronized in G T by depletion of Gj 
cyclms, and in cells .synchronized in M phase with nocodazole. 
Only very minor differences were seen, and these were difficult 
to leproduce. The cell cycle proteins regulated by phosphory- 
lation may not be abundant enough for this technique to be 
applied easily- 
Centrifugal fractionation. We fractionated 35 S-labeIed ex- 
tracts by centiifngntion (Materials and Methods). Figure 6A 
shows the proteins in the supernal an I of a high-speed 
(100 r 000 x g : 30 min) centrifugation, while Fig. 6B shows the 
proteins in the pellet of a low-speed (I6 : 000 X ^ 10 min) 
cemrifugation. Many proteins ;ire tremendously enriched in 
one baciion or I he oihci, while others are present in both. 



Most glycolytic enzymes (e.g., Tdh2, Tdh3, Eno2, Pdcl, Adhl, 
and Fbal) are enriched in the supernatant fraction. The only 
exception is Pfkl (not indicated), which is found in both pellet 
and supernatant fractions. Many proteins involved in protein 
synthesis (Eftl, Yef3, Prll, Tiff, and RpaO) are in the pellet, 
possibly because of the association of ribosomes wiib the en- 
doplasmic reticulum. However, Efbl is in the supernatant, as is 
a substantia] portion of the Eft]. Perhaps surprisingly, several 
mitochondrial proteins (Atp2 [not shown] and Jlv5) are largely 
in the supernatant. Perhaps glass bead breakage of cells re- 
leases mitochondrial proteins. The nuclear protein Gspl is in 
the pellet fraction. The enrichment produced by centrifugation 
makes it possibJe to see minor spots which are Otherwise poorly 
resolved from surrounding proteins. Figure 6B shows that the 
previously identified Till spot is surrounded by as many as six 
other spots that cofractionate. We observed six identical or 
very similar additional spots when we overexpressed Tifl from 
a high-copy- number plasmid (not shown). Signal overlaps only 
one or two of these spots in 3? P-labeJing experiments (Fig. 5), 
and so the different forms are not mainly due to different 
phosphorylation states. 

DISCUSSION 

Our experience with developing a 2D gel protein database 
for S. cerevbiae is summarized here. With current technology, 
we can see the most abundant J ,200 proteins, which is about 
one-third to one-quarter of the proteins expressed. The re- 
maining proteins will be difficult to see and study with the 
methods that we have used, not because of a lack of sensitivity 
but because weak spots are covered by nearby strong spots. 

Of the ),200 proteins seen, we have identified M8, with a 
bias toward the most abundant proteins. Steady application of 
the methods already used would allow identification of most of 
the remaining proteins. Gene over expression will be particu- 
larly useful, since it is not affected by the lower abundance of 
ibe r emaining visible prole ins. 



6.5 



PH 



4.3 





16 kDa 



dcray ot signal inicnsirj xenon cxp,n u ,„ ,„:„l c j „ crk , \„„ „„, sllOTn , f'„ V ''""""f" "I = ^ labeled spo.; ami (iii) The 

«p, .imcn.s <wt .b,, F* «>. posi.i I thf „,;„,„ ^J*,", " n, 14 K.*.^' ™. m,n V '°"" S '". ?'.»«? *- » —"p^n 

indiralrd by :. l a , er riirfc; positions „l scw„ | 0 „„j „| Till u,,- inilicaicj by smi.llci riiclcs. 



' l:.l.< lit ¥ . Tlw iiuijrti tinm ol Tpi). which is .ml labrled s 



A 




16kDo 

6.7 



11 4.3 

pH 



Vol. 19, 1999 



SAMPLING OF THE YEAST PROTEOME 7367 



2D gels of the kind that we have used are not suitable for 
visualization of rare proteins. However it will be possible to 
study on a global basis metabolic processes involving relatively 
abundant proteins, such as protein synthesis, glycolysis, glu- 
coneogenesis, amino acid synthesis, cell wall synthesis, nucle- 
otide synthesis, lipid metabolism, and the heat shock response. 

Gygi et a). (10) have recently completed a study similar to 
ours. Despite generating broadly similar data, Gygi et al. 
reached markedly different conclusions. We believe that both 
rnRNA abundance and codon bias are useful predictors of 
protein abundance. However, Gygi et al. feel that mRNA 
abundance is a poor predictor of protein abundance and that 
"codon bias is not a predictor of either protein or mRNA 
levels" (10). These different conclusions are partly a matter of 
viewpoint. Gygi et al. focus .on the fact that the correlations of 
mRNA and codon bias with protein abundance are) far from 
perfect, while vye focus on the fact that, considering the wide 
range of mRNA and protein abundance and the undoubted 
presence of other mechanisms affecting protein abundance, 
the correlations are quite good. 

However, the different conclusions are also partly due to 
different methods of statistical analysis and to real differences 
in data. With respect to statistics, Gygi et al. used the Pearson 
product-moment correlation coefficient (r p ) to measure the 
covariance of mRNA and protein abundance. Depending on 
the subset of data included, their r p values ranged from 0.1 to 
0.94. Because of the low r p values with some subsets of the 
data, Gygi el al. concluded that the correlation of mRNA to 
protein was poor. However, the r p correlation is a parametric 
statistic and so requires variates following a bivariate normal 
distribution; that is, il would be valid only if both mRNA and 
protein abundances were normally distributed. In fact, both 
distribuiions are very far from.normal (data not shown), and so 
a calculation of r p is inappropriate. There was no statistical 
backing for the assertion (hat codon bias fails to predict pro- 
tein abundance. 

We have taken two statistical approaches. First, we have 
used the Spearman rank correlation coefficient (rj. Since this 
statistic is nonparametric. there is no requirement for the data 
to be normally distributed. Using the r s , we find that mRNA 
abundance is well, correlated with protein abundance (r s ~ 
0.74), and the CA1 is also well correlated with protein abun- 
dance (r, = 0.80) (and also with mRNA abundance [data not 
shown]). For the data of Gygi et al. (10), we obtained similar 
results, though with their data the correlation is not as good; r } 
= 0.59 for the mRNA-lo-protein correlation, and r s - 0.59 for 
the codon bias- to- protein correlation. 

In a second approach, we transformed the mRNA and pro- 
tein data to forms where they were normally distributed, to 
allow calculation of an r p (Materials and Methods). Two trans- 
formations, Box-Cox and logarithmic, were used; both gave 
good correlations with our data |e.g„ r p — 0.76 for lbg( adjusted 
RNA) to log( protein)]. We were not able to transform the data 
of Gygi et ai. to a normal distribution. 

Finally, there are also some differences in da In between the 
rwo studies. These may be partly due to the different measure- 
ment techniques used: Gygi et al. measured protein abundance 
by cutting spots out of gels and measuring the radioactivity in 
each spot by scintillation counting, whereas we used phospho- 
rimaging of intact gels coupled to image analysis. We com- 
pared our data to theirs for the proteins common between the 
studies (but excluding proteins whose mRNAs are known to 
differ between rich and minimal media, and excluding Tiff, 
which was anomalous in differing by 100- fold between the two 
d:iia sets). The r\ between the two protein data sets was 0.88 
{P < 0.0001). Although this is a strong correlation, the fact that 



it is less than 1.0 suggests that there may have been errors in 
measuring protein abundance in one or both studies. After 
normalizing the two data sets to assume the same amount of 
protein per cell, we found a systematic tendency for the protein 
abundance data of Gygi et al. to be slightly higher than ours for 
the highest-abundance proteins and also for the lov/est-abun- 
dance proteins but slightly lower than ours for the middle- 
abundance proteins. These systematic differences suggest some 
systematic errors in protein measurement. AJlhough we do. not 
know what the errors are, we suggest the following as a rea^ 
sonable speculation. For the high est- abundance proteins, we 
may have underestimated the amount of protein because of a 
slightly nonlinear response of the pbosphorimager screens. For 
the lowest-abundance proteins, Gygi et al. may have overesti- 
mated the amount of protein because of difficulties in accu- 
rately cutting very small spots out of the gel and because of 
difficulties in background subtraction for these small, weak 
spots. The difference in the middle abundance proteins may be 
a consequence of normalization, given the two errors above. 

The low-abundance proteins in the data set of Gygi et al. 
have a poor correlation with mRNA abundance. We calculate 
that the r s is 0.74 for the top 54 proteins of Gygi et al. but only 
0.22 for the bottom 53 proteins, a statistically significant dif- 
ference. However, with our data set, the r s is 0.62 for the top 33 
proteins and 0.56 (not significantly different) for the bottom 33 
proteins (which are comparable in abundance to the bottom 
53 proteins of Gygi el al.). Thus, our data set maintains a good 
correlation between mRNA and protein abundance even at 
low protein abundance. This is consistent with our speculation 
that protein quantification by phosphorimaging and image 
analysis may be more accurate for small, weak spots than is 
cutting out spots followed by scintillation counting. Our rela- 
tively good correlations even for nonabundant proteins may 
also reflect the fad that we used both SAGE data and RNA 
hybridization data, which is most helpful for the least abundant 
mRNAs. In summary, we feel that the poor correlation of 
protein to mRNA for the nonabundant proteins of Gygi el al. 
may reflect difficulty in accurately measuring these nonabun- 
dant proteins and mRNAs, rather than indicating a truly poor 
correlation in vivo. It is not surprising I ha I observed correla- 
tions would be poorer with less-abundant proteins and 
mRNAs, simply because the accuracy of measurement would 
be worse. 

How well can mRNA abundance predict protein abun- 
dance? With r p - 0.76 for logarithmically transformed mRNA 
and protein data, the coefficient of determination, (r p ) 2 , is 0.58. 
This means that more than half (in log space) of the variation 
in protein abundance is explained by variation in mRNA abun- 
dance. When converted back to arithmetic values, protein 
abundances vary over about 200-fold (Table 1), and (r p ) 7 = 
0.58 for the log data means that of I his 200-fold variation, 
about 20-fold is explained by variation in the abundance of 
mRNA and about ]0-fold is unexplained (but could be. due 
partly to measurement errors). For proteins much less abun- 
dant than those considered here, we imagine the in vivo cor- 
relation between mRNA and protein abundance will be worse, 
and other regulatory mechanisms such as protein turnover will 
be more important. 

Some important conclusions can be drawn from this sam- 
pling of the protcome. First, there is an enormous range of 
protein abundance, from nearly 2,000.000 molecules per cell 
for some glycolytic enzymes lo about 100 per cell for some cell 
cycle proteins f?6a). Second, aboul half of all cellular pioicin 
is found in fewer than 100 different gene products, which are 
mostly involved in carbohydrate meiabolisui oi piotcin synthe- 
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sis. Third, the correlation between protein abundance and CAI 
is Jog linear as far as we can see, which is from about 10,000 
protein molecules per cell to about 1,000,000. This is somewhat 
surprising, because it implies that selective forces for codon 
bias are significant even at moderate expression levels. Jt also 
means that codon bias is a useful predictor of protein abun-. 
dance even for moderately low bias proteins. Fourth, there is a 
good correlation between protein abundance and mRNA 
abundance for the proteins that we have studied. This validates 
the use of mRNA abundance as a rough predictor of protein 
abundance, at least for relatively abundant proteins. Fifth, for 
these abundant proteins, there are about 4,000 molecules of 
protein for each molecule of mRNA. This last conclusion 
raises questions as to how the levels of nonabundant proteins 
are regulated and suggests that protein instability, regulated 
translation, suboptimal rates of translation, and other mecha- 
nisms in addition to transcriptional control may be very impor- 
tant for these proteins. 
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Pre-translational regulation of cytochrome P450 genes is responsible for 
ck S rhosis SPeC C ° f iDdivid,,aI P450 enz y mes ^ong patients with 

GeprgeJ, LiddjeC, Murray BythK, FarreU GC . 

We have recently reported that disease-specific differential alterations in the hepatic 
express.on of xen 0 biot 1C -metabolizing cytochrome P450 (CYP P450) enzymes occur in 

n CYP pro ems are modulated at pre- or post-translatiorial levels, we have now exammed 
the hepatic levels of mRNA for CYPs IA2,2C9,2E1 and 3 A4 b ; solution hybriSoT 
m the same livers of 20 controls (surgical waste from histologicaUy normalTf 32 
case,ofhepatoce,ularand 18 of cholestatic severe clonic liver disease. CYpIa2 
mRNA and CYP 1 A .mmunoreactive protein were both reduced in livers with 
hepatocellular and cholestatic types of cirrhosis. In contrast, CYP3A4 mRNA and protein 
were reduced only m liverefrom patients with hepatocellulaV diseases.^ A2 and'SS 
there were significant correlations between mRNA species and the respective protein 
contents.(rSl A2 = 0.74, rS3A4 - 0.64, P < O.0001). CYP2C9 mR^Twas r Idu<Sl 
pauents wtth both cholestatic and hepatocellular types of li ver disease but 2C p^tem 

CyV™, IS ^ 'T With Ch ° ,eStatiC dySfonCtion - *"» coodio^ST 
CYP2C9 mRNA and protein, was also significant (rs - 0.36, P < 0.005) but mRNA levels 
accounted for only 13% of the variability in protein ranldngs. This is proba^ 

S^eS C r° teinS ^^.^beingdetectedbytheLi^C 
antibody CYP2E1 mRNA and protein were reduced in patients with cholestatic liver - 

decreased. CYP2E1 mRNA was s.ghificantly correlated with CYP2E1 protein but 
ac«,u„ted for ordy ,8% of the variability in protein rankings (rs = 0.43, P < 0 0005) 

■SSSSKSS * 6 * ^ indiCatC that of xSic- 

metabo izmg CYP enzymes among patients with cirrhosis is due, at least in part to pre- 

translafonal mechanisms. The lack of a strong correlation between CYP2E1 SnA and 
protem suggests that this gene, like its rat orthologue, may be subject to pre-tlsIahoTai 
as well as translat.onal and/or post-translational regulation. transiational 

PMID: 7741759 [PubMed - indexed for MEDLINE] 
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Cyclooxygenasc-2 expression it. macrophages: modulation by protein 

Kinase C-aipha. 

Giroux M , Descotcaux A 

Cydooxygenase-2 (CQX-2) is an inducible enzyme responsible for high levels of PG 
products dunng inflammation and immune responses Previous studt with 
pharmacological mhibitors suggested a role for protein kinase C (P1CC) in PG production 
pos S ,b. y by regulating COX-2 expression. In this study, we addressed the ro totfS5. 
alpha ,n the modulation of COX-2 expression and PGE2 synthesis by me overexprSng 

£w ^4Tw egatlV " m l T tant ° f ,hb ,S0en2 y me in the mouse macrophage^ 2e 
to l£?"J' '"Tf ** effCCt ° f Vad0US StimuH °" CGX " 2 -P^ion, namely 
LduceTcOxTmRNA T T""'" P^^ 6 L-^ania donovani. Whereas LPS- * 
induced COX-2 mRNA and protein expression were down-regulated in DN PKC-alpha- 
overexpressmg clones, IFN-ganuna-mdueed COX-2 expression was up-reguS fnDN 
PKC-alpha-overexpressing clones with respect to normal RAW 264 7 cells 
Measurements of PGE2 ,evels revealed a strong correlation between PGE2 secretion and 
I g tT 1 . ° X " 2 mRNA 3nd Pr0tei " levels ™ DN PKC-alpha-overexpresstt 
S J andtM T ier ' to r e r US SU8gCSt 3 r ° ,e f -^C-alpha in the modulanon o 8 
^m^^'' 1 ^ COX " 2 eXPreSSi ° n ' 33 We " 35 b ^-nna-induced . 
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Transcript profiling of human platelets usiug microarray and serial 
analysis of gene expression. 

Gnatcnko DV, Dunn JJ, McCorkle SR . Weissniann P , Perrotta PL Bahou WR 

Department of Medicine, Program in Genetics, State University of New York Stony 
Brook 1 1 794-8 151, USA. ' 

Human platelets are anucleate blood cells that retain cytoplasmic mRNA and maintain 
functionally intact protein translational capabilities. We have adapted complementary 
tediniques.of microarray and serial analysis of gene expression (SAGE) for genetic 
profiling of highly purified human blood platelets. Microarray analysis using the 
Affymetrix HG-U95Av2 approximately 12 600-probe set maximally identified die 
expression of 2147 (range, 13%-I7%) platelet-expressed transcripts, with approximately 
22% collectively involved in metabolism and receptor/signaling, and an 
overrepresentation of genes with unassigned function (32%). m contrast, a modified 
SAGE protocol using the Type IIS restriction enzyme Mmel (generating 2 1 -base pair 
[bp] or 22-bp tags) demonstrated that 89% of tags represented mitochondrial (mt) 
transcripts (enriched in 16S and 12S nbosomal RNAs), presumably related to persistent 
mt-transcription in the absence of nuclear-derived transcripts. The frequency of non-mt 
SAGE tags paralleled average difference values (relative expression) for the most 
"abundant" transcripts as determined by microarray analysis, establishing the 
concordance of both techniques for platelet profiling. Quantitative reverse transcription- 
polymerase chain reaction (PCR) confirmed the highest frequency of mt-derived 
transcripts, along with the mRNAs for neurogram'n (NGN, a protein kinase C substrate) 
and the complement lysis inhibitor clusterin among the top 5 most abundant transcripts. 
For confirmatory characterization, immunoblots and flow cytometric analyses were 
performed, establishing abundant cell-surface expression of clusterin and intracellular 
expression of NGN. These observations demonstrate a strong correlation between high 
transcript abundance and protein expression, and they establish the validity of transcript 
analysis as a tool for identifying novel platelet proteins that may regulate normal and 
pathologic platelet (and/or megakaryocyte) functions. 

PM1D: 12433680 [PubMed - indexed for MEDLINE] 
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Transcript profiling of human platelets using microarray and serial analysis 
of gene expression 

Dmitri V. Gnatenko, John J. Dunn, Sean R. McCorkle, David Weissmann. Peler L. Perrotta, and Wadie F. Bahou 



Human platelets are anucleate blood cells " 
that retain cytoplasmic mRNA and main- 
tain functionally intact protein transla- 
tions! capabilities. We have adapted 
complementary techniques of microarray 
and serial analysis of gene expression 
(SAGE) for genetic profiting of highly 
purified human blood platelets. Microar- 
ray analysis using the Affymetrix HG- 
U95Av2 approximately 12 600-probe set 
maximally identified the expression, of 
2147 (range, 13%-17%) platelet-expressed 
transcripts, with approximately 22% col- 
lectively involved in metabolism and re- 
ceptor/signaling, and an overrepresenta- 
tion of genes with unassigned function 
(32%). In contrast, a modified SAGE proto- 
col using the Type IIS restriction enzyme 

Introduction 



Mmet (generating 21-base pair I bp) or 
22-bp tags) demonstrated that 89% of 
tags represented mitochondrial (mt) tran- 
scripts (enriched in 16S and 12S ribo- 
•somal RNAs), presumably related to per- 
sistent mt-transcription in the absence of 
nuclear-derived transcripts. The fre- 
quency of norvmt SAGE tags paralleled 
average difference values (relative expres- 
sion) for the most "abundant" transcripts 
as determined by microarray analysis, 
establishing the concordance of both 
techniques for platelet profiling. Quantita- 
tive reverse transcription-polymerase 
chain reaction (PCR) confirmed the high- 
est frequency of mt-derived transcripts, 
along with the mRNAs for neurogranin 
(NGN, a protein kinase C substrate) and 



the complement lysis inhibitor clusterin 
among the top 5 most abundant tran- 
scripts. For confirmatory characteriza- . 
tion, immunoblots and flow cytometric 
analyses were performed, establishing 
abundant cell-surface expression of clus- 
terin and intracellular expression of NGN. 
These observations demonstrate a strong 
correlation between high transcript abun- 
dance and protein expression, and they 
establish the validity of transcript analy- 
sis as a tool for identifying novel platelet 
proteins that may regulate normal and 
pathologic platelet (and/or megakaryo- 
cyte) functions. (Blood 2003;101:2285-2293) 
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Human blood platelets play critical roles in normal hemostatic 
processes and pathologic conditions such as thrombosis, vascular 
remodeling, inflammation, and wound repair. Generated as cytoplas- 
mic buds fjom precursor bone marrow megakaryocytes, platelets 
are anucleate and lack nuclear DNA, although they retain 
megakaryocyte- derived mRNAs. 1 - 7 Platelets contain rough endo- 
plasmic reticulum and polyribosomes, and they retain the ability 
for protein biosynthesis from cytoplasmic mRNA. 3 Quiescent 
platelets generally display minimal translations! activity, although 
newly fanned platelets such as those found in patients with 
immune thrombocytopenic purpura (ITP) synthesize various o>gran- 
ule and membrane glycoproteins (GPs). including GPJb and 
GPilb/llla (aj, b p 5 ). Furthermore, stimulation of quiescent platelets 
by agonists such as a-thrombin increases protein synthesis of 
various platelet proteins, including Bcl-3. 4 Like nucleated cells, the 
rapid translation of preexisting mRNAs may be regulated by 
iniegrin ligation to extracellular matrices. - In the case of platelets, 
the primary inteerin involved in. this process appears to be a Hb $ 3 
with cooperative signals mediated by the collagen receptor « : p f , 6,7 



Jntegrin- mediated platelet protein synthesis appears to be regulated 
at the level of translation initiation involving the eukaryotic 
initiation factor 4E (elHE). Jnstead of directly influencing e)F4E 
activity via posttranslational modifications (ie, phosphorylation), 
platelet e]F4E activity best correlates with its spatial redistribution 
to the niRNA-cnriched cytoskeleton. 3 Furthermore, because pro- 
tein translation is partially inhibited by the immunosuppressant 
rapamycin, it suggests that adhesion- and/or aggregation-induced 
outside- in- signaling function to regulate protein synthesis through 
the mTOR (mammalian target of rapamycin) pathway M * 9 

Despite the biologic importance of platelets and their intact 
protein synthetic capabilities, remarkably little is known about 
platelet mRNAs. Younger platelets contain larger amounts of 
mRNA with a greater capacity for protein synthesis, as determined 
by using fluorescent nucleic acid dyes such as thiazole orange. 11 ' 
This assay has been used as a quantitative determinant of younger 
or "reticulated" platelets (RPs). Indeed increased reticulated plate- 
lets arc typically found in patients with conditions associated with 
rapid platelet turnover such as ITP; typically RP percentages in 
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such patients approach J0% to 20% of all platelets, considerably 
higher than in healthy control subjects." Interestingly, high RPs 
have been associated with enhanced thrombotic risk when identi- 
fied in patients with thrombocytosis, 10 suggesting that quantita- 
tively increased mRNA levels may be associated with the prothrom- 
botic phenotype. Whether this is related to globally altered gene 
expression profiles or to select changes more evident during 
situations of rapid platelet turnover remains unknown. Certainly, 
technical limitations of this assay limit its utility in defining 
prolhrombotic genotypes, 1 ^ 12 and it cannot identify differentially 
expressed genes that may be causally implicated in disordered 
platelet phenotypes. 

Toward the goal of defining the molecular anatomy of the 
platelet genome, we have adapted complementary techniques of 
microarray and serial analysis of gene expression (SAGE) for 
genetic profiling of highly purified human blood platelets, Microar- 
ray technology represents a "closed" profiling strategy limited by 
the target genes imprinted onto gene chips. In contrast, SAGE is an 
"open" architectural system that can be used to identify novel genes 
and to quantify differentially expressed mRNAs.' M5 The sequence 
of each tag along with its positional location uniquely identifies the 
gene from which it is derived, and differentially expressed genes 
can be identified in a quantitative manner because the tag frequency 
reflects the mRNA level at the time of cellular harvest and analysis. 
By using both technologies, we have identified a number of 
previously uncharacterized genes that appear to be expressed in 
human platelels, while simultaneously establishing the dominant 
frequency of mitochondrial-expressed genomes comprising the 
platelet mRNA pool. These observations provide a panoramic 
overview of the platelet transcriptome,- while additionally provid- 
ing insights into the molecular pathways regulating platelet (and/or 
megakaryocyte) function in normal and pathologic conditions. 



Materials and methods 

Reagents and supplies 

Thermus aquatints (Tnq) polymerase was purchased from (Roche, Indianap- 
olis. IN). T'l DMA ligase was pui chased from Invitrogen (Carlsbad, CA), 
and restriction enzymes were fiom New England Biolabs (Beverly. MA).' 
except, for Mmeh which was obtained from the Center for Technology 
Transfer (Gdansk, Poland). All oligonucleotides were synthesized on an 
Applied Dicsysicms (Foster City, CA)3-channeI synthesizer and ore listed 
in Table I. Monoclonal antibodies used for flow cytometric analysis 
included the FITC (fluorescein isoihiocyanaie)-conjugnred anti-CD-I) 
(<*n D Po) immunoglobulin GJ (IpGl; Inimunotech, Miami, FL); phyco- 
erythrin I PE)- conjugated antiglycophorin (IgG2; Bccton Dickinson Pharm- 
inccn. San Diego. CA); and peridinin chlorophyll protein (PERCP)- 
' conjugated am i- CD J > (IgOl: Bccton Dickinson Pharmingcn). 

Platelet isolation, purification, and immunodetection 

AH human subjects provided informed consent for an IRB (Institutional 
Review Board) -apprnved protocol completed in conjunction with the 
General Clinical Research Center jt Stony Brook University Hospital. 
Peripheral blood (?0 mLt from healthy volnnleers drawn into 2 ml of 4% 
sodium citrate (0.-1% v 0 l\ol linn I concentration) was used to isolate 
erythrocytes, by difirumi;.! rent rihigat ion (lMH>g) or to isolate pure 
leukocytes by densiu ■ »i:idicni cei.tr jfngation as* previously described 
Platelets collected from healihy volunteers by aphe.esrs wei'e used within 
24 hour* ot collet i ion. A fin :,ddiiinn of 2 mM £1)7 A (ethylcnediamineict- 
laaeetic acid), aphciois-dt- mod platelets from a single donor were 
centrifuged at J40,.> for 15 minutes at ?5°C. To minimize leukocyte 
contamination, only ihv upper 9/IDof the platelet- rich plasma (PHP) was 



used for gel filtration over a BioGeJ A50M column ( J 000 mL total volume) 
equilibrated with HBMT (HEPES-buffered. modified Tyrodes buffer: 10 
mM 1JEPES (Af-2-hyujoxyeihylpjpera2ine-;Y.2-ethDnesu]fonic acid) pH 
7.4. J 50 mM NaCI, 2.5 mM KCl 0.3 mM NaH 2 P0 4 , 12 mM NaHCOj, 
0.2% bovine serum albumen [BSA], 0.J% glucose, 2 mM EDTA). 
Gel-filtered platelets (GFPs) were subsequently filtered through 2 5-jim 
nonwetiing nylon filament filler (BioDesign, Carrnel, NY) at 25°C and 
harvested by centrifugation al 1500g for JO minutes at 25°C. Platelels were 
gentry and thoroughly resuspended in 10 mL HBMT buffer and incubated 
with 120 >L murine monoclonal anti-CD45 antibody conjugated to 
magnetic microbeads (Milienyi Biotec, Bergisch Gladbach, Germany) on a 
totaling platform, for 45 minutes at 25°C. Magnetic separation columns 
were used to capture CD45* cells (leukocyte fraction) by positive selection 
(MACS I J; Milienyi Biolec). Purified platelets were concentrated by 
centrifitgalion at I500g and immediately used for total RNA isolation. 

The efficiency of platelet purification was documented at each step by 
flow cytometry." Briefly, aliquots containing 2 X 10* platelets were 
incubated with saturating concentrations of FITC-conjugated arirj-CD4I, 
PE-conjugated antiglycophorin, and PERCP-conjugatcd an!i-CD45 for 15 
minutes in the dark at 25°C, wasted wiih phosphate-buffered saline (PBS), 
and fixed in PBS/1% formalin. Samples were analyzed using a FACScan 
(fluorescence-activated cell sorter scan) flow cytometer (Becton Dickinson) 
using CELLQuest soflware designed to quantify Ihe number of CD45 + and 
glycophorin- positive events in the sample (expressed as the number of 
events per 1 00 000 CD41 + events). For some experiments, fixed platelets 
were permeabilized with 0. 1% Triton-X/PBS for 30 minutes at 25°C prior 
to the addition of primary antibodies, all as previously described. 17 

Platelet protein detection was completed by sodium dodeeyl sulfate 
(SDS)-polyacrylamide gel electrophoresis (PAGE) and immunoblot analy- 
sis as previously described, using the species-specific horseradish pcroxidase- 
conjugoted secondary antibody and enhanced chemiluminescence.' 6 Anti- 
bodies included the anticlusierin monoclonal antibody (QiiideJ, Santa Clara, 
CA; 1:1000 primary and 1:10 000 secondary) and the antineurogranin 
rabbit polyclonal antibody (Chemicon International, Temecula, CA; 1:1000 
primary and 1:10 000 secondary). 

Molecular analyses and microarray profiling 

Purified, individual cell fractions were resuspended in 1 0 mLTrizol reagent 
(Invitrogen), transferred into diethylpyiocaibonate (DEPC)-treated Corex 
(Springfield, MA) tubes, and serially purified and precipitated by using 
isopropanol essentially- as previously described. 14 Total cellular RNA was 
harvested by centrifugation at 12 50% for 20 minutes al 4°C, washed 2 
limes wtih 75% ethanol (10 mL/mbe), and resuspended in 100 pX 
DEPC-treaied water. Platelet mRNA quantitation was performed by using 
fluorescence-based real-lime PCR (polymerase chain reaction) technology 
(TaqMan Real-Time PCR; Applied Biosysiems.. Foster City CA). Oligonu- 
cleotide primer pairs were generated by using Primer3 soflware (www- 
genome. wi.mii.edu), designed to generate approximately 200-base pair 
fbp) PCR products al the same annealing temperature : and are outlined in 
Table I. Purified platelet mRNA (4 pg) was used for firsi-strand cDNA 
synthesis using oligo(dT) and Superscript II reverse transcriptase (Invitro- 
gen). For real-lime reverse transcription |RT}-PCR analysis, the RT 
reaction was equally divided among primer pairs and used in n 40-cycIe 
PCR reaction for each tntget gene by using the following cycle: 94°C for 30 
seconds, 55°C for 30 seconds, 72°C for 1 minute, and 7>C for 10 seconds 
(40 cycles total). mRNA levels were quantified by rnoniioiing real-time 
fluoiommic intensity of SVBR green I. Relative mRNA abundance was 
determined from triplicate assays performed in parallel r 0 i' each primer pair 
and was calculated by using the comparative threshold cycle number (A-Cl 
method) as previously described. " 

Gene expression profiles were completed bv usniL' the approximately 
I? 600- probe set MG-U95Av2 gene chip (AlTvmeim. Santa Clara Ca\ 
Total cellular RNA (5 |lg ) was u,cd for cDNA syndesis by "using 
Superscript Choice system (J.ife Technologies. Rockvi||e : MD) and an 
olicn(dT) primer containing the 77 polymerase recognition sequence 
(Primer Si; Table I), followed by cDNA ( >«ir ificaiion using Gf"X spin 
columns. In vitro transcription was completed in the presence of biorinyl- 
aied ribonucleotides by using a BioAnay Mii:h Yield RNA Transcripi 
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Table 1 . Oligonucleotide primers 



Gene and primer 
direction 



S1 


Oligo (dT) 


Cos se tie A 




Cassette B 


. SAGE 


S2 


SAGE • 


S3 . 


SAGE 


GP4 


Glycoprotein IIB \f) 


GP5 


Glycoprotein IIB [RJ 


GP6 


Glycoprotein IIIA |F] 


GP7 


Glycoprotein DIA JRJ 


PAR18 


PARI |F) 


PAR19 


PARI [RJ 


SR8 


16S rRNA [F] 


SR9 


16S rRNA |R) 


NADH10 


NADH2 [FJ 


NADH11 


NADH2 |R) 


THYM12 


Thymosin 04 [FJ 


THYM13 


Thymosin p4 [RJ 


CLUS14 


Clusterin|F| 


CLUSt5 


CrusterinJR). 


NRG 16 


Newogranin (F) 


NRG 17 


Neufogian'm JRJ 




T-cefl receptor 


7CR18 


8-chain [F] 




T-cell receptor 


7CR19 


p-thain jRJ 


CD45?0 


CD45 |F| 


CD-1521 


CD45 |R| 



Sequence (5' - 3') 



5'- TTTG G A TTTG CTGGTCG AGTAC AA CT AG GCTT Aft T CC GA CATG - 3 ' 
3*- * CCT AAACGACCAGCTCATGTTGATCCGAATAAGGCTp- 5 ' 
5 * - pTTCATGGCWAGACGTCCGCCACTAGTGTCGCAACTGACTA • - 3 ' 
3 ' - NNAAGTACCGCCTCTG CAGG CGGTGATCACAGCGTTG A CTGAT - 5 ' 



Nucleotide 
Position 



5' - Bn - G GCCAGTGAATTGTAATACGACTCA CTATAGGGAGGCGG - (dT) „-3* 



5'-Bn-GGATTTGCTGGTCGAGTACA-3* 




5 ' - Bn - TAGTCAGGTGCGACACTAGTGCC- 3 ' . 




5 ' - AGGGCTTTGAGAGACTCATCTGTA - 3 ' 


2094-21 17 


5 ' - ACAATCTTGCTGTTTGGATTCTG - 3 ' 


, 2301-2279 


S' - TATAAAGAGGCCACGTCTACCTTC- 3 ' 


2335-2358 


5 ' - CACTTCCACATACTGACATTCTCC- 3 ' 


2532-2509 


5 ' - AATGTCAGTTCTGATATGGAAGCA - 3 ' 


2585-2608 


5 ' - CCCAAATGTTCAAACTTCTTTAGC- 3 ' 


2776-2753 


5 ' - TGCAAAGGTAGCATAATCACTTGT- 3 ' 


2586-2609 


5 ' - GTTTAGGACCTGTGGGTTTGTTAG - 3 ' 


2785-2762 


5 ' - CTAGCCCCCATCTCAAATCftTATAC- 3 ' 


487W898 


5 ' - AATGGTTATGTTAGGGTTGTACGG- 3 ' 


5075-5052 


5 ; - AAGACAGAGACGCAAGAGAAAAAT- 3 * 


135-158 


S ' - GCAGCACAGTCATTTAAACTTGAT- 3 ' 


336-313 • 


5 ' - CCAACAGAATTCATACGAGAAGG- 3 ' 


■ 1006-1028 


S ' - CGTTATATTTCCTGGTCAACCTCT- 3 ' 


1222-1199 


S * - GCCCTTTTAGTTAGTTCTGCAGTC- 3 ' 


1351-1374 


5 ' - TTTTCTTTAAGTGAGTGTGCTTGG- 3 ' 


1567-1544 


5 ' - CCACAA CT ATGTTTTGGTATCGT- 3 ' 


131-153 


5 ' - CTAGCACTGCAGATGTAGAAGCT- 3 ' 


332-310 


5 ' - GCTCAGAATGGACAAGTA - 3 * 


3771-3788 


5 * - CACACCCATACACACATACA - 3 ' 


428CM261 



IF) .novates forward (sense) strand; |R], reverse (onlisense) strand; Bn r bblin; p. a phosphorylaled 5' end (cassettes Aond B) 
arrows, corresponding sequence lor S2 and S3 within cassettes A and B. respectively; bold, the Mmet site; and N A C T or G 
acccss-on numbers: glycoprotein IIB (J02764), glycoprotein IIIA (M35999). PARI (M62424) 16S rRNA and NADH2 (NC 001807} ' 
neoro9ranin(X99076).TCRp-chain{AF043182),CD45(YO0638). 

•Indicates an amino-modified 3' end in both cassettes; — , not applicable. 



underlining, W/alfl sites in cassettes A and B; 
nucleotide position based on the following 
thymosin P4 (M17733), clusterin (M25915), 



Labeling Kit (Enzo Diagnostics.. Fnrmingdale, NY), and, after metal- 
induccd fragmentation, 15 p.g" biotinylated cRNA was hybridized to the 
MG-U°5Av? oligonucleotide probe array for 16 hours at 45*C. After 
washing, the cRNA was detected with streptavidin-phycoeryihrin (Molecu- 
lar Probes, Eugene, OR) and analysis was completed by using a Hewlett- 
Packard Gene Array Scanner (Aflymcirix). The average difference value 
(AD) for each probe set was quantified using MAS 4.0) software 
lA/Tymcirw), calculated as an average of fluorescence differences for 
perfectly matched versus single-nucleotide mismatched 25-mcr oligonucle- 
otides (16 to 20 oligonucleotide pairs per probe set). The software is 
designed to exclude "positive calls" in the presence of high average 
differences with associated high mismatch intensities. 

SAGE profiles 

IMau-lct SAGF. libraries were generated essentially as previously de- 
scribed." modified as outlined in Figure J for the use of Mme) as ihc 
lagging cnzvme.'M'his type MS restriction enzyme cleaves 20 of IS bp past 
its mmpalmdromic (TCCRAC) recognitron sequence, thereby generating 
longer tags I? I- or 22- met) than those obtained using BsmTi its rhc standard 
tagging enryrne (13- 14 bp tags). These longer J/md- Generated tags 
potentially provide for more definitive "lag-io-gene" ideniifiraiion and aie 
particularly useful in characterizing expression pmierm in the absence of 
complete genomic sequence data (comprehensive meihods detailed in Dunn 
ct Briefly, poly (A) rnKNA was isolated from 10 pg total platelet RNA 
using the ohpo-idT) Si primer conjugated to magnetic beads (Dynal 
Biotech. Lake Success, NY), followed by cDNA synthesis using Super- 
script II icver;.e transcriptase f Invjirogen). The cDNA was then digested 



with the restriction enzyme NtaU] (anchoring enzyme), ligated to cassette A 
using 74 DNA ligase, and. after the beads were extensively washed, the 
cDNA was digested with Mme) to release the tags from the beads. After 
purification, tags were ligated to degenerate cassette B linjters (specifically 




Fiymp t. Schema outlining the modified SAGE piotoc.ol used in plate id 
nn;ityies. The final tags o,c fl.tnr-.ed by the Mailt [r>nchorif»g enzyme) CA1G 
:.t'nuence, thereby providing lag- to- gene identification when exported to a relational 
database (refer to "FJiointorni.-iric onoiyses" and Table I tor details). 
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designed lo anneal to ibe nonuniform Mmel overhangs), and PCR-ampIified 
using biotinylated primers S2 and S3 for 30 cycles (95*C for 30 seconds; 
. 58°C for 30 seconds; 72°C for 30 seconds) using Platinum Taq DNA 
polymerase (Gibco BRL). A fraction (20%) of the pooled PCR products 
were then subjected to one round of linear amplification using primer pair 
S2/S3, followed by a second round of 25 amplifications using primer S2 
alone (95°C for 30 seconds.. 58°C for 30 seconds, 72°C for 30 seconds). 
Primer S3 was subsequently added for one cycle (95°C for 2.5 minutes, 
58°C for 30 seconds, 72°C for 5 minutes); the latter steps were collectively 
adapted to exclude hetcroduplex formation.' 8 Unincorporated primers were 
removed by incubation with 200 U Escherichia coli exonuclease I for 60 
minutes at 37*C PCR products were then pooled and digested with N!a\W 
to release tags, and biotinylated linker arms were cleared using streptavidin- 
coated imrmmoafttntty magnetic beads (Dynal Biotech). Tags were con- 
catamerized using 5 U/pX T4 DNA ligase, and products more than 100 bp 
were isolated by size- fractionation in low- melting agarose gels. The DNA 
was purified by GFX spin columns, and the concatamers were cloned into 
the Sph\ site of pZero (Invitrogen). After transformation into E coli TOPJ0 
cells, recombinant clones were isolated and sequenced in 96-welI microliter 
plates using an ABI 377 sequencer and ABI Prism BigDye terminator 
chemistry (Perkio-E)mer Applied Biosy stems. Branch burg, NT). 

Bioinformatic analyses 

Functional grouping of genes determined to be present by Asymetrix MAS 
4.01 sofhvare was performed using a dChip program linked lo the National 
Center for Biotechnology LocusLink, which is an annotated reference 
database for genes and their postulated functions. 2 * Of the approximately 
J 2 600-probe sets represented on the Affymelrix HO-U95Av2 Gene chip, 
functional annotations exist for approximately 81 00 with the remainder 
categorized as unknown. Microarray data were visualized and analyzed 
using BRB- Array Tools software (Version 2.1), kindly developed and 
provided by Dr Richard Simon and Amy Peng {Jinus.nci.nih.gov/BRB- 
Array Tools.html). A logarithmic (base 2) transformation was applied lo the 
average difference values for individual data sets for del enni nation of 
microarray concordances. Discordancy was defined as a 2- log difference in 
the maximum log intensities between individual experiments. " 

SAGE tags were extracted by using in-house SAGE software uniquely 
modified to identify Mme\ tags. The software ensures that only unambigu- 
ous 21- lo 22-bp tag sequences ate extracted for transcript profiling. Tags 
with ambiguities (Ns), lengths other than 21 or 22 bp, or with ambiguous 
orientations were extracted to separate fries for manual editing or further 
examination. Finalized data were exported to a relational database for tag 
quantification and genetic idem ificai ion. :o 



Results 

Platelet purification 

To ensure that the RNA profiles accurately represented those of 
circulating blood platelets, a number of complementary methods 
weie implemented to remove contaminating nucleated leukocytes. 
Purification methods incorporating gel filtration, a 5-p.m leukocyte 
reduction filter, and magnetic CW3 immtmodepletion allowed for 
ihc cumulative enrichment of highly purified platelets. The efficacy 
of tins purification method was initially established by using 
peripheral blood platelet-rich plasma as the starting material. The 
final product contained no more than 3 to 5 leukocytes per I X I0 5 
platelets as determined by paiallcl flow cytometric analysis, 
representing an approximate Wfold reduction of nucleated 
leukocytes. These results corrchied well with molecular evidence 
loi leukocyte depletion as determined by RT-PCR using both CD-15 
and T-cell receptor fi- chain (TCRp) primers (see Figure 2). 
Because the tola! RNA yield from peripheral blood platelets was 
insufficient for micros ray studies, we adapted the protocol lo 
platelet apberesis donots with nearly identical final purity (Figure 



A B 




Figure 2. Determination of plate ret purity. (A) Total cellular RNA (1.8 ?g) from 
platetet-rich plasma (PRP) of purified platelets from a single apheresis donor were 
analy2cd by RT-PCR (35 cycles) using oligonucleotide primers specific lor glycopro- 
tein lib (GPIlb), T<ell receptor p^-chain fT CRp). or CD45; tO pi of the 50 iiL reactions 
were analyzed by ethidiunvstained agarose gel electrophoresis. Minimal to no 7CR3 
gene product was visually evident only in PRP. Size markers corresponding to 
Araelll-rostncled <f>X174 DNA are shown. (B) Real-time RT-PCR was completed by 
using 1.8 jig total RNA and TCRp*-specific oligonucleotide primers optimized lor 
quantitative analysis by real-time PCR. ,B On the basis of parallel determinations 
using RNA isolated from known amounts of purified leukocyte standards, the 
leukocyte-depletion protocol represents an approximate 2.5-log purification from the 
starting PRP. Results are representative of one complete set of experiments repeated 
on 2 separate occasions, and data points represent the mean from triplicate wefts, 
with standard errors of the mean fSEM) (ess lhan 1% (not shown). 

2). The platelet recovery was nearly 65% of the starting material, 
yielding approximately 2.3 X 10" platelets from an initial aphere- 
sis pack containing approximately 3.6 X 10" platelets. The bulk of 
the losses occurred during the initial centrifugal ion and filtration 
steps. The purification protocol was less effective at removing 
erythrocytes, although there were less than 50 glycophorin-positive 
cells per J X JO 5 platelets after the final purification step. Nonethe- 
less, these cells represent unlikely sources for contaminating 
cellular RNA (see "Cellular microarray analysis" below). 

Cellular microarray analysis 

The purified platelet RNA was sufficient for microarray studies and 
was used for cRNA generation and hybridization lo Ihe Asymetrix 
HO-U95Av2 GeneChip. The anatomic profile of platelet RNAs 
from 3 healthy male donors was determined by using AlTymetrix 
software. Of the 12 599 probe sets imprinted onto the chip, a 
maximum of 2J47 (J7%) transcripts were computationally identi- 
fied as "present" by the Affymerrix software, 152 (1.2%) were 
equivocal, arid nearly 82% were absent. As a fraction of the total 
genes present on the chip, the percentage of platelet-expressed 
genes (J5%-17%) was generally lower than that obtained from 
other human cell types in which 30% to 50% of genes are present as, 
determined by Affymerrix software (j. Schwedes, personal commu- 
nication, May 2002). The "limited number" of platelet-expressed 
transcripts presumably reflects the lack of ongoing gene transcrip- 
tion in the anuclcate platelet. Because less than 1% of circulating 
red blood cells contain residual RNA, it is unlikely that any of these 
transcripts are erythrocyte derived, although this was formally 
addressed by isolating total cellular RNA from' 20 mL of whole 
blood (corresponding to an -3-log fold excess of erythrocytes lhan 
thai identified itr our final sample). I hc total cellular yield of RNA 
from this starting material was approximately 250 ng. suggesting 
that loss than I ne erythrocyte- derived RNA was present in the 
purified platelet preparations. Despite this, however both a- and 
(3- glob in tt adscripts— along with both the ferritin heavy and light 
chains — were identified as abundant transcripts ( fable 2). Al- 
though the most parsimonious explanation would be residual coni;uni- 
naiing reticulocytes, this is not supported by our erythrocyte contami- 
nation estimates, and their significance remains unresolved. 

As a means of better dissecting the molecular anatomy of the 
platelet, expressed genes were gronprd on the basis of assigned 
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7ab!e 2. Top 50 human platelet-expressed genes 



Accession no. 


Gene symbol 


AD values, range* 


Gene transcript 


Leukocyte expression^ 


Ml 7733 


TMSB4X 


140 142-307 852 


Thymosin (14 mRNA, complete cds 


+ 


X99076 


NRGN 


101 510-148 279 


Neurogronin gene 


+ 


M25079 


HBB 


40 839-229 566 


3-gtobin mRNA. complete cds 


+ 


M25915 


ClU 


84 720-140 246 


Complement cytolysis inhibitor (cluslerm) complete cds 




J04755 


FTHP1 


82 980-148 621 


Ferritin H processed pseudogeno. complete cds 




D78361 


OA21 


73 098-118 140 


mRNA for ornithine decarboxylase antizyme 




X04409 


GNAS 


77 761-94 781 


mRNA ror coupling protein G(s) o-subunrl (alpha-Si) 




. M25897 


PF4 


62 811-126 908 


Platelet factor 4 mRNA. complete cds 




AB021288 


B2M 


61 689- JOS 921 


p2-m)uog!obuIin 




X00351 


ACTB 


25 143-73 775 


mRNA lor p-actrn 




D21261 


7AGLN2 


76 687-101 931 


mRNAforKIAA0120gene . 




AL031670 


FTLL1 


69 865-99 966 


Ferritin, fight polypeptide t 


+ 


U59632 


' GP11B 


41404-110 328 


Platelet glycoprotein top* chain mRNA 


- 


M2112* 


CCL5 


47 308-106 "399 


T-celr-specific protein (RANTES) mRNA, complete cds 




X13710 


GPX1 


4 1 318-96 878 


Unspliced mRNA for gtutathtono peroxidase 


- 


J00153 


HBA1 


21 326-144 201 


Alpha gtobin gene cluster on chromosome 16 


+ 


M229I9 


MY16 


46 337-106 833 


Ncwnuscle/srnooth muscle alkali myosin light chain gene 


+ 


L20941 


FTH1 


52 787-74 763 


Ferritin heavy chain mRNA, complete cds 


- 


J03040 


SPARC 


51 156-74 261 


SPARC/osteonecitn mRNA, complete cds 


'- 


X56009 


GNAS 


45 543-72 096 


GSA mRNA lor a subunit of GsGTP binding protein ' 


.- 


X5B536 


HIA 


31 183-82 613 


mRNA for major HLA dass 1 locus C heavy chain 


. + 


M54D95 


PPBP 


46 571-67 169 


Connective tissue activation peptide III mRNA 


- 


U34995 


GAPD 


35 095-70 250 


Normal keratinocyte substraction library mRNA. clone H2?a 


+ 


L40399 


MLM3 


32 107-73 364 


Clone 2apt12 (mutL protein homolog 3) mRNA 


_ 


X77548 


NCOA4 


31 452-61 036 


cDNA tor RFG (RETprolo-oncogene RET/PTC3) 


_ 


U905S1 


H2AFL 


35 086-51 892 


Histone 2A-like protein (H2A/I) mRNA 


_ 


Ml 1353 


H3F3A 


31 614-55 813 


H3.3 histone class C mRNA 


- 


212962 


RPL41 


36 003-54 853 


■ noRNA tor homologue to yeasl ribosomal protein L41 


-t 


X06956 


TUBA1 


20988-61798 


HALPNA 44 gene lor a -tubulin 




AB02895O 


TLN1 


24 571-58 611 


mRNA for KIAA 1027 protein 


- 


Y12711 


PGRMC1 


33 680-43 174 


mRNA for putative progesterone binding protein 




Ml 6279 


MIC? 


30 894-48 166 


Integrated membrane protein (MIC?) mRNA 


- 


D78577 


WVHAH 


. .24 785-50 437 


Brain 14-3-3 protein p-chain 




AF 070505 


10P3B 


20 027-67 945 


Clone 24675, unknown cDNA 




AA524802 


Unknown 


23 846-39 481 


CDNA. IMAGE clone 954213 




AB009010 


UBC 


28 745-38 389 


mRNA tor polyubiquitin UbC 


+ 


X57985 


H2AFO 


21678-52 108 


Genes lor hislones H2B.1 and H2A 




X54304 


MIXB 


25 733-34 109 


mRNA lor myosin regulatory fight chain 




M 14539 


F13A1 


23 691-48 474 


Factor Xtll subunil o -polypeptide mRNA, 3' end 




AI540958 


Unknown 


24 872-41 118 


cDNA. PEC 1.?_ 15_HOt.r 5' end /clon 




AL050396 


FLNA 


13 634-55 235 


cDNA DKFZp 5B6K1720 




X5684t 


HLA-E 


12 890-49 327 


Nonctassical MHC class 1 antigen gene 




M26252 


PKM2 


15 450-47 70G 


7CB (cytosolic thyroid hormone- binding protein) 




M 14630 


P7MA 


19 314-45 08B- 


Prolhyrnosin alpha mRNA 




AF 045229 


RGStO 


19 156-34 243 


Regulator ot G protein signaling 10 mRNA 




AA477898 


Unknown 


16 863-44 756 


cDNA. Z&34t08.ft5' end 




X95104 


ft 1 


15 216-37 456 


mRNA to* nonmuscle type cofitro 




M34480 


11 GA2B 


8 627-45 495 


Ptaletet gtycoprolem lib (GPttb) mRNA 




283738 


H?BFC 


10 001-31 306 


HH?B/e gene 




119779 


H2AFO 


17 319-38 951 


Histone H2A.2 mRNA. complete cds 





•Gene expression quant ideations were calculated as the overage difference (AD) value (matched versus mismatched oligonucleotides) tor each probe set using Affymetria 
Ger>eC»iip software, version 4.01. The range of values from 3 distinct platelet rnicr oar rays is, shown: the normalization value tor all microarray analyses was 250. 

(Transcripts ore Kink ordered (highest to lowest) using BRB-Airay Toots soltware hy log intensities of AD values obtained from 3 different healthy donors; 33 of the top 40 
transcripts wet* listed among the lop 50 in all 3 microarray sets. 

Heuhocyle opiession was delennmedby microarray analysis using unified peripheral Mood leukocytes, followed by construction ot rank- intensity plots for comparison to 
platelet top 50 Transcripts/ 0 "top leukocyte-derived transcripts identified within the ranked lop 50 platelet transcripts are depicted by a ( t ) present, or (- ) absent. 

cds indicates coding sequence 



gene annotations, and this analysis was used to provide 3 pan- 
oramic definition of. ihe plrtlelel transcriptome.-Of the genes t ' i;)1 
could be cataloged within assigned "clusters/' those involved in 
metabolism (1J%) and tecept or/signaling (11%) represented the 
largest croups. Also evident in these analyses is the relatively large 
peicentnge of genes involved in functions unrelated to these key 
groups lie. miscellaneous. 25%). and the overrepresentation of 
genes with unknown f miction (3?%) as annotated by Affyinctjix 



and RcfSeq databases.-' These results identify a vast array (nearly 
one half) of platelet genes (and gene products) that presumably 
have impoitant, but poorly characterized functions, in platelet 
and 'or mega k myocyte biology. 

Although microaitay analysis is not truly quantitative, rank- 
oidering using the mean log- intensities fiom 3 independent microar- 
ray analyses allowed for the categorization of the top platelet 
transcripts (Table 2). Computational analyses demonstrated that 
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only 10 of the top 100 genes were discordant among the 3 platelet 
microarrays, although 71 of J 00 genes were discordant between 
platelet and leukocyte arrays. An inventory of the top 50 platelet 
genes is listed in Table 2, which also delineates those found to be 
highly expressed in peripheral blood leukocytes by parallel microar- 
ray experiments with this purified cellular fraction (data not 
shown). Further analysis of these cell subsets demonstrated that 
approximately 25% (n = 547) of the total platelet transcripts were 
platelet restricted. Furthermore, only 10 of the 50 most highly 
expressed genes were found to overlap, confirming the distinct 
cellular profiles of each transcriptome. Of the 12 overlap genes, 3 
corresponded to globin or ferritin chains (again suggesting the 
presence of contaminating reticulocytes in both purified fractions), 
and another 4 were involved in actin cytoskeletal reorganization 
and human leukocyte antigen (HLA) expression, gene products that 
regulate critical functions in both cell types. Given the importance 
of cytoskeletal reorganization in downstream platelet activation 
events, it is not unexpected that components of the actin machinery 
system would demonstrate prominent transcript expression. Previ- 
ous estimates suggest that 20% to 30% of the total platelet 
proteome is comprised of actin with other components such as 
ac tin- binding protein, mysosin, and talin accounting for an addi- 
tional 2% lo 5% of the total protein.'- 72 The mRNAs encoding the 
actin-related machinery are overrepresented in. our microarray 
analysis, with 8 such transcripts found among the 50 highest 
pi a tele (-expressed genes. Interestingly thymosin p4 demonstrated 
the highest expression pattern. In unstimulated platelets, 30% to 
40% of actin is polymerized as F-aclm," whereas the balance of 
nctin monomers (G- act in) are polymerization inhibited by seques- 
tering proteins such as proftlin (100 pM) and thymosin 34 (600 
p-M)." The high thymosin (34 transcript expression not only 
correlates with its known abundance in platelets but nlso supports 
the importance of aclin inhibitory proteins in maintaining the 
nonstimulated state of circulating platelets. 

Platelet SAGE analyses 

Although these initial studies identified the distribution and relative 
expression patterns of the genes within the Asymetrix data set. 
they do not allow for analyses of genes that arc unrepresenied by 
these oligonucleotide chips. Unlike closed microarray profiling 
strategies, SAGE is an open architectural system that is ideally 
suited for novel gene and pathway identification. Accordingly, the 
platelet RNA used for microarray studies was used for platelet 
SAGE A total of 2033 tags were initially cataloged, of which 1800 
(89%) corresponded to mitochondrial-dcrived genes. These results 
were quite different from those obtained by microarray analyses, 
but the discrepancy can be resolved by the nonrcpresenlation of the 
mitochondrial genome on the gene chip. The mitochondrial ge- 
nome is a compact approximately l6.fi-kiloh.isc (kb) sequence 
encoding 13 genes and 2 ribosomal summits.- 4 Primary mitochon- 
drial transcripts are polycisironic and typically contain premature 
termination or unpredictable splice sites, resulting in multiple 
_ poly a deny rated transcripts from individual genes. - 4 - :jl Indeed, the 
overall distribution of plate let- derived mitochondrial SAGE lacs is 
quite similar to that found in muscle. 7 - All 13 genes containing 
Kin)}] sites were detected, whereas neither of the non-AValll- 
comaming genes were identified (nicotinamide adenine dimiclco- 
tidc [NADU] dehydrogenase summit 4L and adenosine triphos- 
phatase |ATPase] S). Most of the tags were from the I6S and I2S 
ribosomal RNAs— which collectively accounted for 68% of the 
total mitochondrial lags — wirh the fewest tags represented by 
NADU dehydrogenase subnnits 3.5. 6. and cytochrome c oxidase I 
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Figure 3. Schema of the mitochondrial genome with SAGE tag distributions 
(only tags with identical matches are displayed). The abundance of the SAGE 
tags (n = t800) at individual Wtelll sites (arrows) within the mitochondrial fteavy 
strand is shown on the bottom, whereas those tags corresponding lo the mitochon- 
drial tight strand are delineated above the arrows (the presence ol an unaccompanied 
airow implies no SAGE tags at that Mai II site). The gene products of ml-DNA 
(RefSeq accession no. NC_001807) are delineated by the open rectangles, whereas 
stippled boxes represent tRNA genes and control rogjons (the single tag represented 
by. the {') refers lo milochondrial transfer RNA-serine). Mote that NADH6 is encoded 
by the tight strand and that there are no Affatll sites within the ATPase8 gene segment. 
CO{nJ, cytochrome c oxidase subunit; Cyl b. cytochrome b. 

(Figure 3). Thc.NADH dehydrogenase subunit 6 RNA is the only 
mRNA encoded by the light (L) strand of mitochondrial DNA and 
. was the least abundantly delected transcript. 

The unusually high preponderance of im'tochondrial-derived 
genes is not inconsistent with the known enrichment of these 
genomes in human platelets, 1 - 54 and presumably reflects persistent 
transcription from the mitochondrial (mt) genome in the absence of 
nuclear- derived transcripts. This overTepresentation of mlDNA in 
platelets is considerably greater than that of its closest cell type 
(skeletal muscle), in which mt genomes represent approximately 
20% to 25% of all SAGE tags. 25 Interestingly, the energy metabo- 
lism of platelets is nol dissimilar from that of skeletal muscle, bolh 
cell types actively using glycolysis and large amounts of glycogen 
for ATP generation. 26 Like muscle, platelets are metabolically 
adapted to rapidly expend large amounts of energy required for 
aggregation, granule release, and clot retraction. Similar lo the 
situation in all eukaryolic cells, platelet mitochondria represent the 
primary source of ATP, which is generated from oxidative phosphor- 
ylalion reactions occurring wilhin these organelles. Mitochondria 
are also responsible for most of the toxic reactive oxygen species 
generated as by-products of oxidative phosphorylation and are 
central regulators of the apoptotic process in other cellular types. 
The mlDNA encodes polypeptides found within 4 of the 5 
multifunctional complexes that regulate oxidative phosphorylation 
within the platelet milochondria. 27 Whether the continued genera- 
lion of these polypeptides has a role in platelet energy metabolism 
and/or Ihe.apoptotic mechanisms regulating platelet survival re- 
mains speculative, a I (hough not inconsistent with our observations. 

Comparative analysis of SAGE and microarray 
transcript abundance 

Complete SAGE libraries require the sequencing of up to 30 000 
lays for an exhaustive cataloging of individual mRNAs, especially 
those with limited copy number s. ,3 - 2? Given the preponderance of 
ml- derived transcripts, comparable sampling would have required ' 
sequence analysis of nearly 300 000 SAGE lags, an inordinate 
number for comprehensive analysis of the platelet transenpiome. 
hor platelets, alternative methodologies incorporating subtractive 
SAGE will be required for more comprehensive Hansen pi profil- 
ing.'* Our initial sampling of nonrnitochondrial genes remains 
informative, however, and entirely consistent with, the results of 
platelet microarray studies. As shown in Table 3, SAGE lags for the 
genes encoding thymosin (M, microglobulin, nemogranin. and 
the platelet glycoprotein lb3 polypeptide were among the most 
frequently identified platelet genes, similar to the rank -ordered 
results deter mined by nucroajray analysis. To formally confirm the 
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Table 3. SAGE-identi Tied nonmitochondrial tags 


Frequency 


CATG + SAGE lags* 


Accession do. t 


Gene 


Microarrayt 


26 


GTT GTGGTTAAT CTGGT 


NM_00404B.1 


p2-microg1obu!in (B2M), mRNA 


PPP 


21 


TTGGTGAAGGAAGAAGT 


NM_021 109.1 


Thymosin ^4 ; X chromosome (7MSB4X). mRNA 


P 


B 


AGCTCCGCAGCCAGGTC 


NM_002620,t 


P!ale)el (actor 4 VDrianl 1 (PF4VJ); mRNA ' 


P 


8 


AGCTCCGCAGCCGGGTT 


NM_002619.1 


Platelet (actor 4 (PF4), mRNA 


P 


7 


T GTATAAAGACAACCTC 


NM_0O2704.1 


Proplatelet basic protein ({MhromboglobuKn) 


Pp 


5 


GGGCACAATGCGGTCCA 


NM_0O0407.t 


Glycoprotein Ibp polypeptide, mRNA 


P 


3 


AGGTAATAAAAGGTAAT 


NM_003512.1 


. H2A histone family, member L (M2AFL), mRNA 


P 


3 


AGT GGCAAG7 AAAT GGC 


NMJ)21914:2 


Cofilin 2 (muscle) (CFL2). mRNA 


N/A . 


.3 


TGACTGTGCTGGGTTGG 


NM_006176.1 


Neurogranin {protein kinase C substrate. RC3) mRNA 


P 


3 


TT GGGGTTT CCTTT ACC 


NM_ 00203?. 1 


. Ferritin, heavy polypeptide 1 (FTH1). mRNA 


P 


2 


CCCTTGTGACTACCTAT 


NM_025158.1 


Hypothetical protein FU22251 (FLJ22251), mRNA 


N/A 


2 


C C7 GT AACC CCAGCT AC 


' NM_032779.t ' 


Hypothetical protein FLJ14397 (FL J 14397). mRNA 


N/A 


2 


CTTGTAGTCCCAGCTAC 


NM.01796Z1 


Hypothetical prolein FLJ20825 (FLJ20825), mRNA 


N/A 



TJnique tags identified more lhan once. 

j Refers to the RefSeq accession no. 71 Note lhat this number does not necessarily conespond to the accession no. provided by Affymetrix software annotations {Table 1). 

^Presence (P) or absence {A) is based on resulls from 3 distinct platelet microarray experiments. Capitalized "P." designates a gene that is in the lop 50 on all 3 microarray 
experiments, whereas small "p" designates those transcripts nol in the lop 50. Two of the genes (p2-rrncroglobulin and fi-thromboglobulin).are represented by 3 and 2 probe 
sets, respectively, on the HG-U95Av2 gene chip; for 02-M, all 3 probe sets were in the lop 50 genes, whereas for thymosin p4 1 of 2 was in the top 50 (or all experiments (the 
other probe set was in the top 75 for all experiments). N/A indicates oligonucleotide not present on Affymetrtx HG-U95Av2 gene chip. 

sented here. Overall, a total of 126 unique tags were identified, the 
majority of which (94) were represented only once. Of the total 
unique tags, nearly one half represented novel genes not present on 
the Asymetrix U95Av2 GeneChip. Of the genes with unique tags 
identified more than once, there was excellent concordance with 
microarray expression analysis, with nearly all of the SAGE tags in 
Table 3 corresponding to platelet top 75 microarray transcripts. The 
platelet factor (PF) 4 variant represents a single aberration because 
this was rank-ordered approximately 350 by microarray, although, 
its SAGE lag frequency was identical to that of the predominant 
PF4 transcript. The lack of extensive nonmitochondrial SAGE 
sampling precludes any further extrapolations from this apparent 
aberration. Of note, a subset of these tags had long poly(A) tracts; 
although Ihcy all corresponded to genes identified in the RefSeq 
database. 7 ' We cannot exclude the possibility of a SAGE artifact 
for this small subset of tags (~2%, representing 46 of 2033 tags), 
although the authenticity of the vast majority of tags (—98%) 
clearly validates the methodology. These tags are most likely 
explained by (he unique biology of the platelet (ie, mRNA decay in 
the absence of de novo transcription) or to mRNA degradation 
occurring during ihe extensive purification methods. In summary, 
even with a remarkably limited sampling, the power of this 
approach in gene identification of relatively abundant and less 
abundant transcripts is evident. It is clean however, given the 
unique molecular anatomy of the platelet (ie. abundance of 
mitochondrial transcripts), thai SAGE adaptations will be required 
for more comprehensive genetic profiling. 79 

Protein immuhoanalysis of platelet clusterin and newoyrnnin 

Although'mosi of the "most abundant" transcripts would conform 
to a priori predictions for platelet- expressed mRNAs, a number of 
transcripts were identified thai had been poorly characterized in 
human platelets. To further establish the authenticity of highly 
expressed transcripts suc h as clusterin and neurogranin, confirma- 
tory protein analyses were completed. As shown in Figure 5, both 
proteins weie clearly detected in purified platelet lysates: further- 
more, their eel hilar platelet distributions conformed to those 
predicted based on previously proposed functions. Note for ex- 
ample lhat clusterin — functionally characterized as a complement 
lysis inhibitor able tn Mock the terminal complement i end ion— is 
primarily expressed on the extratellulat platelet membrane. 3 " 



results independently obtained by SAGE and microarray analysis, 
quantitative RT-PCR was completed by using oligonucleotide 
primers specific for 2 abundant mitochondrial transcripts, 16S 
rRNA and NADH2 thymosin p4 (high- abundance by microarray 
and SAGE), 2 incompletely characterized high-abundance tran- 
scripts (neurogranin and clusterin; see "Protein immunoanalysis of 
platelet clusterin and neurogranin"), a low- abundant transcript 
(T-cell receptor 0- polypeptide), and the genes encoding proteins 
with well-established quantitative determinations (ie, glycoprotein 
a HBp3 [-~ 50 000 receptors/platelet]; protease-acuvated receptor- 1 
(PARI) [—1800 receplors/plaletel]).' As shown in Figure 4, these 
analyses reveal excellent concordance between SAGE and microar- 
ray studies, demonstrating the predominant frequency of the 
.mitochondrial- derived I6S rRNA/NADH2 transcripts, with incre- 
mentally lower expression of other transcripts as initially demon- 
strated by microarray (16S > NAD} 1 2 > thymosin f£4 > neurogra- 
nin > clusterin > i>,, B p., > PARI > TCR0). 

Given the small number of nonmitochondrial SAGE tags 
available for analysis (n = 233), limited conclusions can be drawn 
using traditional (nonsublraciion) platelet SAGE libraries as prc- 




' Figure 4. Quantitative rcaMirnc Rt-PCR nnaiysis of plaletft transcripts. Reol- 
lime RT-PCR was completed by t/sinri pmrtioiJ ptolelel RNA nntJ oligonucleotide 
primei pairs specifrtolly oVsigned using PrimeO software to generate srmiJaily-si2ed 
(— ?00-bp) PCR products. ot,'(irr»;eiJ lo the some annexing temperature. In graph. 
(D) represents Kb. (»/ repiescnlj, II la. |.' ; ) represents PARI, {k) represents T6S 
iRNA. (T) represents NADU?. | ▼) rep< events thymosin, |0) represents clusterin. (♦) 
represents neurooranin, and {•) represents 1 CP0 Curves are representative ot one 
complete set ol e>perimtnls (repeated hvici>). and line plots reflect average 
rieleiminalinni tiom 3 v.elK pwtornii-d in pniiillcl with GEM less lhan l"/i, lur stl 
data points. 
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Non-perroeabilized 






Figure 5. Immunoanalysis of ptatetet neurogranin and 
clustering (A-O) Get-fittered ptetefete were either feed in 
3,7% formaktehytto Crxx^pemioabiTiTixf) ox fixed with rjerme- 
abiSzation in the presence of 0. 1% Trrton-X; foDowred by Row 
cytometric analysis using antiduslerin. anti-tlb/llb. a anSnoi*: 
rogranin antibodies and the FIT ^conjugated speoes- 
spectfic secondary antibody fin^ C, the WCwpgated 
antirabbit and antimouse controls are" essentially superim- 
posed). (E F) Ten micrograms of solubifized HepG2 cefe 
(hepatocyte ceB fine), human brain, or purified pteteteltysates 
were analyzed by SDSfAGE," and immunoblot analysis 
were compteted by using i:10TO dI»utions of e'rthw antbcuro- 
grarm (18% SDS^PAGE) or anfciiisterin (8% SDS^AGE) 
antibodies. The antidustorin antibody recognEed 2 ptolelet 
immunoreactive species under. shorter exposure. Although 
the relative neorogrorw) and dusterin protein abundances 
are suboptimofly quantified by these anoJysos. ptatolet dus- 
terin appears to demonstrate considerable expression when 
compared with that previously identified in hepotocytes. 3 ! 



Given the importance of complement activation in platelet destruc- 
tion, the prominent expression of cell-surface dusterin might 
suggest a role for this protein in normal and pathologic events 
regulating platelet survival Interestingly, a clusterin-deficient 
knockout mouse has been generated that demonstrates enhanced 
cardiac dysfunction in a model of autoimmune myocarditis. 31 
Although these mice apparently have normal baseline heinograms 
(B- Aronow, personal communication, October 2002), it remains 
unestablished if they would be predisposed to immune-type 
thrombocytopenia in systemic models of autoimmunity. 

Similarly, the gene encoding an intracellular effector protein 
that may have key roles in downstream platelet activation events 
has now been demonstrated to have abundant transcript expression 
in human platelets. Neurogranin is a highly expressed platelet 
transcript with its gene product demonstrating a primarily intracel- 
lular pattern of distribution. Neurogranin is generally described as a 
brain- specific. Ot 2 * -sensitive calmodulin-binding phosphoprotein 
that is preferentially expressed in neuronal cell bodies and den- 
drites."- 13 It is a specific protein kinase C (PKC) substrate that can 
also be modified by nitric oxide and other oxidants to form 
intramolecular disulfide bonds. Both its phosphorylation and 
oxidation stale attenuate its binding affinity for calmodulin. 33 In 
stimulated platelets, PKC generation is linked to various activation 
pathways such as calcium- regulated kinases, mitogen- activated 
protein (MAP) kinases, and receptor tyrosine kinases. 1 Thus, these 
observations suggest that platelet neurogranin may function as a 
previously unidentified component of a PRC-dependent activation 
pathway coupled to one (or more) of ihese effector proteins. 



Discussion 

These data provide documentation for a unique platelet mRNA 
profile that may provide a tool for analyzing platelet molecular 
networks. Nonetheless, the molecular analysis of the platelet 
Iranscriptome may be confounded by the constant decay of 
mKNAs in the absence of new gene transcription, a sihiaiion thai 
may. for c.\ ample, limit the identification of low- abundance 
transcripts. Similar h\ hecanse the circulating platelet pool contains 



a mixed population of variably aged platelets, a "static" mRNA 
profile represents an average of this heterogeneous blood pool 
Despite these potential limitations, the combination of genomic 
and proteomic technologies are likely to provide powerful tools for 
the global analysis of phtelel function. Current strategies for 
cataloging "whole cellular proteomes" are generally accomplished 
by using 2 developing methodologies:, ( I) high resolution 2-dimen- 
sional poly acryl amide gel electrophoresis (2-DE) with mass spec- 
tromerric sequence identification, 34 and (2) microcapillary liquid 
chromatography with tandem mass spectrometry (pXC-MC/ 
MC). 35 Further modifications of both procedures have been devised 
for direct comparative studies between 2 cellular proteomes. The 
introduction of 2-DE differential gel electrophoresis has now made 
rt possible to detect and quantify differences between experimental 
sample pairs resolved on the same 2-dimensional gel. 36 Likewise, 
the application of isotope-coded affinity tags to ilLC-MC/MC 
represent a novel means or quantitative analyses between cellular 
proteomes. 37 The success of both approaches relies on the availabil- 
ity of comprehensive genomic databases and mathematical algo- 
rithms for optimal protein identification. Indeed, mathematical 
modeling studies have demonstrated the need to delineate both 
protein and mRNA expression levels for optimal definition of 
intracellular networks. 3 * Our data present an initial framework for 
delineating platelet function by defining the molecular anatomy of 
human platelets, information that is likely to provide important 
clues into the dynamic protein interactions regulating normal and 
pathologic platelet functions. Furthermore, because the platelet 
transcriptome mirrors the mRNAs derived from precursor 
megakaryocytes, these analyses may provide insights into the 
biochemical and molecular events regulating megakaryocytopoi- 
esis and/or proplatelel formation. 
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Overexpression of a DEAD box protein (DDX1) hi neuroblastoma and 
retinoblastoma cell lines. 

Godbout R , Packer M Bie W. 

Department of Oncology, Cross Cancer Institute and University of Alberta, 1 1560 
University Ave., Edmonton, Alberta T6G 1 Z2, Canada. 

The DEAD box gene, DDX1, is a putative RNA helicase that is co-amplified with 
MYCN in a subset of retinoblastoma (RB) and neuroblastoma (NB) tumors and cell lines. 
Although gene amplification usually involves hundreds to thousands of kilobase pairs of 
DNA, a number of studies suggest that co-amplified genes are only overexpressed if they 
■ provide a selective advantage to the cells in which they are amplified. Here, we further 
characterize DDX1 by identifying its putative transcription and translation initiation sites 
We analyze DDX1 protein levels in MYCN/DDX1 -amplified NB and RB cell lines using 
polyclonal antibodies specific to DDXI and show that there is a good correlation with 
DDXI gene copy number, DDXI transcript levels, and DDXI protein levels in all cell 
lines studied. DDXI protein is found in both the nucleus and cytoplasm of DDXI- 
amplified lines but is localized primarily to the nucleus of nonamplified ceils. Our results 
indicate that DDXI may be involved in either the formation or progression of a subset of 
NB and RB tumors and suggest that DDXI normally plays a role in the metabolism of 
RNAs located in the nucleus of the cell. 



PM1D: 9694872 [PubMed - indexed for MEDLINE] 
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Modulation of gap junction mediated intercellular communication in TM3 
Leydig cells. 

Coldenbcrg RC , Fortes PS, Cristancho JM Morales MM . Franci CR . Varanda WA 
Campos de Carvaltio AC : ' 

Institute of Biophysics Carlos Chagas Filho, UFRJ, Brazil. 

Long-term modulation of intercellular communication via gap junctions was investigated 
inTMJ Leydig cells, under low and high confluence states, and upon treatment of the 

m^°^ dl 5 erent timeS Wi * activators °f Protein kinase A (PKA) and protein kinase C 
(PK.C). Cells in low confluence were readily coupled, as determined by transfer of the 
dye Lucifer Yellow; on reaching confluence, the cells uncoupled. Western blots and RT- 
PCR revealed that connexin 43 (Cx43) was abundantly expressed in TM3 Leydig cells 
™«its expression was decreased after the cells achieved confluence. Stimulation of PKA 
or PKC induced a decrease in cell-cell communication. Staurosporin, an inhibitor of 
protein kinases, increased coupling and was able to prevent and reverse the uncoupling 
actions of dibutyryl cAMP and 12-0-tetradecanoyl-phorbol-13-acetate (TPA) Under 
modulation by confluence, Cx43 was localized to the appositional membranes when cells 
W A e nD 0UP ' ed md W3S main,y in Uie eytoP^m when they were uncoupled. In addition 
cAMP and TPA reduced thesurface membrane labeling for Cx43, whereas staurosporin 
increased it. These data show a strong correlation between functional coupling and the 
membrane distribution of Cx43, implying that this connexin has an important role in 
intercellular communication between TM3 cells. Furthermore, increased testosterone 
secretion m response to luteinizing hormone was accompanied by a decrease in 
intercellular communication, suggesting that gap junction mediated coupling may be a 
modulator of hormone secretion in TM3 cells. 



PMID: 12740021 [PubMed - indexed for MEDLINE] 
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Expression level of Ubc9 protein in rat tissues. 



Golcbiowski F, Szulc A, Sakowicz M , Szutowicz A . Pawelczvk T , 

Department of Molecular Medicine, Medical University of Gdansk 80-211 Gdansk 
Poland. ' ' 

Ubc9 is a homologue of the E2 ubiquitin conjugating enzyme and participates in the 
cova ent (inking of SUMO-1 molecule to the target protein. In this report we describe a 
simple and efficient method for obtaining pure human recombinant Ubc9 protein The 
purified Ubc9 retained its native structure and was fully active in an in vitro sumoylation 
assay with the promyelocyte leukaemia (PML) peptide as a substrate. In order to better 
understand the physiology of Ubc9 protein we examined its levels in several rat tissues 
rmmunoblot analyses perfomied on tissue extracts revealed quantitative and qualitative 
deferences in the expression pattern of Ubc9. The Ubc9 protein was present at a high 
level in spleen and lung. Moderate level of Ubc9 was detected in kidney and liver Low 
amount of Ubc9 was observed in brain, whereas the 1 8 kDa band of Ubc9 was barely 
visible or absent in heart and skeletal muscle. In heart and muscle extracts the Ubc9 
anhbodjes recognized a 38 kDa protein band. This band was not visible in extracts of 
other rat tissues. A comparison of the relative levels of Ubc9 mRNA and protein 
indicated that the overall expression level of Ubc9 was the highest in spleen and lung In 
spleen, lung, kidney, brain, liver and heart there was a good correlation between the 18 
kDa protein and Ubc9 mRNA levels. In skeletal muscle the Ubc9 mRNA level was 
unpreportionally high comparing to the level of die 18 kDa protein. The presented data 
indicate that m the rat the expression of the Ubc9 protein appears to have some degree of 
tissue specificity. & 
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ABSTRACT 

Motivation: Protein abundance Is related lo mRNA 
expression through many different, cellular processes. 
Up to now, there have been conflicting results on how 
correlated the levels of these two quantities are. Given that 
expression and abundance data are significantly more 
complex and noisy than the underlying genomic sequence 
information, it is reasonable to simplify and average them 
in terms of broad proteomic categories and features (e.g. 
functions or secondary structures), for understanding their 
relationship. Furthermore, it wilt be essential to integrate, 
within a common framework, the results of many varied 
experiments by different investigators. This will allow one 
to survey the characteristics ol highly expressed genes 
and proteins. 

Results: To this end, we outline a formalism tor merging 
and scaling many different gene expression and protein 
abundance data sets into a comprehensive reference 
set, and we develop an approach for analyzing this in 
terms of broad categories, such as composition, function, 
structure and localization. As the various experiments are 
not always done using the same set of genes, sampling 
bias becomes a central issue, and our formalism is 
designed to explicitly show this and correct for it. We apply 
our formalism to the currently available gene expression 
and protein abundance data for yeasl. Overall, we found 
substantial agreement between gene expression and 
protein abundance, in terms of the enrichment ol structural 
and functional categories. This agreement, which was 
considerably greater than the simple correlation between 
these quantities lor individual genes, reflects the way 
broad categories colled many individual measurements 
into simple, robust averages. In particular, we found 

*Ti> w Item cor ic.spoodr nrc .should be stMrcs.srit. 
; (hc:-f auHVn.s rciilrirfiircj equally to ihi.s wnjt,. 



that in comparison to the population of genes in the 
yeast genome, the cellular populations of transcripts and 
proteins (weighted by their respective abundances, the 
transcriptome and what we dub the translatome) were both 
enriched in: (i) the small amino acids Val, Gly, and Ala; 
(ii) low molecular weight proteins; (Hi) helices and sheets 
relative to coifs; (iv) cytoplasmic proteins relative to nuclear 
ones; and (v) proteins involved in "protein synthesis; 'cell 
structure,' and 'energy production/ 
Supplementary information: http://genecensus.org/ 
expression/translatome 
Contact: mark.gerstein@yale.edu 

INTRODUCTION 

High throughput experimentation, measuring mRNA 
(Schena el al, 1995; Eisen and Brown, J 999; Fere a 
and Brown, J 999; Lipshutz er at. t 1999) nnd protein 
expression (Anderson and Seilhamer, J 997; Fulcher ei al, 
1999; Gygi er aL 1999a; Ross-Macdonald et al.. J 999; 
Lopez, 2000; MacBeath and Schreiber, 2000; Nelson 
et ni T 2000; Zhu et nl. t 2000) are currently rhe single 
richest source of genomic information. However, how to 
best interpret this data is still an open question (Bassert 
et at., J 996; Wit res and Friedman, J 999; Zhang, J 999; 
Gcisicjn and Jansen, 2000; Searls, 2000; Sherlock. 2000; 
Claverie, 1999; Einarson and GoJemis, 2000; Epstein and 
Butow. 2000; Shapiro and Harris, 2000). Understanding 
how protein abundance is related to jnRNA transcript 
levels is essential for interpreting gene expression, protein 
interactions, structures and functions in a cellular sys- 
tem (Haizimanikalis eral., 1999). Moreover, as protein 
concentration is the more relevant -variable with respect 
to enzyme activity, it connects genomics to the physical 
chemistry of the cell (Kidd et nl.. 2001). Protein abun- 
dance may also be invaluable for diagnostics and lor 
Jerri mining drug targets (Coithals et aL. 2000). 
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Previously, we surveyed the population of protein 
features — such as folds, amino acid composition, and 
functions — in yeast, and other recently sequenced 
genomes (Gerstein, J 997, ]998a,b; Gerstein and Hegyi, 
J 998; Hegyi and Gerstein, 1999; Das and Gerstein, 
2000; Lin and Gerstein, 2000), and we extended this 
concept to compare the population of features in the 
yeast transcript ome to that in the genome (Orawid et 
ai, 2000; Jansen and Gerstein, 2000). Others have also 
done related work (Frishman and fvjewes, 1997; Tatusov 
et ai, 1997; Jones, 1998; Wailin and von Heijne, 1998; 
Frishman and Mewes, 1999; Wolf et oi, 1999). Here, we 
present a new methodology to compare the features of the 
mRNA expression population with the protein abundance 
population. 

Precise terminology is essential for this comparison. 
Unfortunately, 'proteome* is used inconsistently. Pro- 
teome can logically be used to describe all the distinct 
proteins in the genome (Qi et oi T 1996; Cavalcoli et ai y 
1997; Fey et ai, 1997; Carrels et ai, 1997; Gaasterland, 
1999; Jones, 1999; Sali, 1999; Tekaia et ai, 1999; 
Bairoch, 2000; Cambillau and Claverie, 2000; Dooliltle, 
2000; Pandey and Mann, 2000; Rubin et a!., 2000) and, 
in this context, it is equivalent to what others may refer 
to as the coding part of the genome. However, in papers 
on two-dimensional (2D) electrophoresis, it is often used 
to describe the sum total of proteins in a cell, taking 
into account the different levels of protein abundance 
(Shevchenko et ai, 1996; Gygi et ai, 2000a; Lopez, 
2000; Wash bum and Yates,. 2000). Jn an efTon lo be clear, 
we propose the term 'lransIntome : for this second usage 
of proteome. 

With this definition, we aie able to refer compactly to 
three different cellular populations. These are illustrated 
in Figure I. 

(i) We use" the term genome when we refer to the 
population of open reading frames, where each ORF 
counts once. 

(ii) We use the term transcriptome when we refer to 
the population of mRNA iianscripis. This lerm was 
originally coined by Velculesen et ai (1997). Note 
thai each ORP may give rise to different numbers 
of transcripts. Consequently, the transcript ome is 
essentially ilie same as the genome bur with each 
ORF weighted by its expression level. 

(iii) The next level is the cellular population of pjoteins. 
As each protein represents a translated transcript, 
we make an analogy with the term transcriptome 
and use the term translatome as described above 
to describe this third population. Thus, the irans- 
laiome is a subset of the genome where each 
ORF is weighted by its associated level ol protein 
abundance. 



Note that one could also, less compactly call the trans- 
latome a 'weighted proteome.* However, doing so assumes 
one of the two aforementioned definitions of proteome. To 
avoid ambiguity, we studiously avoid the use of proteome 
altogether in the paper. 

Differences between the translatome and the transcrip- 
tome exist given that transcripts from different genes 
can give rise to different numbers of proteins, due to 
different rates of translation and protein degradation. 
Posl-transcriptional modifications further affect the 
translatome. 

In our analysis of the transcriptome and translatome, we 
focus on global protein features rather than the compari- 
son of individual genes. Previous analyses have shown that 
differences between mRNA expression and protein abun- 
dance levels can be quite dramatic for individual genes. 
. This may either be due to the noise in the data or lo funda- 
mental biological processes. However, our analyses show 
that the variation between transcriptome and translatome 
is much smaller for global properties that are computed by 
averaging over the properties of many individual genes. 

METHODS 
Data sources used 

For our analysis we culled many divergent data sets, 
representing protein abundance and mRNA expression 
experiments and also other sources of genome annotation. 
These are all summarized in Table I . 

Biases in the data 

The databases that annotate the specific genes may 
not always be accurate (lshii et ai, 2000). Gene Chip 
experiments suffer with regard to cross hybridization 
and the saturation of probes. SAGE data degrades for 
lowly expressed mRNAs. 2D gels are unable to resolve 
membrane, proteins (approximately 30% of the genome) 
and basic proteins (Gerstein, 1998c; Krogh et ai, 2001). 
In addition, the procedures for identification and quan- 
tification of the protein spots are subject to. uncertainties 
(Hay nes and Yates, 2000). Human biases include the 
lack of low abundance proteins (Fey and Larsen, 2001; 
Gygi et ai, 2000b; Harry et ai, 2000) and the differences 
between laboratories in sample preparation. Our reference 
expression data set attempts to resolve these problems. 

Data set scaling 

/ \ refe re nee set for mHK' \ a p r es si on. Wi t h m a n y d i f f e r - 
em mRNA expression data sets available, it is worthwhile 
to integrate them into a single unified reference set, with 
the intention of reducing the noise and errors contained in 
the individual data sets and to obtain a unified estimate of 
the normal expression state in a cell. 
We adopt an iterative scaling and merging formalism. 
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Fig. 1. Schema! ic overview of (he analysis. On ihc left- side we outline the terms we use to describe the process of gene expression. The 
coding section of ihe genome is transcribed into a population of mRNA transcripts called the 'transcriptome/ The transcripts in turn are 
translated to a population of proteins; we use the term 'translatome' for this protein population rather than the alternative 'proteome* because 
the latter term may be confounded with the protein complement of ihe genome (which is nol necessarily associated with a quantitative 
abundance level). 

The matrix in Ihe middle schematically shows an analysis of the three stages of expression. In general, we define a protein 'population' as 
a set of genes associated with a corresponding number of expression or abundance levels ( 'weights'): In the matrix each row represents a 
weight and each column a gene set. In particular, we differentiate between the mRNA reference expression set (GmRNA — ^Gcd)» which 
essentially covers the complete genome, and the reference protein abundance set (Gp wt ) which contains the proteins in data sets 2-DE#l and 
2-DE #2 (see Table 1) because the protein abundance set is a significantly smaller subset of the genome. By definition, this subset contains 
only proteins that can be identified by 2-D gel electrophoresis and is therefore biased in this sense. The enrichment figures throughout this 
paper, through a comparison of the right- and left- sides of this figure, show the results of the experimental biases of 2D gels on the data set. 
Each pie chart represents a composition of a particular protein feature F (for instance, an amino acid composition) in a population (represented 
by the symbol ;*). We can further look at the 'enrichment' of this feature in one population relative to another (represented by the symbol A, 
see Section 'Methods' for an explanation of the formalism). 



which we summarize below. We pre sen! a more del oiled 
review of the methods on our web site. 

We start with the values 'of one gene chip data sel V t 
where / is used throughout as a subscript to denote gene 
number. We then transform (he values of the next Gene 
Chip data sel X ( lo >',- with the following non- linear regres- 
sion: min£ ( . (Yi - t/,) ? with r,- = AX? where A and B 
are the parameters of the regression". Note thai two Gene 
Chip seis may not be defined for the same sel oi genes, 
so we have to perform the fit only over the genes com- 
mon to both sets. The motivation for scaling is that the 
dynamic range of observed .ex press ion levels varies some- 
what between different data sets. although cell types and 
growih conditions are very similar. Reasons lor dispar- 
ity may include different calibration procedures for relat- 
ing fluorescence intensity to a cellular concentration (mea- 
sured in copies of transcripts per cell) or different pro- 
tocols for hiii vesting and j e vet se- transcribing ihe cellular 
mRNA. 

We then merge and average the data to create a new 



reference sel V as follows: 

\Y> - V ( \ 

II U; and Yi aie both defined for gene / and < a 

Yi 4 U, 

Then \'i = \ {Yi -f- U t ) 
Else i I only Y\ exists, V; = 
Else V, = U,. 

As presented above, where only one data set has a value 
foj the cones ponding ORF, we incorporated thai value 
and did not exclude it. When both data sets have values 
for an ORF, we averaged the values if tbey were within 
15% of each other; otherwise, we just stayed with the 
original chip data set U { . We used a = 15% in order to 
prevent outliers from skewing ibe result. This \5% value is 
a reasonable threshold for excluding outliers though other 
values (e.g. 10 or 20%) would give similar results (data 
not shown). Other data sets are subsequently included in 
the same procedure, continuing the iteration frurn the new 
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expression values V/. The initial iteration starts with the 
Young Expression Set, as , since we have the highest 
confidence in its accuracy. 

The SAGE data (Velcuiescu et oi, J 997) was not 
included in the above procedure since it is of a funda- 
mentally different nature. An advantage of the SAGE 
technology over Gene Chips is that there is no possible 
signal saturation for high expression levels, as is possible 
for chips (Futcher et al y 1999). Conversely, SAGE values 
are less reliable for lowly expressed genes since there 
is a chance that one might not sequence a SAGE tag 
corresponding to such a gene altogether. Therefore, if 
after the last iteration, the average Gene Chip expression 
level Vi was both above a certain threshold and below 
the SAGE expression level Si for the same gene, it was re- 
placed with the SAGE value; otherwise the average Gene 
Chip value was kept. This gave us our final expression 
set >v mRNA . Our treatment of the SAGE data is modeled 
after that in Futcher et ol (J 999), and like them, we used 
/f-16. 

This incorporation of the SAGE data into the reference 
data set ensures that the highly expressed outliers are as 
accurate as possible. 

Rather than plain arithmetic averaging, this overall scal- 
ing procedure with the a cutoff avoids 'artificial averages* 
that combine very different values for a particular gene. 
Some expression values might be statistical outliers. In 
addition, it may be possible that the expression levels of 
a variety of genes can only be within mutually exclusive 
ranges or modes, such as when two alternative pathways 
are swiichedon or off. Simply averaging these would give 
values that are less representative of the particular mode 
values. This situation is analogous to (hat in averaging 
together an ensemble of protein structures (i.e from NMR 
. structure determination). Each structure could be stereo- 
chemically correct, with all side-chain atoms in predefined 
rotamer configurations. However, an average of all struc- 
tures could yield one that is stejeochemically incorrect if 
this involved averaging over particular side-chains in dif- 
ferent rotameric stales. 

With teg;ud to our regression analysis, we have investi- 
gated both non- linear and linear fits but found a non-linear 
procedure to be more advantageous. The non-hnear rela- 
tionship between different expiession data sets perhaps 
reflects salutation in one or more of the Gene Chips — not 
an uncommon phenomenon. This non-linenniv is imme- 
diately evident on scatter plots ol two data sets against 
one another (see website). Accordingly, the non- linear 
frt piodnccs a smaller residual than the lineai fit: 9S297 
(non- linear) versus 122 182 (linear) foi the .scaling of ihe 
Church data set and 59828 (non-linear) versus 67-162 
(linear) for the Samson data set. 
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A reference set for protein abundance. We followed a 
similar procetture to calculate a reference protein abun- 
dance set from the two gel electrophoresis data sets. We 
first scaled the two data sets against the mRNA expres- 
. sion reference data set, getting regression parameters Cj 
and Dji 

where the subscript j indicates the data set 2-DE #1 or 
2-DE #2 respectively; Pj j is the protein abundance value 
in data set j y and w m RNA ; i the corresponding reference 
expression value, and Cj and Dj are the parameters of 
the non-linear regression. 

Using these parameters, we transformed the values of set 
2-DE #2 onto 2-OE.#l. Then we combined both sets into 
the reference protein set wprd by averaging them, if both 
values existed. Otherwise, by using the existing value, viz: 

wp«oi.i-= {Pi. i + £?,\2)/2 if both R i and exist. 
Else if only P u exists, wp [0i i = P;j 
Else if Qi -y exists, u/proi./ = (?;.2- 

Enrichment of features 

Formalism. In Ihe next part of our analysis, we want 
to group a number of proteins together into various 
categories based on common features, and characterize 
those features that are enriched in one population relative 
to another, i.e. the translatome population of proteins 
as measured by 2D gels relative to ihe lianscriptome 
population of transcripts or the genome population of 
genes. To this end, we set tip a formalism that could 
be applied universally to all the attributes that we were 
interested in. Due lo the limitations of the experiments, 
the translatome, transcriplome, and genome populations 
are defined on different sets of genes, and sometimes we 
want to remove this 'selection bias* by forcing them to be 
compared on exactly the same set of "genes. This is a key 
aspect of our formalism as presented in Figure I . 

We call an entity like |w, G] a 'population; where G 
is a set describing a particular selection of genes from the 
genome and w is vector of weights associated wjih each 
element of this population. In particular, we foc us On three 
main populations here: 

(>) IT Gcfnj is the population of genes in the genome, 
all 6280 genes weighted once (w = J). 

(ii) h v mRi\v\ : GmRNA ) is the observed population of the 
transcripts in the lianscriptome. i.e. the 62*19 genes 
in the reference expression set weighted by their 
reference expression value: 
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(iii) [wproi, Gp Iol ) >s the observed cellular popuJation of 
the proteins in the translatome, i.e. the 181 genes 
in the reference abundance set weighted by their 
reference abundance value. 

(The set of genes in the genome Gcen is approximately 
equal lo the genes in set G m R^ A , such that we can use 
both symbols interchangeably.) We can also use this nota- 
tion to describe specific experiments — e.g. [v^z, Gt ac z3 
describes the gene set and weights relating to the txanspo- 
son abundance set. 

Furthermore, we define Fj as the value of a feature F 
in ORF j. For example, F could be the composition 
of leucine (a real number) or a binary value (0 or J) 
indicating wheiher an ORF contains a trans membrane 
segment. Given these definitions, the weighted average of 
feature F in population [w, G] is: 



fi(F, KG)) 



_ ZjzG W J F J 



The weighted averages of two populations [w, G] and 
[v, S) can be compared by simply looking at their relative 
difference A: 



A(/\K5). K G}) = 



[v, S}) - jx(F, [w, G)) 
M(/=".K GJ) 



where v and w are weights for the sets of ORFs S and G 
respectively. We call A the 'enrichment' of feature F 
because it indicates wheiher F. is enriched (if A is 
positive) or depleted (if A is negative) in population [v, S) 
relative lo (>v, G). 

Usually, the gene sei G is defined by the particular 
experiment, for which the weight w was measured. 
. However, it is also possible to combine the gene set 
associated with one experiment with expression levels 
from another set. One may want ro do this to compute 
the enrichment only on the genes common to both 
populations, for which there are defined values for both w 
and v, viz: &(F. (u : S n G). |u>, S ft Gj). In practice, 
this is most relevant for comparing Gp, n and G m RNA- 
Since Gp lot is completely a subset of C^na, we need 
not explicitly deal with intersections il we calculate all 
statistics directly over Gp, ol . 

One can adjust Ihe weight vectors to take into account 
different types of averaging. For ins lance, when com- 
puting the amino acid 'composition (F ~ on) from the 
amino acid compositions of individual ORFs Fj — ooj 
(Vy e G). we weight by ORF length. In the case of. 
expression weights, we h:ive: 

where Nj is a me asm e of the length of ORF / (such as the 
number of amino acids). 



On the other hand, when compuling the average molec- 
ular weight per amino acid, we need to normalize by the 
number of amino acids per ORF, which is equivalent to 
choosing the following weights: 



W>mRNA ? j 



V/ 6 G. 



. Application of methodology to quantitative 
abundance sets 

Having defined our formalism, we applied it to a diverse 
set of protein features in yeast. 

Amino acid enrichment. As shown in Figure 2a, we used 
pur methodology to measure the enrichment of indi vidua] 
amino acids in both the translatome and the transcriptome 
relative to the genome. We found that three amino acids — 
valine, gJycine and alanine — were consistently enriched in 
both transcriptome and translatome populations. 

In Figure 2a we compare different gene Sets. Jn Fig- 
ure 2b we focus mainly on the variation in enrichments 
when all the comparisons are restricted to the set of 18J 
genes (Gp rot O G m RNA = Gp rot ) common to all data sets. 
Thus, the differences between the populations now only 
reflect the effects of differential transcription of certain 
genes and differential translation of certain transcripts. 
We find here an enrichment specifically of cysteine in the 
translatome in relation to the transcriptome. 

To measure the statistical significance of the results on 
amino acid enrichment, we have performed a control anal- 
ysis on a randomized data set (Figure 2d). We randomly 
permutated the expression values of the ORFs J000 times 
and then recomputed the enrichments. This allowed us lo 
compute distribulions for ihe amino acid enrichmenls and, 
from integrating these, one-sided /?- values indicating the 
•significance of the observed enrichments. 

Amino acid enrichment in Trnnsposon da la set. We 
also tried to extend our methodology, ineffeclively, to 
cope with the serni-quaniitative Transposon set. We used 
only those 450 ORFs thai consistently yielded either no 
expression or high expression, as binary data, on or off. We 
show ihe enrichmenls of amino acids computed from this 
filtered Transposon abundance set in Figure 2a. Overall, 
the enrichments from this set seemed to be attenuated in 
comparison to other data. 

Biomass enrichment. A corollary U> amino acid enrich- 
ments is the determination of (he average biomass of the 
transcriptome and translatome populations (shown in Fig- 
ure 2c). We found that the average molecular weight of 
a protein in both populations was. on average, lower than 
in the genome population. These preliminary obser vol ions 
suggest a cell preference to use less energetically expen- 
sive proteins for those that arc highly transcribed or trarrs- 
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Fig. 2. Amino acid and biomass enrichment, (a) Shows ihe amino acid enrichments between different populations as indicated by (he legend 
to the right of the plot (the legend is ordered in the same way as the schematic illustration in Figure I). The bars indicate the enrichment of the 
transcriptome relative to Ihe genome, whereas the circles indicate the enrichment of the translatome relative to the genome. In addition, we 
also show the enrichment for protein abundance from the Transposon abundance set, represented by the circles with ihe line through them, 
(b) Shows a different view of amino acid enrichment from that contained in (a), now focusing on changes, and thus restricting the comparison 
to the genes common to all the data sets. The graph is ordered according to the enrichment from transcriptome to translatome (black squares). 
We focus here only on the changes for the abundance gene set (G Pro ,) to exclude the effects that arise from looking at different subsets. In this 
. view ihe enrichments from genome to Iranscriptomc (while squares) and from genome to translatome (white diamonds) look more similar 
than do the analogous sets in (a). To make comparison with (a) easier we again show the enrichment from genome to the transcriptome for 
the complete gene set (Ocen- shown in bars), (c) Shows biomass enrichment. The left panel depicts the average molecular weight per ORF 
(in units of kDa) and the right panel, the average molecular weight per amino acid (in units of Daltons) in each of the three stages of. gene 
expression. The numbeis inside ihe circles indicate ihe average molecular weights. The values next to the arrows indicate the enrichments 
in biomass between different populations. Both the circle diameters and the anew widths are functions of the corresponding values (the 
hollow arrow indicates a positive V3l«e). It is very clear that the average molecular weight per ORF is much lower in the translatome (by 20 
or 15%) and transcriptome (by 29%) than in the genome. This relative depletion of biomass mainly takes place as o result of transcription; ihe 
effect of translation is less clear, depending on the populations compared. On ihe other hand, the depletion in the average molecular weight 
per ammo acid (-3.3% from genome to translatome) is an order of magnitude smaller than in the average weight per ORF. This shows 
that the yeast cell favors the expression of shorter ORFs over longer ones, and agrees with our earlier observation that there is a negative 
■ correlation between maximum ORF Icngih and mRNA expression (Jansen and Gci stein, 2000); it seems that this effect mainly Inltes place 
during transcription rather than translation. (d)This plot shows that the amino acid enrichments are statistically significant. We have assessed 
significance by randomly permuting the expression levels among the genes and Then icromputing the amino acid enrichments. This procedure 
can be repeated and used to generate distributions nf random enrichments thai can then be compared against Ihe observed enrichments. In 
the plot the giay bars represent the observed enrichments already shown in Figuie 3a. On top of the gray bars we show standard boxplots 
of enrichment distributions based on 1000 random permutations. (The middle line represents the distribution median. The upper and lown 
sides of the bo.> coincide with the upper and lower quartiles. Outliers are shown as dots and defined as data points that are outside the range 
nf the whiskers, (he length of which is 1.5 the interquartile distance.) Hased on the random distributions, wc can compute one-sided /?- values 
for the observed enrichments. Amino acids lot which the /> values arc less than JO - - are shown in bold font. 



toted However, wc also found thru the average molecu- 
hr weight pet amino acid differed much less between (he 
ttanscriptonie and the I raws la tome on the one hand, and the 



genome on the other hand (though it was still slightly less). 
This finding indicates that lower molecular weights in the 
ttanslaiome and ttanscriptonie relative to the "enome are 
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predominantly due to greater expression of .shorter pro- 
teins rather than the incorporation of smaller amino acids. 

Secondary, structure composition. We also used our 
methodology to study the enrichment of secondary- 
structural features. Secondary structural annotation was 
derived from structure prediction applied uniformly 10 all 
the ORFs in the yeast genome as described in Table J. 
As shown .in Figure 3a t all three populations — genome, 
transcriplome, and translatome — had a fairly similar 
. composition of secondary struct tires — sheets, helices, and 
coils. The differences. bet ween populations were marginal 
and based only on the small subset of genes. 

We also found that Transmembrane (TM) proteins 
were significantly depleted in the transcriptome (see 
website and caption). These results ate consistent with 
our previous analyses (Jansen and Get s tein, 2000). The 
protein abundance data does not h;>ve any membrane 
proteins. 

Subcellular localization. Fig me 3 c shows the enrich- 
ment of proteins associated with die various subcellular 
compliments. For clarity, we divided the cell into five 
distinct subcellular compartments, (see Table I). We 
found that, in comparison to ihc genome., both the i r an- 
sa iptome and translatome ate enriched in cytoplasmic 
proteins. This is tare whether we make our comparisons in 



relation to the relatively large reference rnRNA expression 
set or the smaller reference protein abundance set. As 
Figure 3c shows, the 2D gel experiments are clearly 
biased towards proteins from the cytoplasm. However, in 
the biased subset G*p, m transcription and translation lead 
to an even higher fraction of cytoplasmic proteins in the 
translatome. 

Functional categories. Finally, we compared the enrich- 
ment of various functional categories in both the trans- 
latome and the trnnscriptome (see Figure 3b). This gives 
us a broad yet informative view of the cell as a whole. As 
described in Table I, we used the top-level of the MIPS 
scheme for the functional category definitions. We found 
broad differences between the 'various populations, with 
some of the functional categories showing strikingly high 
enrichments. 

DISCUSSION AND CONCLUSION 

We developed: (i) a methodology for integrating many dif- 
ferent types of gene expression and protein abundance into 
a common framework and applied this to a preliminary 
analysis; (ii) a procedure lor scaling :md merging different 
mRNA and protein sets together; and (tii) an approach for 
computing the enrichment of various proteomic features in 
the population of transcripts and proteins. We showed that 
by analyzing broad categories instead of individual noisy 
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Fig. 3. Breakdown of the tianscriptome and translalome in terms of bioad categories relating to structure, localization, and function. All 
of the subfigures are analogous to the schemaiic illustration in Figuie I. (a) Represents ihe composition of secondary structure in the 
different populations, (b) Represents the distribution of subcellular localizations associated with proteins in the various populations. We used 
standardized localizations developed earlier (Diawid and Gerslein, 2000), which, in turn, were derived from the MIPS, YPD, and SwissPiol 
databases (Bairoch and Apweiler. 2000; Costanzo et al, 2000; Mewcs et o/., 2000). The subcellular localization has been experimentally 
determined for less than half of the yeast proteins, so our analysis applies only to this subset, (c) Shows the division of ORFs into different 
functional categories (according to the MIPS classification) in the various populations. Only the largest functional categories of the lop level 
of the MIPS classification arc shown. The group 'oiher' contains ihe smaller top-level categories lumped together. This 'other' group is 
different from ihe group 'unclassified,' which contains genes without any Junctional description. 



data points, we could find logical trends in the undei lying 
data. For example, individual transcription factors might 
have higher oi lower protein abundance than one expects 
from their mRNA expression, hut the category 'transcrip- 
tion factors' as a whole has a similar representation in the 
Iranscriplome and translatome. 

We found, as pjeviously described (Fuicher et al. )999; 
Gygi et nl. : 1999b; Green baum et o/. ; 2001), a weak 
correlation between individual measurements of mRNA 
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and piotein abundance. The outliers of this correlation 
tend 10 be associated with cellular organization. One 
might conceive of using these outliers (i.e. those with 
significantly tlifteie.nl transcriptional and translational 
behavior) to find consensus regulator)' sequences. One 
possible method would involve using predicted mRNA 
Mine lines l.laegcr er n/.. 1990; ZuKei; 2000) lo find and 
investigate consensus stiuctural elements in these outbeis 
to which ihe yeast translation:)!' machinery is known to be 
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Table 1. Data sets 



Description 



Sire lORFs) 



Reference 



mRNA expression 
Young 

Church 

Samson 

SAGE 

Reference expression 

Protein abundance 
2-DE81 

2-DE#2 
Ttansposon • 

Reference abundance 

Annotation 

Annotated localization 

TM segments. 

MIPS functions 

GOR secondaiy structure 



Gene chip profiles yeast cells with • 
mutations thai affect transcription 
Gtne chip profiles of yeast cells 
under four different conditions 
Comparing gene chip profiles for 
ytasi cells subjected to alkylating agent 
Yeast cells during vegetative growth 

Scaling and integrating the mRNA 
expression set into one data source 



Measurement of yeast protein 

abundance by 2D gel 

electrophoresis and mass spectrometry 

Similar I6 2-DE set #1 

Large-scale fusions of yeast genes 
with lacZ by tiansposon insertion 

Scaling and integrating the 2-DE 
data sets into'one data source 



Subcellular localizations of yeast 
proteins 

Predicted TM and 
soluble proteins in yeast 
Functional categories for yeast 
ORFs 

Piedictcd secondary structuie 
yeast ORFs 



5455 

6263 

6090 

3778 
6249 

156 

71 
1410 



2133 
(6280) 

2710 
(6280) 

3519 
(6194) 

6280 



Holsiege*ra/.(i99B) 
Roth el al. (1998) 
Jcl insky and Samson ( 1 999) 
Velculcscu e t at. (1997) 

Gygi etal. (1999a» 

Futcher etal. (1999) 
Ross-Macdonald et at. ( 1999) 



Drawid and Gerstein (2000) 
Gerstein(l998a.b,c) 
Mewes ci al. (2000) 
Getstein(l998a.b.c) 



This table provides an overview of the data sets used in mil analysis. The table is divided into three sections. The top section lists different mRNA eapression 
sets. The middle section shows the protein abundance data sets used. The bottom section contains different annotations of protein features. The column 'Data 
set' lists a shorthand refeiencc to each data set used throughout this papei. The next columns contain a brief description of the data sets, the number of ORFs 
contained in each of ihcrn, and the literature reference. In contrast to the other data we investigated, the reference expression arid abundance data sets have 
been calculated for the purpose of our analysis (see text). An expanded version of the table is available on our web site. 
Some further infojmalron on the genome annotations: 

iMcaiizasion. Protein localization information from YPD, MIPS and SwissPiot were merged, filtered and standardized (Bairoch and Apweiler. 2000; 
Costanzo et at., 2000; Mewes el at., 2000) inlb five simplified cumpanmcnls — cytoplasm, nucleus, membfane. e*ltacel)ular (including pi ore ins in ER and 
frolgi). and mitochondrial— according to the protocol in Drawid ,- f al. (2000). Tlus yielded a standardized annotation of protein subcellular localization for 
2 1 33 out of 6280 ORFs/ 

7M if A -mfn/j. In 2710 out of 6280 yeast ORFs TM segments ate piedictcd to occui. janeing from low to high confidence (732 ORFs). The I M prediction was 
pnformed as follows: the values fiom the scale foi amino acids in a window of size 20 (the typical size of aTM hcii*) wcie averaged and then compared 
against a cutoff of - I kcal roof A value under this cutoff was talirn to indicate the existence of aTM heli.v Initial hydrophobic soetches corresponding to 
signal sciences for membrane mseirion we,c excluded. (Thcje have the pattern of a charged iesidue within the first seven, followed by a SDetf h of 14 with 
an avnage hydiophobiciry under the cutoff.) These parameter have been used, teited, and ic fined on surveys of membrane protein in genomes. Sine' 
rnrrnbiane ptoteins had at least two TM- segments with an averse hydiophi&iriiy less ib.m - 2 kca) mol - '.(RoM etnl., 1995; Gerstein eml. 2000. Snmorii 
rt ni. 2000; Senes ft a!.. 2000). 

I'umttani. MIPS functional categories have been a< signed to 3519 cm ol 619-1 ORFs. (The itrmamdcr are assig ncd to category ?S" vt 09; w hich 
conoponds 10 unclassified fund ion.) 



se nsitive (McCarthy, 1 99b'). 

in jehlion lo functional categories, wc found ihice 
1 re nils that were paniculaily notable: (i) the 'cellular 



oiganizntion/ 'protein synthesis/ and energy pjodiKtio/v 
categories were increasingly enriched as we moved horn 
genome to transenptome 10 translniome. .In the fKinscrip- 
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tome and translaiome population relative to the genome; 

(ii) proteins with 'unclassified function' are significantly 
depleted, perhaps reflecting a bias against studying them; 

(iii) proteins in the 'transcription' and 'cell growth, cell 
division, and DNA synthesis' categories were consistently 
depleted. This reflects the fact that many of these proteins, 
such as transcription factors, act as 'switches' such that 
only small quantities of the protein are necessary to 
activate or deactivate a process. These results concur with 
previous calculations (Jansen and Gerstein, 2000) wherein 
we found the transcript ome is enriched specifically witr 
proteins involved in protein synthesis and energy. 

Limitations given the smaH size of the protein 
abundance data 

Even with the extended . coverage made possible by 
merging many data sets together into reference sets, the 
analysis is still limited by the minimal data. This was 
most applicable to the protein abundance measurements, 
potentially biasing our statistical results towards certain 
protein families. Moreover, the 18 J proteins in Gp, m do 
not represent a random sample. They are skewed towards 
highly expressed, well-studied proteins. Our methodology 
attempts to control for this gene-selection bias through 
our enrichment formalism, which allows one to rather 
precisely gauge various aspects of the bias. Conversely, 
many protein features in. both the translaiome and the 
trariscriptome are dominated by highly expressed proteins. 
Under these circumstances, it is often sufficient to look at 
this smaller number of dominating proteins to characterize 
the whole population.- This is similar to ihe development of 
the codon adaptation index for yeast (Sharp and Li, 1987). 
While based on only 24 highly expressed proteins, it has 
proven to be robust in predicting expression levels for the 
entire genome. 

We believe that (he essential formalism and approach 
that we develop wjlfremain quite relevant for future data 
sets' (Smith, 2000). 
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Galaiiin in pituitary adenomas. 

K, jLaudry M, Schmidt WE , B^kjeltT, Hulting AL . 

' ^7^T °5 1 M ° ,eC e ular J Medicine > E » docrine ^ Diabeies Urut, Karol.nska Hospital 
b-l/176Stockliolm, Sweden. Eva.Grenback@ks.se 

Tumor galanin content was measured in extracts from human pituitary adenomas using a 
specific RIA method for monitoring human galanin! Twenty-two out of twenty-four 
hmiors contained galanin with notably high levels in corticotroph adenomas, varymg 
levels m clinically inactive tumors, and low levels in GH secreting adenomas Tumor 
ga anm and ACTH contents were closely correlated in all tumors. In four young patients 
w.«i rmcroadenomas and highly active Mb Cushing tumor galanin was inversely related 
to tumor volume. 1 he molecular form of tumor galanin, studied with reverse-phase 
HFLC, was homogeneous with the majority of tumor galanin coeluting with standard 
human gaJanin. In the rumors analysed with in situ hybridization there was a good 
correlauon between galanin peptide levels and galanin niRNA expression In some 
rumors galanin mRNA and POMC levels coexisted, in others they were essentially in 
different cell populations. Levels of plasma galanin-Ll were not related to tumor galanin 
concentration, and galanin levels were in the same range in sinus pe.rosus close to the 
pituitary venous drainage as in peripheral blood. Corticotropin releasing Hormone 
injections in two patients caused ACTH, but no detectable galanin release into sinus 
perrosus. Our results demonstrate that corticotroph, but not GH adenomas, express high 
levels of galanin, in addition to ACTH, and that in some tumors both polypeptides are 
synthesised ,n the same cell population. However, galanin levels in plasma were not 
influenced by the tumor galanin content. 
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Protein abundancy and mRNA levels of the adipocyte-type fatty acid 
binding protein correlate in non-invasive and invasive bladder transitional 
cell carcinomas. 

Gromova I, Gromov P . Wolf H . CeUs JE . 

Department of Medical Biochemistry and Danish Centre for Human Genome Research 
The UniversHy of Aarhus, Aarhus C, Denmark. 

The adipocyte type fatty acid-binding protein (A-FABP) is a small molecular weight fatty 
ac.d-b.ndmg pro e,n whose expression correlates both with the grade of atypia and the 
stage of bladder trans.t.onal cell carcinomas (TCCs). To determine if the protein 
abundancy correlates with the mRNA levels in non-invasive and invasive lesions we 
have.ana lysed fresh TCCs (grade II, Ta; grade III, T2-4) by two-dimensional 
polyacrylarmde gel electrophoresis (2D-PAGE) and measured the mRNA levels using the 
reverse transcription linked polymerase chain reaction (RT-PCR). Overall the results 
showed a good correlation between protein abundancy and mRNA levels, indicating that 
the lack of expression of the protein observed in some lesions reflects low levels of 
rranscnphon of the A-FABP gene rather than translation^ regulation. In addition our 
stud,es showed that the loss of A-FABP protein observed in some tumors is not 

ca°S inThf A ra?p TT ^ Skin fa " y acid - b inding.protein PA-FABP, as is the 
case in the A-FABP knockout mice. 
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Abstract The adipocyte type Fatly acid-binding protein (A- 
FABP) is a small molecular weight fatty acid-binding protein 
whose expression correlates both with the grade of atypia and 
the stage of bladder transitional cell carcinomas (TCCs). To 
determine if the protein abundancy correlates with the 
mRNA levels in non-invasive and invasive lesions, we have 
analysed fresh TCCs (grade II. Ta; grade 111. T 2 J by two- 
dimensional polyacrylamide gel electrophoresis (2D-PAGE) 
and measured the mRNA levels using the reverse transcription 
linked polymerase chain reaction (RT-PCR). Overall, the 
results showed a good correlation between protein abundancy 
. and mRNA levels, indicating that the lack of expression of 
the protein observed in some, lesions reflects low levels of 
transcription of the A-FABP gene rather than translation^ 
regulation. In addition, our studies showed that the loss of A- 
FABP protein observed in some tumors is not compensated 
by an increase in the skin fatly acid-binding protein PA- 
FABP, as is the case in the A-FABP knockout mice. 

Introduction 

Bladder cancer accounts for 4.7% of till human cancers 
diagnosed. The spectrum of bladder tumors is broad with 
various histological types that include transitional celt 
carcinomas (TCCs), squamous cell carcinomas (SCCs), and 
adenocarcinomas (1-3). TCCs are by far the most prevalent 
tumors as they represent nearly 95% of all bladder cancers in 
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the Western Hemisphere. At first presentation, about 70% of 
the urinary bladder TCCs are diagnosed as differentiated 
superficial lesions that are confined either lo the mucosa 
(Ta), or to the underlying connective tissue (T,). The rest 
correspond to highly invasive, pooTly differentiated tumors. 

Non-invasive TCCs occur as two distinct growth patterns, 
papillary and non-papillary (Hat) lesions (1,2); that display 
significant differences in their malignant- potential and that 
are believed to originate from different genetic alterations 
(4-6). Papillary carcinomas usually correspond to low-grade 
lesions which frequently recur multiple times. These tumors 
begin as areas of hyperplasia that later undergo a process of 
dedifferentiate (grades MV). Invasive tumors may arise 
from these lesions, but often are derived from non-papillary 
carcinoma in situ that usually is of high grade at presentation 
and tend to invade and progress to muscle invasion and 
metastatic disease. 

To date, many attempts have been made to pinpoint genetic 
changes that underly progression of bladder cancer as well as 
to identify molecular markers that correlate with tumor 
progression. Cytogenetic studies and molecular genetic data 
have shown that chromosomes 3p, 4p, 4q, 5q. 8p, 9p, 9q, 
lip 13q, I4q, I7p and I8q are frequently altered in bladder 
urothelial tumors (4.5 and refs. therein), and as a whole they 
have supported the notion that bladder cancer is a mulustep 
process. Recently, Spruck et at (6) showed that chromosome 
9 alterations occur early during development, while p53 
mutations appear later in the process and confer invasive 
properties. The situation however is reverse m the case ol 
Cis, as a large fraction of these lesions contain p53 mutations 
(5 6 8,9). Besides pointing towards two divergent pathways of 
bladder tumor progression, these studies implied that the order 
in which the genetic changes occur is important in determining 
the outcome of the lesion. . 

Assessment of bladder cancer is based on a ^ orou - n 
pathological examination of biopsy material which establishes 
the historical type of the tumor, its degree of differentiation 
(orade), and depth of invas.on of the bladder wall (staging) 
00- 12). In spite of strict criteria for the pathological assessment 
of these lesions, there exist a significant inter-palho.og.St 
variation, a fact that emphasises the need for objective 
markers that may aid their classification. With this in mind 
we ore exploring the possibility of using proicomc (13) 
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expression profiles of these lesions as fingerprints to define, 
their grade of aiypia and eventually their stage (3 J 4). So far. 
more than 400 tumors of various grades and stages have been 
analysed by two-dimensional polyacrylamide gel 
electrophoresis (2D PAGE), and preliminary experiments 
have shown that even though the overall protein expression 
profiles of tumors of the same grade and stage are very similar, 
there are important differences suggesting that morpho- 
logically Identical' TCCs may be further subdivided (I). Of 
the biomarkers of TCC progression identified so far, the 
adipocyte-type fatty acid binding protein (A-FABP) is 
perhaps one of the most interesting as the levels of this 
polypeptide have been shown to correlate both with the grade 
of atypia as well as with the stage of the disease (3). Given 
the putative importance of A-FABP as a progression marker, 
and since Anderson and Seilheimer (15) recently showed that 
post-transcriptional regulation of gene expression is a frequent 
phenomena in higher organisms, we have compared the levels 
of A-FABP mRNA and protein in non-invasive and invasive 
bladder TCCs expressing and lacking this protein. 

Materials and methods 

Tumors. Fresh bladder tumors were obtained immediately 
after transurethral resection.. The grade and clinical stage of 
. the tumors were determined by the pathologist at the Aarhus 
Municipal hospital. Clean tumors devoid of blood clots were 
divided into small pieces for 2D PAGE and DNA, and RNA 
preparation- The latter were immediately frozen in liquid 
nitrogen and store at -80"C until use. 

l n SJ-methionine labeling and 2D-PACE. In a few cases, 
small tumor pieces were Jabeled with ("S) methionine as 
previously described (3). 2D-PAGE was performed 
according to published procedures (16; see also hup: 
//biobase.dk/cgi-bin/celis). 

RT-PCR. Frozen tumor samples were ground to powder in 
liquid nitrogen and total RNA was isolated using the acid 
guanidium isothiocyanaie/phenol chloroform extraction 
procedure (17). The samples were treated with RNase-free 
DNases I (Pharmacia) to eliminate contaminating genomic 
DNA using the protocols recommended by the supplier. 
PoIy(A)* RNA was prepared using Poly (A) : Quick columns 
according lo the manufacturer's instructions (Slratagene). The 
synthesis of cDNA for RT-PCR reactions were carried out 
using the Gibco BRJL Superscript Kit. Two pg of total RNAs 
was mixed with oligo -dTH primer, PCR buffer, MgCI 2 
(25 mM). O.i M DDT and 10 mM dNTP. The mixture was 
incubated at 42"C for 5 min followed by the addition of 
Superscript 11 reverse transcriptase and further incubation at 
42'C for 50 min. The reaction was terminated by raising the 
temperature to 70*C for 15 min, followed by additional 
incubation at 37"C for 20 min in the presence of RNase H lo 
deplete the RNA. Primers for known and cloned genes weie 
purchase from Pharmacia as follows: A-FABP, Upper (from 
186-208 bp) 5'-GATCATCAGTGTGA ATGGGGAT-37 
lower (from 374-397 hp) 5'- CATCCTCTCGTTTTCTCTiT 
ATG-3*. B- actio upper 5 ' - G A G G1TG G CTCTG A CTGT A C C 
AC-3/lower 5 -CTCA1TCAGCTCTCGGA ACATCTCG -3\ 



Table 1. Expression of A-FABP in noninvasive and invasive 
bladder TCCs; 
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=The levels of a A-FABP protein were determined based on the 
visual analysis of Coomassie Brilliant Blue stained gels and 
represent the average estimate of at least two different runs. Tumors 
scored as positive differed significantly with respect lo the levels of 
the protein, and therefore are indicated with either four (very high), 
three (high), two (medium) and one (low) cross (see also Fig. I); 
'The mRNA levels were determined based on the intensity of 
Ethidium Bromide siained cDNA bands separated on agarose gel 
using the Bio-Rad Gel Doc 1000 system and represent the average 
estimate of at least three independent experiments. Cor/esponding 
rhRNA levels are represented by crosses as described above. 



PCR reactions were performed in a Progene thermal cycler 
using the Advantage Klen Tag Polimerase (Clontech)- The 
cycling parameters consisted of 30 sec of denaiuralion at 
94"C, with annealing of 30 sec at 60X for B-aclin or at 
64"C for A-FABP. The extension lime was for 2 min at 68*C 
for 29-40 cycles with the final extension of 7 rnin at 68X. The 
PCR products were separated on 1.5% agarose gel 
electrophoresis followed by ethidium bromide staining and 
photography under UV light. 

Results 

A-FABP protein levels in non-invasive and invasive TCCs. 
One hundred suspected TCCs removed at the Department of 
Urology* Skejby Hospital, were analysed by high resolution 
2D PAGE ;md Coomassie Brilliant Blue staining. 01 these, 
10 grade Jl. Ta TCCs (Table 1) were chosen to con elate A- 
f ABP proicin and mRNA levels as these lesions yielded 
acceptable protein profiles both in lertns of their purity as 
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Fieure I IEF 2D gels of whole cellular extracts from non-invasive and 
invasive TCCs. A. TCC 532-1; B, TCC-692-1; C.TCC763- I; D.TCC709-I 
and E.TCC 711- 1. Only ihe relevant area of ihe gels are shown. 



p- a ctin* 
A-FABP- 



assessed by monitoring for the absence of vimenlin 
(contamination with connective tissue) and desmin 
(contamination with smooth muscle cells), as well as 
polypeptide resolulion. In addition, reasonable amounts of 
these tumors were available for mRNA preparation. 

Table J shows the levels of A-FABP protein expression in 
the 10 tumors analysed by 2D PAGE. The data were scored 
entirely based on (he visual analysis of Coomassie Brilliant 
Blue stained gels and represent an average estimate of at 
least two different runs. Tumors scored as positive differed 
significantly with respect to the levels of this protein, and 
therefore are indicated with either four (very high), three 
(high), two (medium) and one cross (low). Representative 
example of Coomassie stained 2D gels of tumors exhibiting 
very high (TCC 532-1, Fig. IA), high (TCC 692-1; Fig. ID), 
medium (TCC 763-1, Fig. 1C) and undetectable levels (TCC 
709-1 and TCC -711-1 Fig. ID E) of A-FABP are shown m 
Fig. 1 (only the relevant area of the gels are shown). 

A-FABP mRNA levels in non-invasive grade II, Ta TCCs. 
Since in many instances only a limited amount of fresh tumor 
was available, we used RT-PCR to determine the levels of A- 
FABP mRNA in the ten TCCs analysed by 2D PAGE (Fig. 1). 
Following amplification, the PCR products were analysed by 
conventional 1.5% agarose gel electrophoresis and visualised 
by eihidium biomitlc staining as shown in Fig. 2. The amount 
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Figure 2." RT-PCR analyse of A-FABP mRNA expression in non-invasive 
iGrlL Ta) and invasive TCCs (Or HI. T r T»). For RT-PCR analysis, the ss c- 
DNA was synthesized by Reverse Transcriptase using total RNA, and used 
for RT-PCR amplification. The PCR products were resolved on 1:5* 
agarose gels and visualised under* UV light following ethidium bromide 
siaining. The A-FABP panels show the results of amplifications where the 
pair of gene specific primers was used to generate the 220 bp DNA 
fragment. AmpuficatioD of A-FABP was obtained after 30 cycles of PCR. 
The B-aclin panels represent the amplification of the B-actin gene, which 
was used as an internal control lo confirm that equal amounts of c-DNA were 
used in each reaction. 



Q f cDNA in each lane was normalised using several house- 
keeping genes so as lo achieve a more accurate assessment of 
Ihe expression of ihe A-FABP mRNA. As shown .n Fig. 2. 
TCC 532- 1 exhibited the highest amount or A-FABP mRNA. 
followed by TCCs 692-1, 763-1, 616-1. 581-1. 154-1. 166-5 
and 533-1 Undetectable levels of A-FABP mRNA were 
observed in the case of TCCs 607-1 and 709-1 (F.g 2). 
Reialive mRNA levels for the ten TCCs are g.ven in Table I. 

A-FABP protein and mRNA levels in invasive grade III. T„ 
TCCs. Of the invasive TCCs (grade 111. T 2 J analysed by 2D 
PACE only six yielded reasonable protein profiles for further 
study As shown in Table 1. n6ne of these lesions expressed 
detectable levels of A-FABP as determined by Coomassie 
Brilliant Blue staining (Fig. IE. TCC 71 1-1). In line with these 
results, ihe RT-PCR analysis of these tumors also revealed 
undetectable level of A-FABP mRNA (Fig. 2. Gr ID lyi... 
Table 1). 

Loss of A-FABP protein is not compensated by on increase in 
PA-FABP. Recent studies of A-FABP knockout m.ce have 
shown that the loss of ArFABP in fat tissue is com P=" sa '^ 
by an increase in the skin fatty acid-binding protein maJI(l5>. 
Our s.udies. however, indicated that the human homologue 
of mall PA-FABP (19). did not compensate for the loss ol 
A-FABP either in the non-invas.ve or the invas.vc tumors 
analysed in this study (Fig. ID and E). In addn.on. Hg. J 
shows 2D gels ol |»S]- methionine labeled proteins <™> nl '" 0 
code I). Ta TCCs ( 192-4. T 4 ; F.g. 3A and 192-1. T,; F.g. 3B). 
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Figure 3. Levels of A-FABP and PA-FABP protein in grade II. Ta tumors 
resected from the same patient. The two upper panels show the 2D gel 
auioradiograms of (^SJ- methionine labeled proteins from TCCs (grade II, 
Ta) resected from the same patient. A, TCC 192-9 tumor 4 and B,- TCC 
192-9 tumor I. Only the relevant area of the auioradiograms are shown. The 
low panel shows the RT-PCR analysts of A-FABP mRNA expression in the 
same tumors (see also legend Id Fig. 2). 



which differ significantly in their levels of A-FABP protein 
and mRNA (Fig. 3, low panel). As shown in Fig. 3, the 
decrease in A-FABP observed in TCC 129-4, T, is not 
accompanied by an increase in the PA-FABP prolein (Fig. IE). 

Discussion 

Of the TCC progression markers identified to date, A-FABP 
is perhaps one of the most interesting as its presence correlates 
both with the grade of atypja (p=0.0006) and the stage of the 
disease (p=0.0269) (3). A-FABP is a low molecular weight 
protein belonging to a cytosolic multigene family of lipld- 
binding proteins that include heart, liver, intestinal, muscle, 
brain, skin and epithelial isoforms (20). Members of the 
FABP family are highly expressed in differentiated cells and 
show narrow tissue distribution. Their precise function is at 
present unknown, although there is evidence suggesting that 
Ihey may play roles in intracellular lipid transport and 
metabolism, signal transduction (21,22) as well as growth 
control and differentiation (23). The role in signal transduction 
has been inferred from ihc fact that long-chain fatty acids and 
their metabolites can act as primary and second messengers 
in specific signalling pathways (24). Recently, it has been 
shown thaf A-FABP may play a central role in the pathway 
that links obesity with insulin resistance, most likely by 
connecting the fatty acid metabolism with the expression of 
TNF-o (18). Furthermore, [here is evidence indicating that 
the A-FABP gene contains sequence information necessary 
for differentia I ion- dependent ex pi ess ion in adipocytes (25). 
Our own data derived from the study of TCCs and normal 
moihelium suggest that A-FABP may be required foi normal 
urothelium differentiation (I), as may be the case foi PA- 
FABP in the skin ( 19). 



Considering the potential prognostic value of A-jFABP 
protein and/or mRNA in TCC progression it was important ro 
determine if the levels of both type of rnacromGle 
correlated both in the non-invasive and the invasive lesions 
expressing and lacking A-FABP. The need for such correlation 
was underlined by recent studies of Anderson and Seilhamer 
(15), who reported a lack of overall correlation between the 
mRNA and protein levels of. 45 rat proteins analysed by 2D 
PAGE in combination with cDNA arrays^ Their data yielded a 
correlation coefficient of 0.45 which is half way between weak 
and perfect correlation. Clearly, our data showed a very' good 
correlation between the protein and mRNA levels of A-FABP 
in all tumors analysed indicating that the loss of A-FABP 
protein observed in. some tumors is not due to post- 
transcriptional regulation. 

Recently, knockout mice carrying a null mutation in the 
aP2 gene encoding for A-FABP was produced (18). These 
animals do not show an obvious morphological or metabolic 
phenotype. but exhibit a 20-fold increase in the levels of the 
keratinocyte type FABP (mo/7), which may compensate for 
the loss of the deleted gene (18); The human homologue of 
the mall gene. PA-FABP, was cloned in our laboratory and 
has been shown to be highly upregulated in psoriatic skin as 
well as in abnormally differentiated primary keratinocytes 
(19). PA-FABP is expressed in normal urothelium together 
with A-FABP (3), and ongoing studies in the laboratory have 
shown that its level decreases significantly as tumors progress. 
Interestingly, the studies reported in this article did not revealed 
a compensatory up- or down-regulation of PA-FABP in the 
TCCs analysed, supporting the contention that PA-FABP may 
also play a role in cell growth and differentiation (19). . 
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j . AJ y JJ> Patients than m normal controls (P < 0 OH In <:ihi 

Sa and ™ iUn8S - Ll C ° ntrast ' there was mar ^d upregulation of MIF 

mKNA and protein expression n the ARDS !i,n„, i n a nrv? . . " 01 1W 
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Restored expression and activity of organic ion transporters rOATl, 
rOAT3 and rOCTZ after hyperuricemia in the rat kidney. 

H abtl X. Yano I , Okuda M Fukatsu A . Inni K , 

Department of Pharmacy, Kyoto University Hospi tal, Faculty of Medicine, Kyoto 
University, Sakyo-ku, Kyoto 606-8507, Japan. 

We previously reported that in hyperuricemic rats, renal impairment occurred and organic 
ion transport activity decreased, accompanied with a specific decrease in the expression 
of^oisanic-aiuon transporters, rOATl and rOAT3, and organic cation transporter 

act^l StU p y * WC investi 8 ated the reversibility of the organic ion transport 

activity and expression of organic ion transporters (slc22a) during recovery from 
hypenmcemia. Hyperuricemia was induced by the administration of a chow containing 
unc acid and oxomc acid, an inhibitor of uric acid metabolism. Four days after 
discontinuance : of thechow, the plasma uric acid concentration- returned to the normal 
uu u.i functlons su <* as creatinine clearance and BUN levels were restored 
although the recovery of tubulointerstitial injury was varied in sites of the kidney ' 
Basolateral uptake of p-aminohippurate (PAH) and tetraethylammonium (TEA) and both 
protem and mRNA levels of rOATl, rOAT3 and rOCT2 in the kidnev pradnallv 
Z r °^ I "1 ng 14 da y s ofrecover y from hyperuricemia. Basolateral PAH transport 
X?^? 8 ^ COn ? lat T With the P rotein le vel of rOATl (r(2)=0.80) than rOAT3 

S^a^frf^S P ^ r ° ne COncentration > « a dominant factor in 

the regulation of rOCT2, was gradually restored during the recovery from hyperuricemia 

hel d„ h T™. PlaSma teSt0SterOne leVCl r0CT2 Protein^xpression * 

^„,n rt Y niV? S T RCant ThCSe reSUltS SU ^ eSt that the ^^ion of organic ion 
tonsporters rOATl, rOAT3 and rOCT2, by hyperuricemia is reversible, and the orgaTc 
ion transport act.vity restores according to the expression levels of these transporters. 
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Regulation of cytochrome P4501A1 in teleosts: sustained induction of 
CYP1 Al mRNA, protein, and catalytic activity by 2,3,7,8- 
tetrachlorodibenzofuran in the marine fish Stenotonius chrysops. 

Hahn ME , Stcgeman JJ . 

Biology Department, Woods Hole Oceanographic Institution, Massachusetts 02543. 

Cytochrome P4501 Al (CYP1 Al) is known to play important roles in the activation and 
detoxification of carcinogens and other toxicants in vertebrate animals, including fish. 
Although extensively studied in mammalian systems, the regulation of CYP1A forms in 
other vertebrates is less well understood. We examined the time course and dose- 
response relationships for induction of CYP I Al mRNA, protein, and catalytic activity by 
23,7,8-tetraclilorodibenzofuran (TCDF) in the marine fish Stenotomus chrysops (scup). 
The time course of CYP1 Al induction was determined following a single ip dose (10 
nmol/kg) of 2,3,7,8-TCDF. Hepatic ethoxyresorufin O-deethylase activity was increased 
after I day, reached a maximum by 8 days, and was still elevated 14 days after treatment. 
The content of irnmunodetectable CYP I At protein in liver was elevated on Day 1 and 
continued to increase through 14 days. CYP1A1 protein content was also strongly 
induced in heart and gill beginning at 2 days after treatment and extending through Day 
14. Hepatic CYP1 Al mRNA was strongly induced by 1 day after dosing and remained 
elevated through 14 days. The sustained induction of CYP1 Al mRNA by 2,3,7,8-TCDF 
contrasts with the transient induction seen previously in fish treated with nonhalogenated 
inducers and most likely reflects differences in persistence of the inducers. Dose-response 
studies indicated that induction of CYP 1 Al mRNA, protein, and catalytic activity 
occurred following doses of 2,3,7,8-TCDF as low as 0.4 nmol/kg (120 ng/kg), within the 
range of whole-body contents of this congener measured in fish from contaminated 
environments. The estimated dose producing half-maximal CYP.l Al induction in scup 
was approximately 2-10 nmol/kg, suggesting that the sensitivity of these fish to induction 
may be as great as or greater than that of rats. In contrast to previous results obtained with 
3,3 , ,4,4 t -tetrachlorobiphenyl (TCB) and beta-naphthoflavone, which appear to inhibit or 
inactivate CYP1 Al in fish and other vertebrates, there was a good correlation among 
levels of CYP1 Al mRNA, protein, and catalytic activity in individual fish following 
various doses of 2,3,7,8-TCDF. The difference in response to 2,3,7,8-TCDF versus 
3,3',4,4-TCB may reflect differences in the inducing potencies of the two compounds 
relative to their similar potencies as irihibitors of CYP 1 A 1 catalytic activity. In additional 
studies to evaluate structure-activity relationships for CYP I Al induction by chlorinated 
dibenzofurans in fish, scup were treated with 2,3,6,8-tetrachlorodibenzofuran (2,3,6,8- 
TCDF). At 10 or 50 nmol/kg, 2,3,6,8-TCDF was inactive as an inducer of CYP1 Al 
mRNA, protein, or catalytic act ivity.( ABSTRACT TRUNCATED AT 400 WORDS) 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA 
analysis and pS2-protein radioimmunoassay. 

Hahnel E, Robbins P, Harvey J, Sterrett G, Hahnel R . 

Department of Pathology, University of Western Australia, Queen Elizabeth II Medical 
Centre, Nedlands. 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-protein 
using a radioimmunoassay, and by determining pS2-mRNA using Northern blotting. 
There was a good correlation between the two measurements, indicating that expression 
of the pS2 gene in breast tissues may be assessed by either method. Since 
radioimmunoassay is technically easier and more efficient than Northern, blotting, 
radioimmunoassay will be the method of choice in routine applications. 

PMBD: 1463873 [PubMed - indexed for MEDLINE] 
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Expression of the pS2 gene in breast tissues assessed by pS2-mRNA analysis 
and pS2-protein radioimmunoassay 



Erika Hahnef, Peter Robbins, Jennet Harvey, Gregory Sterrett and Roland Habnel ' - 
Department of Pathology, University of Western Australia, Queen Elizabeth II Medical Centre, Nedlands, 
6009, Western Australia . ' 



Keywords: breast tissue, pS2-mRNA, pS2 protein, radioimmunoassay 
Summary 

The expression of the pS2 gene in breast tissues was assessed by measuring pS2-prorein using a radioimmu- 
noassay, and by determining pS2-mRNA using Northern blotting. There was a good correlation between the 
two measurements, indicating that expression of the pS2 gene in breast tissues may be assessed by either 
method. Since radioimmunoassay is technically easier and more efficient than Northern blotting, radioim- 
munoassay will be the method of choice in routine applications. V 



Introduction 

Expression of the pS2 gene is controlled by estro- 
gen. This was first described in the JVICF-7 breast 
cancer cell line [1]. pS2 expression has since been 
reported to be useful as a prognostic indicator f2, 
3J, although this was not confirmed in another se- 
ries [4]. 

pS2 expression may be assessed in tissue ho- 
mogenates by analysis of pS2-mRNA [5], by ra- 
dioimmunoassay of the pS2-protein [2], or by in> 
munocytochemical detection of the pS2 protein in 
tissue sections f5], Jt was the aim of this study to 
establish the correlation between pS2-mRN A and 
pS2-proiein.by radioimmunoassay in a series of 
tissues obtained from mastectomy specimens per- 
formed for carcinoma of the breast. Primary breast 
carcinoma tissue, metastatic carcinoma within ax- 
iJJary nodes, and macroscopcaJJy benign breast 
ussue were examined. 
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Materials and methods 
Breast tissues 

Tissue specimens from mastectomies performed 
for carcinoma of the breast were examined. 32 pri- 
mary breast carcinomas, 10 axillary lymph nodes 
containing metastatic breast carcinoma, and 20 
samples of uninvoi ved breast tissue were analyzed 
for pS2 expression. 

The primary breast carcinomas were histolog- 
ically classified using a conventional subclassifi- 
cation. The presence or absence of primary tumour 
was assessed. The presence of metastatic carcino- 
ma within lymph nodes studied was verified by 
histological examination of the node remnant after 
sampling. 

'Uninvolved* breast tissue was sampled from 
sites weJJ removed from the primary breast tumour 
(usually in another quadrant of the breast), and was 
selected only if the tissue appeared macroscopical- 
ly unremarkable'. Tissue sampling occurred imme- 

Queen Elizabeth-)) Medic a) Centre. Nedbnds 6009, Western Australia 
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diately upon anival of the mastectomy specimen in 
the laboratory, with minimal delays between re- 
moval and sampling. 

. Tissues for j>S2 analysis were snap frozen in Iiq- 
uid nitrogen and stored at -70°C until processed. 

Extraction of RNA and determination of 
pS2-mRNA 

Details of the procedure have been described in pur 
previous paper f 6 J. Briefly, the deep-frozen tissue 
was homogenized in a micrp-dismembrator. The 
homogeneous powder, was extracted with guani* 
diniunusothiocyanatephenolchlorofonniso- 
amylalcohol, and RNA was precipitated with iso- 
propanol. The washed RNA jteljet was dissolvedin 
SDS and glyoxylated, and the RNA preparation 
loaded onto agarose gel. After electrophoresis the 
gel was capillary blotted, onto Zeta-probe mem- 
branes. Membranes were hybridized overnight 
with cDNA probes pS2 and 36B4, which were la- 
beled with [a*P] dCTP by nick translations. 
Washed membranes were exposed to Kodak X- 
omat AR film. Relative intensities of the mRNA 
bands were assessed visually as not detectable, ve- 
ry weak, weak,, medium, strong, and very strong, 
taking the intensities of theubiguitous 36B4 bands 
into account! 



' Radioimmunoassay of pS2-protein 

Deep frozen specimens were pulverized with a rni- 
crodismembrator. The tissue powder was suspend- 
ed in 10 volumes of pH7.5 phosphate buffer. The 
homogenate was centrifuged in a refrigerated cen- 
: trifuge at 4°C for 60 minutes at 2600xg. The su- 
pernatant was removed with a Pasteur pipette, 
carefully avoiding the fat layer on the top. The pro- 
tein concentration in the supernatant was estimated 
by use of the Coomassie dye-binding method [7]. 
An aliquot of the supernatant was diluted to a pro- 
tein concentration between 1 and 2mg/ml before 
assay of ibe pS2-proiein. In one case the protein 
concentration of the supernatant was well beJow 
lms/ml. 



The estimation of the pS2-protein was perform- 
ed using a solid phase, two-site radioimmunoas- 
say. The, kits were bought from CIS Biointerna- 
tional, Gif-sur-Yvette, France (ELSA-PS2). In this 
method the molecules of pS2 are sandwiched be- 
tween two monoclonal antibodies; the first one is 
coated on the ELS A solid phase, the second one is 
radiolabeled with J25-iodine. The radioactivity 
bound to the ELS A is proportional to the concen- 
tration of pS2-protein. Details of the procedure are 
supplied with the kit (8). * 

Results and discussion 

32 primary , breast carcinomas, metastatic breast 
carcinoma in 10 lymph nodes, and 20 samples of 
benign breast tissue from mastectomies were ij> 
vestigated. Two of the carcinomas were of the infil- 
trating lobular type, two were ductal carcinomas in 
situ, one was a multicentric invasive ductal carci- 
noma, all others were invasive ductal carcinomas. 

Examples of pS2 Northern blots have been 
shown in our previous paper [6] which demon- 
strate that undegraded pS^mRNA can be isolated 
by the method used. 

The results of the pS2-protein and pS2-mRNA 
assays are shown on Hg. h There was a good corre- 
lation betweerube two types of results. When pS2- 
mRNA could not be detected by Northern blot, 
pS2-protein results were usually below 1 ng/mg 
protein (22 of 30), or between 1 and 3 .7 ng/mg (6 of 
30). Two were exceptions (7/7 and 14.6); one of 
them could have been due to the very low protein 
content in the cytosol which would lead to a large 
pS2 value and an associated error. There was no 
explanation for the other high result. Very weak 
pS2-mRNA signals on Northern blots correspond- 
ed to pS2-protein values between 1J and 19.2 with 
. an average of 6.6ng/mg protein (median 5.7): The 
mean and median pS2-protein concentration in the 
tissue with weak pS2~mRNA signals were 14.3 and 
10.7 ng/mg protein, respectively. The average pS2- 
proiein concentration increased to 32.7 (median 
3L5) ng/mg protein for tissues assessed as medium 
pS2-mRNA intensity, and to 43.3 (median 53.8) 
ng/mg protein for tissues with strong or very strong 
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pS2-mRNA signals. These values should be used 
as an approximate guide only, since the number of 
samples in the various groups was fairly smalh 
One-way analysis of variance confirmed that the 
means of the pS2-protein values in the groups 
made up according to their pS2-mRNA signal in- 
tensity, were significantly different (p <10^). 

If the pS2 gene is expressed, its expression is on 
average greater in breast carcinomas than in unin- 
volved breast tissue. If one takes pS2-protein val- 
ues above 4ng/mg protein as cut-off, the average 
pS2-protein in 14 breast cancers was 343 (median 
■ (35.2), while it was only 18.1 (median 13.8) in 12 
uninvolved breast tissue samples.. ]f the cut-off is 
taken at lOng/mg protein, average pS2-protein in 
breast carcinoma is also abour twice the level of 
uninvolved tissue. There were not enough lymph 
node metastases which expressed the pS2 gene to 
allow a comparison with carcinoma or uninvolved 
breast tissue. , 

Recent preliminary results of pS2 by radioim- 
munoassay [9] are similar to ours for breast cancer 
but considerably lower than our results for normal 
breast tissue. 

The values of the pS2-protein measured obvi- 
ously depend on the protein used for calibration. 
We used the pS2- protein standards supplied with 
the CIS kit, which according to the supplier gave 
values from 0 to 740ng/mg protein in a series of 
205 breast cancer cytosols. Previously, a different 
standard had been used for presumably the same 
series of breast carcinomas [2], and a conversion 
factor to current standards is given as 2.8 [8]. 

It was noticed that the correlation between pS2- 
protein and pS2-mRNA was better in breast carci- 
noma specimens than in uninvolved breast tissue. 
This is unexplained, though it could be due to the - 
variable content of cell or tissue types in adjoining 
pans of a specimen, a variation more likely to oc- 
cur in our sampling of non-malignant breast tissue 
compared to sampling of carcinomas. A similar 
variability in breast carcinoma specimens will 
probably have a smaller influence on the pS2 re- 
sults, since the malignant cells - if they do express 
the pS2 gene - contain more pS2-protein than nor- 
mal breast. 
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Fig. A Correlation between pS2-protein by radioimmunoassay and 
p$2-mRNA by Northern blot. <•) = 22 results bcJow K The horizontal 
lines indicate the mean values. 
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The role of the epidermal growth factor receptor in sustaining neutrophil 
inflammation in severe asthma. 

Hamilton LM, Torres-Lozano C. Puddicombe SM Richter A, Kimber I . Dearman 
M Vrugt B, Aatbers R , Holgate ST Djukanovic R . Wilson SJ, Davies DE . 

Division of Infection, Inflammation & Repair, School of Medicine, University of 
Southampton, UK. 

BACKGROUND: The extent of epithelial injury in asthma is reflected by expression of 
the epidermal growth factor receptor (EGFR), which is increased in proportion to disease 
severity and is corticosteroid refractory. Although the EGFR is involved in epithelial 
growth and differentiation, it is unknown whether it also contributes to the inflammatory 
response in asthma.- OBJECTIVES: Because severe asthma is characterized by 
neutrophilic inflammation, we investigated the relationship between EGFR activation and 
production of 1L-8 and macrophage inhibitory protein- i alpha (MIP-1 alpha) using in 
vitro culture models and examined the association between epithelial expression of IL-8 
and EGFR in bronchial biopsies from asthmatic subjects. METHODS: H292 or primary 
bronchial epithelial cells were. exposed .to EGF or H202 to achieve ligand-dependent and 
Hgand-independent EGFR activation; IL-8 mRNA was measured by real-time PCR and 
IL-8 and MIP-1 alpha protein measured by enzyme-linked immunosorbent assay 
(ELISA). Epithelial IL-8 and EGFR expression in bronchial biopsies from asthmatic 
subjects was examined by immunohistochemistry and quantified by image analysis; 
RESULTS: Using H292 cells, EGF and H202 increased IL-8 gene expression and 
release and this was completely suppressed by the EGFR-selective tyrosine kinase 
inhibitor, AG1478, but only partially by dexamethasone. MlP-lalpha release was not 
stimulated by EGF, whereas H202 caused a 1 ;8-foId increase and this was insensitive to 
AG1 478. EGF also significantly stimulated IL-8 release from asthmatic or normal 
primary epithelial cell cultures established from bronchial brushings. In bronchial 
biopsies, epithelial IL-8, MlP-lalpha, EGFR and submucosal neutrophils were all 
significantly increased in severe compared to mild disease and there was a strong 
correlation between EGFR and IL-8 expression (r = 0.70, P < 0.001). CONCLUSIONS: 
These results suggest that in severe asthma, epithelial damage has the potential to 
contribute to neutrophilic inflammation through enhanced production of IL-8 via EGFR- 
dependent mechanisms. 
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Human hepatic microsomal epoxide hydrolase: comparative analysis of 
polymorphic expression. P analysis of 
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Varfati0n ^ ^ eXprCSSi0n ° f human microsomal epoxide hydrolase 
(mEH) may be an important risk factor for chemically induced toxLties Sud^ 
= and teratogenesis. In this study, phenotypic variability and n^H g ^ 

sa31 mPH? y m aCt,vity levds across Ae P^el of 40 liver 

In^SiH -TT ^ dS ° demonstra ^ strong correlation (r > or 74) with 
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Department of Oncology, Finsen Center, National University 
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Abstract 

Lung cancer is the most common cause of cancer death 
in developed countries. The prognosis is poor, with less than 
J 5% of patients surviving 5 years after diagnosis. The poor 
prognosis is attributable to lack of efficient diagnostic meth- 
ods for early detection and lack of successful treatment for 
metastatic disease. Most patients (>75%) present with stage 
111 or JV disease and are rarely curable with current ther- 
apies. Within the last decade, rapid advances in molecular 
biology, pathology, bronchology, and radiology have pro- 
vided a rational basis for improving outcome. These ad- 
vancements have led to a belter documentation of morpho- 
logical changes in the bronchial epithelium before development 
of clinical evident invasive carcinomas. This has changed our 
concept of lung carcinogenesis and emphasized the multislep 
carcinogenesis approach on several levels. Combined with 
the technical developments in bronchoscopic techniques, 
e.g., Jaser-induced fluorescence endoscope (JJFE) bronchos- 
copy, we now have improved methods to localize preinvasive 
and early-invasive bronchial lesions. With the LIFE bron- 
choscope, a new morphological entity (angiogenic squamous 
dysplasia) has been recognized, which might be an impor- 
tant biomarker and target for antiangiopenic ehemopreven- 
live agents. To reduce the mortality of lung cancer, these 
new technologies have been taken into ibe clinic in different 
scientific settings. The nse of low-dose spiral computed to- 
mography in the screening ol a high-risk population has 
demonstrated (he possibility of diagnosing small peripheral 
tumors that are not seen on conventional X-ray. A shift in 
the therapeutic paradigm from targeting advanced clinically 
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manifest lung cancer toward asymptomatic preinvasive and 
early- invasive cancer is occurring. The present article re- 
views the recent advances in the diagnosis of preinvasive and 
early-invasive cancer to identify biomarkers for early detec- 
tion of lung cancer and for c hem ©prevention studies. 

Introduction 

Lung cancer is the most common cause of cancer deaths in 
the countries of North America and other developed countries, 
accounting for 29% of all cancer deaths and more deaths than 
from prostate, breast, and colorectal C3ncer combined in the 
United States (I). Lung cancer. will be diagnosed in —170,000 
new patients in the United States in the year 2000, and < 15% of 
them will survive 5 years after diagnosis (1)- The prognosis for 
the patients with lung cancer is strongly correlated to the stage 
of the disease at the time of diagnosis. Whereas patients with 
clinical stage 1A disease have a 5-year survival of about 60%, 
the clinical stage 1I-IV disease 5-year survival rate ranges from 
40% to less than 5% (2). Over two-thirds of the patients have 
regional lymph-node involvement or distant disease at the lime 
of presentation (3). The poor prognosis is largely attributable lo 
the lack of effective early detection methods and the inability lo 
cure metastatic disease. The unsatisfactory cure rates supports 
efforts aimed at early identification and intervention in lung 
cancer. 

Historically, the only diagnostic tests available for the 
detection of lung cancer in ils early stages were chest radiogra- 
phy and sputum cytology. The efficacy of these tests as mass 
screening tools was evaluated in controlled trials sponsored by 
the NCI 3 and conducted at Johns Hopkins University, Memorial 
Sloan-Kcticring Cancer Center, and the Mayo Clinic during the 
1970s (4-6). The principal goal of these studies was to deter- 
mine whether a reduction in lung cancer mortality could be 
achieved by adding sputum cytology testing lo annual scieening 
by chest radiography. Results from these trials showed thai both 
tests could detect piesymptomatic, eaily- stage carcinoma, par- 
ticulaily ol squamous cell type. Fesectabiliry and survival rates 
were found to be generally higher in ihe study groups than in ihe 
control groups. However, improvements in rcscciabilily and 
survival did not lead fo a reduction in overall lung cancer 
mortality, ihe most critical end point. A subsequent study of 
6?46 Czechoslovakia)! male smokeis also found no icdnciion in 
limy cancer mortality after dual screening by chest radiography 
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and sputum cytology (7). The negative results from these 
screening studies lead the NCI and other health policy and 
research groups to conclude that mass screening programs in- 
volving periodic sputum cytological evaluation and chest radio- 
graphs could not be justified. However, controversies in the 
methodology and interpretation of the data from these studies 
have later been extensively discussed (8, 9). One additional 
study of annual chest X-ray screening is currently being con- 
ducted by the NCI; The Prostate-, Lung-, Colorectal-, and Ovar- 
ian (PLCO) screening trial. This trial includes individuals 55-74 
. years old, but they are not selected for Ibis trial on the basis of 
high risk for lung cancer (e.g., smoking history with >20 
p3ck-years). 

The failure of clinical trials to demonstrate the efficacy of 
sputum cytology and chest radiography as mass screening tools 
has resulted in a search for better diagnostic approaches for 
early lung cancer detection that take advantage of recent devel- 
opments in molecular biology, gene technology, and radiology 
(10). Furthermore, as has been the case for mammography 
screening for breast cancer, it has also been important to identify 
risk groups for rung cancer. 

Although, much is known about predisposing factors, nat- 
ural history, and the outcome based on histology and stage, our 
understanding remains very incomplete in many areas. What are . 
the early premalignanl changes molecularly, biochemically, and 

• morphologic ally? Which changes are reversible and which are 
not? What research tools are available lo provide answers to 
these questions? The identification of preinvasive lesions allows 
for developing promising merhods for early intervention (II). 
The therapeutic paradigm and focus are today shifting from 
targeting only clinically verified lung cancer as previously to- 

* ward targeting. the premalignanl and early- malignant lesions. 
Furthermore, the prospect of Jung cancer screening has today 
become more meaningful as a consequence of recent develop- 
ments in biology and radiology and better possibilities to define 
high-risk populations most suitable foi lune cancer screenine 
(12). 

The present article will focus on the clinical perspectives of 
our biological knowledge of premalignanl and early- malignant 
lesions and the potential of the recent technological advance- 
ment for early diagnosis of lung cancer. 

Pathology of Preinvasive and Early Invasive 
Bronchial Lesions 

Most of the ef foils to classifv hmtj cancer have been 
directed toward invasive carcinoma (13). However, better un- 
derstanding of the pathogenesis of lung cancer aroused renewed 
interest in morphological abnormalities 'hat fall short of inva- 
sive carcinom:i but may indicate initial ion of carcinogenesis. 
These morphological abnormalities are refened to as preinva- 
sive lesions and are shown in Fig. I. The last edition of the 
WHO classification of lung tumors included the classification of 
preinvasive lesions as a sepaiatc section. Numerous recent s Ind- 
ies have indicated dial hinj: cnncci is not the icsult of a sudden 
translonmne event in the bronchial epithelium hut a muliistcp 
process in which piadually act rum;: sequential ^enriic and 
cellular changes result in the formation of an invasive 
malignant) linn or Mucosal changes in the huge airwavs 



may precede or accompany invasive squamous carcinoma in- 
clude hyperplasia, metaplasia, dysplasia, and CIS (14); Hyper- 
plasia of the bronchial epithelium and squamous .metaplasia 
have generally been considered reversible, and not premalignanl 
. in the sense of squamous dysplasia and CIS (15). 

Squamous metaplasia is a common finding, especially as a 
response to cigarette smoking. Peters et al (16) studied bron- 
choscope biopsies from six sites in 106 heavy cigarette smok- 
ers; Squamous metaplasia was noted at one or more biopsy sites 
in approximately two- thirds of the group, and one- fourth 
showed squamous metaplasia in three or more biopsy sites. The 
incidence of squamous metaplasia increased with smoking his- 
tory and was highest in individuals who had smoked more than 
two packs of cigarettes a day. Auerbach et al (17) noted similar 
. findings in autopsy tissues: basal cell hyperplasia and squamous 
metaplasia are increased in smokers in proportion to smoking 
history. Hyperplasia and metaplasia are believed to be reactive 
changes in the bronchial epithelium, as opposed to true preneo- 
plastic changes (17, 18). The reasons for this include: (a) they 
are frequently found in association with chronic inflammation, 
and may be induced by mechanical trauma; (b) they spontane- 
ously regress after smoking cessation; (c) in chronic smokers, 
the molecular changes present in these lesions are similar to 
those present in histologically normal epithelium; and (d) there 
are no reports linking their presence to increased risk for devel- 
oping lung cancer. In contrast, moderate-to-severe dysplasia and 
CIS lesions seldom regress after smoking cessation (19). 

Dysplasia and OS aie changes that frequently " precede 
squamous cell carcinoma of the lung. Saccomanno et al. (20) 
studied more than 50,000 samples from 6,000 men, many of 
whom had worked in the ujanium mining industry. Both smok- 
ing and uranium mining (radon exposure) were found lo be 
associated with increased incidence of dysplasia, CIS, and in- 
vasive cancer. The studies of Saccomanno et al established (hat 
increasing degrees of sputum atypia may be recognized an 
average of 4-5 years before the development of frank long 
carcinoma. 

Another question is: which grades of sputum atypia pro- 
press to cancer? From. the Johns Hopkins cohort of the NCI 
chest X-ray/spumm screening trial, we know that among indi- 
viduals wilh moderate atypia on sputum screening. — 10% de- 
veloped known cancer up to 9 years later. Among individuals 
with severe atypia on (he sputum screening. >40% developed 
known cancer during the same lime period (21). Although there 
are daia in the literature showing the relationship between 
sputum atypia and subsequent invasive cancer, there is still vcrv - 
little information about the histological progression in the bron- 
chial mucosa in the high risk populations. Jn a jeeent publica- 
tion, nine patients with CIS were followed with antofluorcs- 
eence bronchoscopy at regular intervals, and 5 (56%) had 
pjopu'ssion to invasive cancer despite endobronchial therapy 
i?7). 7 he number of invasive cancers might even have been 
hiebci if treatment had not been not yiven. Ongoing studies of 
hit'h-iisk subjects (r.t-.. the Colorado sputum cohort stndv) 
including smal lollow up bronchoscopies will provide evidence 
t elated to the frequency of development of invasive Jung cancer 
as it relates rosinokinj: history, airflow obstruction, and sputum 
atypia 

Since the pievious WHO- classification was published in 
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Fig. I A f squamous metaplasia. The 
cells are widely dispersed, with a reg- 
ular marmatjon from the basal region 
lo ihe top. There is keratint2aiion, and 
the nucJei/cyropJasniic ratio is low. B, 
.moderate dysplasia with ASD. Hypei- 
celrulnriry of ihe epithelium wilh in- 
complete maturation and mkropapil- 
lary invasion of capillaries are seen. 
The nuclci/cytoplasmic ratio is high. 
C, severe dysplasia. There is marked 
pleomorphisin of ihe cells with irregu- 
larity and prominent nucleoli. 
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1981, two nonsquamous lesions have been added to ibe WJ10 
classification of premalignan! lesions: atypical alveolar hyper- 
plasia and diffuse idiopathic neuroendocrine cell hyperplasia 
(13). Both of these lesions are diagnosed rarely. The former 
consists of lesions <5 mm in diameter and composed of a 
peripheral epithelial cell proliferation with minimal cytological 
atypia or stromal response and resembles bronchioloalveolar 
carcinoma. The lesion has been seen in hirig specimens resected 
for lung caDCer, but no prospective significance of this lesion 
has been reported. However, this morphological lesion may play 
a role for the pathogenesis of peripheral lung adenocarcinomas 
(23, 24). The resolution of spiral CT (currently about 3 mm) 
approaches the diameter of these lesions, and it is anticipated 
that atypical alveolar hyperplasia will be increasingly encoun- 
tered in subjects undergoing this procedure (25). Diffuse idio- 
pathic neuroendocrine cell hyperplasia consists of a patchy 
increase in the number of well-differentiated neuroendocrine 
cells in the bronchioles. This process may result in the formation 
of small carcinoid tumors, and for this reason it is considered 
. "preinvasive." To date, small cell caicinomas have not been 
associated with this lesion (13). 

Recently, the use of fluorescence bronchoscopy (see be- 
low) has increased the recognition of dysplastic lesions inlhe 
large airways and a new morphological entity, ASD, was iden- 
tified (26). Dysplasia of bronchial epithelium in "micropapjllo- 
matosis" and the possible link between angiogenesis and prein- 
vasive bronchial epithelial dysplasia were recognized as early as 
1983 by Muller and Muller (27), who also described the ultra- 
structure of these lesions. It has been suggested that this angio- 
genesis, which is recognized as capillary loops projecting into 
the dysplastic bronchial lining, is responsible for the reduced 
fluorescence seen in dysplastic lesions by LIFE bronchoscopes 
(Figs. I and 3; Ref. 26). Funue prospective studies will show 
whether this morphological entity is correlated with a progres- 
sion io lung cancer so as to be a target for the use of aritiangio- 
genic agents for chemoprevention. 

t In general, there are several questions/problems relating lo 
prcmalignant lesions, which will be addressed in future studies: 

(a) The motphological criteria for premalignant and early- 
malignant changes, both on sputum cytology and in bronchia) 
biopsies, have to be validnied fot intra- and imei observer repro- 
ducibility. 

(b) Uniform and reproducible morphologies l/cvi ologicul 
criteria have to be published more extensively, and a twining set 
of slides should be available. By the use of Internet technology, 
this could be more easily facilitated (78). 

(c) The correlation of sputum atypia and histological 
changes in ihc bronchi in high-risk population is not well 
defined. 

[tf) The natural comse of preinvasive changes in the bron- 
chi from the high risk subjects needs to be clarified through 
longitudinal, prospective studies with reference to histological 
changes in the bronchi. Ongoing longitudinal studies with flu- 
orescence bronchoscopy and multiple biopsies with hisMogy 
nnd other biomarkers will define the ability of these markers to 
assess lor risk. 

tV) What is the pathology. -I'M nlogy of the MnaJk often pe- 
ripherally located, rumors (3 mm in diameter), which are more 



often diagnosed with newer radiological techniques (e.g., low- 
dose spiral CT)? 

(/) Optimization of the tissue procurement and processing 
techniques are important. Distinction of reactive from neoplastic 
, processes is usually straightforward, but diagnostic difficulties 
may arise in the case of (a) inadequate or poorly prepared 
histological material to evaluate and (b) the presence of cyto- 
logical atypia in epithelium stimulated by inflammation, viral 
infection, radiation, or chemotherapy. 

(g) DNA array analyses of gene expression: will it be 
useful? How to collect proper mRNA? Can mRNA extracted 
from microdissected cells obtained at bronchoscopy be globally 
amplified and still remain representative of RNA present in sittt? 

Biology of Lung Carcinogenesis and Potential Early 
Detection Markers. 

Lung cancer is the end-stage of multiple-step carcinogen- 
esis, in most cases driven by genetic and epigenetic damage 
caused by chronic exposure lo tobacco carcinogens. The genetic 
instability in human cancers appears to exist at two levels: at the 
chromosomal level, including large scale losses and gains; and 
at the nucleotide level including single or several base changes 
(29). Lung cance*rs harbor many numerical chromosomal abnor- 
malities (aneuploidy) and structural cytogenetic abnormalities 
including deletions and nonreciprocal translocations (30). At 
least three classes of cellular genes are involved: proto- onco- 
genes, TSGs, and DMA repair genes. Oncogenic activation often 
occurs via point mutations, gene amplification, or chromosomal 
rearrangement, whereas TSGs are classically inactivated by the 
loss of one parental allele combined with a point or small 
mutation or aberrant methylalion of a target TSG in the remain- 
ing allele. Additionally, dysregufared gene expression (either 
increased or decreased expression) can occur by other, as yel 
unknown, mechanisms (30). Present studies have not yel con- 
firmed a prominent role for abnormalities of DNA repair genes 
in lung cancer. 

Preneoplastic cells contain several molecular genetic ab- 
normalities identical to some of the abnormalities found in overt 
lung cancer cells (Fig. 2). These include allele loss at several 
loci (3p, 9p. tip, and Hp), myc and ros up- regulation, eye) in Dl 
ovcrexpression, p53 mutations, and increased immunorcactiv- 
ity, be I- 2 ovcrexpression and DNA aneuploidy (31-35). Allclo- 
typing of precisely microdissected. preneoplastic foci of cells 
suggests that the earliest changes in the bronchial epithelium is 
allele loss at chromosome regions 3p, then 9p, Sp, I7p. 5q. and 
then kis mutations (36-39)/! he biological meaning of LOH is 
only vaguely understood. Recent evidence suggests that LOH 
may be a consequence of mitotic recombination, thai there is 
only infrequent physical loss of genetic loci, and that I.OH 
probably precedes chromosomal duplication HO). Allelic loss 
would thus be significant primarily in the presence of mntaiion 
in the retained allele, and gene dosage would not be expected to 
exert a phenotypic effect in LOH. Some reports have indicated 
i bar ac matron occurs at early carcinoma stages (34). His- 
tologically normal bronchial epithelium adjacent to cancers has 
also been shown ro have certain geneiic U^sci. Aiypieal ade- 
noma ions hyperplasia, the potent iat |. f eruisur lesion ol adeno- 
carcinomas, often have Ki-nn mutations (-Jl) 
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Fig, 2 Sequential changes during lung cancer pathogenesis. Although multiple genetic markers ore abnormal in lung cancers., their appearance during 
the lengthy preneoplastic process varies. The liming of the appearance of these changes has been investigated in bronchia) prcneoplasia. because 
sequential sampling of the peripheral lung is technically difficult. Several alterations have been described in histologically normal bronchial epithelium 
of smokers. Other changes have been detected in slightly abnormal epithelium (hyperplasia, metaplasia) which we do not consider to be true 
premalienaol lesions. These changes are regarded as early changes. Molecular changes detected frequently in dysplasia are regarded as intermediate 
in timing, whereas those usually detected at the CIS or invasive stages are regaided as Isle changes. It should be stressed that although there is a usual 
order, exceptions regarding the timing of onset may occur. Some changes are progressive, such as chromosome 3p deletions. Thus small discrete 
changes are present early, progressively become more extensive during pathogenesis, and frequently involve all or almost all of the arm in CIS 
samples. Although allelic loss at the TP53 locus may precede the onset of mutations, data on this sequence are scant. Dysrcgulation of the RNA 
component of telomerase (with its appearance in nonbasal cells) is an early event, whereas up- regulation of the gene is a relatively late event. 
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Molecular changes have been found nol only in the Junes 
of patients with lung cancer, but also in the lungs of omen! and 
former smokers without lung cancer (IS. 42. 43). These obser- 
vations are consistent with the multistep model of carcinogen- 
esis and "field cincciizniion" process, whereby the whole region 
is repeatedly e.\poscd to carcinogenic damage (tobacco smoke) 
and is al risk lor developing multiple, separate, clonal! v unre- 
lated foci of neoplasia. The widespread aneuploidy that occurs 
throughout the respiratory tree of smokeis supports this theory 
(•1-1). However, the presence of the same somatic p53 point 
mutation at widely dispersed preneoplastic lesions in a smoker 
without invasive lung cancel indicates ihat expansion of a single 
progenitor ck>m* may spread throughout the respirator tree 
(4S). These molecnlru alterations might thus be important 
I arrets lot use in the early detection of lung cancer and lor use 
as surrogate biomarkers in the follow-up of chemopi event ion 



studies. Detection of these mutant cells should be possible with 
the different molecular techniques in accessible specimens. The 
prospects of diagnosing these biological abnormalities in mul- 
tiple types of clinical specimens are discussed below. 

Specimens for Clinic of T citing: Sputum 

Since rhe 1930s, cytological examination of sputum has 
been used for the diagnosis of lung cancer (46). Cyiological 
examination of sputa, especially multiple samples, is helpful for 
the detection of central rumors arising from the larger bronchi 
[e.».. squamous cell- and small cell carcinomas). Ha foliated 
cells (torn peripheral tumors, such ;ts adenocarcinomas, arising 
from the smaller ainvays (small bronchi, bronchioles,, and alve- 
oli), especially those less than 2 crn in diametei. can be detected 
only occasionally in sputum samples. This has become of 
gieaiei importance because the changes in cigarette exposition 
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(filters and decreased nicotine content) have created an increase 
in adenocarcinomas and a decrease in squamous carcinomas 
(47-49). The sensitivity of sputum cytology for early lung 
cancer is only in the 20%-30% range from screening studies, 
but by adhering to proper specimen collection, and processing 
and interpreting criteria, the yield can be substantially improved 
(50, 51). The data on the reliability of the sputum are conflicting 
(52-54). Browman ei al (52) reported interobserver agreement 
of 68% for exact and 82% for within - )- category. HoIIiday et 
ol (54) reported low agreement within observers (27- 60%) and 
across observers (13-50%). Within - 1 - category intraobserver 
agreement underwent a two- or 3-fold increase in agreement, 
which was also the case for interobserver agreement. The var- 
iation in intra- and interobserver agreement seems to depend on 
experience among the cytotechnicians/cytopathologists and the 
composition of categories studied. A higher degree of agreement 
is obtained for higher grades of dysplasia (54). Risse et al (55) 
showed that the ability to detect premalignant conditions is 
dependent on the number and type of cells present in the deeper 
airways, suggesting a mode of improvement that is unrelated to 
observer reliability. MacDbugalJ et al (56) concluded that spu- 
tum cytology was too insensitive and insufficiently accurate to. 
be included in the routine work-np of any patient suspected of 
having lung cancer. To improve the refiability of sputum cytol- 
ogy examinations a simplification of the diagnostic categories 
from .6 (normal; squamous metaplasia; mild, moderate, and 
severe atypia; and carcinoma) to 2-3 categories have been 
proposed (54). Future climcopalholoeicaj studies will be re- 
quired to validate this concept. 

To improve the sensitivity of sputum examination as a 
population-screening fool for the detection of early lung cancer, 
several approaches are currently under development. 

jmnm»r>staimng. Annual sputum specimens obtained 
from individuals screened at Johns Mopkins were obtained, and 
the patients were monitored for 8 yeais (57). Because the 
clinical outcome of these patients was known, archival sputum 
specimens were screened for the presence of biomarkers that 
could indicate the presence of lung rumois in an eatly. preinva- 
sive stage. In an attempt to distinguish the pattern of marker 
expression Tockman el al (58) studied two monoclonal anti- 
bodies. Positive staining predicted subsequent Jung cancei ap- 
proximately 2 years' before clinical recognition of rhe disease, 
with a sensitivity of 91% and a specificity of 88% (58). One of 
rhesc antibodies (703 D4) had a higher sensitivity and was later 
identified as recognizing hnKNP A 2/131 (59). 7 he role of 
hnRNP A2/BJ overexpression for detecting preclinical lung 
canctr has been studied in a large high- risk population including 
6000 Chinese tin miners who were heavy smokers and who had 
an extraordinary rare of lung cancer (60). 7 he results from, this 
Study indicated that detection of hnRNP A2/BI oveiexpression 
in sputum epithelium cells was 2- to 3- fold more sensitive for 
detection of lung cancer than standard chest .X-ray and sputum 
cytology methods. The method was panic ulaily effective in 
identifying early disease (60). The sensitivity was 7-1% vrtMn 
?}% for cytology and 47% for chest X-ray. However, ihe 
biomaiket had a lower specificity t'70'\i compared with ecol- 
ogy ( 1 00%) and chest i a d i og t a p h 1 90% V An ongoing clinical 
trial is evaluating the performance ol the Ai'-'MI protein as a 
biomarker for rheeaify detection of olM.C 'Ihe patients at t\^k 



for SPLC have the highest incidence of lung cancer (2—5%) 
among asymptomatic populations (6 J). In this trial, 13 SPLCs 
were identified by A2rTJl, and the sensitivity and specificity 
were 77-82%. a,nd 65-81%, respectively. Arnbrig the cases 
identified as positive by immunocyl ochemistry and image cy- 
tometry, 67% developed SPLC within 1 year (62). Whereas the 
previous immunocytochemistry studies on materia) from the 
older screening materia) from the NCI-supported screening 
studies were made on sputum cells cytoIogicaHy classified with 
moderately or gravely atypica) metaplastic appearance, the latter 
studies have been done on cytoIogicaHy "normal appearing" 
cells. More recently Sueoka et al (63) reported the confirmation 
of the value of overexpression of hnRNP A2/BI to detect 
preclinical lung cancer in Japan. Efforts to improve the sensi- 
tivity of hnRNP markers are ongoing (64). 

PCR Techniques. PCR techniques have been used for 
the evaluation of molecular biomarkers for early . lung cancer 
detection. In a pilot study with selected patients from the Johns 
Hopkins Lung Project (JHLP), 8 (53%) of 15 patients with 
adenocarcinoma or large cell carcinoma were detected by mu- 
tations in sputum cells from I to 13 months before clinical 
diagnosis (65). However, the method seemed to be less sensitive 
than the protein marker described above, and the identification 
of specific gene abnormalities is further limited by the need to 
know the specific mutation sequence with which to probe the 
sputum specimens. Currently, this approach is not practical for 
screening undiagnosed individuals. Future advances in gene 
chip technology may permit testing for aJJ possible mutations of 
.common oncogenes and TSGs in clinical specimens of asymp- 
tomatic individuals (62). 

Microsatellile markers are small repeating DNA sequences 
found in the noncoding regions of a gene. PCR amplification of 
these repeat sequences provides a rapid method for assessment 
of LOH and facilitates the mapping of suppressor genes (66, 
67). Microsatellite alterations are extension or deletions of these 
repeated elements. Detection of microsatellite alterations in 
histological or cytological specimens may facilitate the detec- 
tion of clonal preneoplastic or neoplastic cell populations. Al- 
though the detection of microsatellite alterations does not indi- . 
cate the specific genetic change in the tumor, detection of clonal 
cell populations might serve as a cancer screening marker (65). 
Identical alterations have been found in lung cancers and cor- 
responding sputum samples demonstrating minimal atypia (68). 
7 he pJ6 gene is located on the short rum of chromosome 
9(9p21) and is frequently mutated or inactivated in nmiors and 
cell lines derived from lung cancer (69, 70). Belinsky e t al. (71) 
measured hypcnnethylatjon of the CpG islands in the sputum of 
lung cancel patients and demonstrated a high correlation with 
enrly stages of non-small cell lung cancer, which indicated that 
pi 6 CpG hypermcthylation could be useful in the prediction of 
(urure lung cancer. However prospective studies ate needed to 
evaluate the role of p!6 hypcrmeihylaiion as a marker lor early 
lung cancer detection. Multiple other genes are inaclrvated by 
hypermethylatjon in lung cancei (72). and the detection of 
hypcrmeihylaiion may be useful for risk assessment and early 
diagnosis. 

Computer- assisted linage Analysis. Computer- assisted 
image analysis was initially used to detect maliunancy- associ- 
ated changes (eg., subvisunl or nonobvions changes in the 
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distribution of DNA in the nuclei of histologically normal cells 
in the vicinity of preinvasive or invasive cancer; 73). In a 
retrospective analysis of sputum cytology slides, malignancy- 
associated changes alone correctly identified 74% of the sub- 
jects who iater developed squamous cell carcinoma (74). The 
technique has been improved, and recent data showed sensitiv- 
ities of 75% for stage 0/J Jung cancer and 85% for adenocarci- 
nomas with a specificity of 90% (75). This quantitative micros- 
copy technique allows the examining of thousands of cells per 
slide within a relative short time. Similar techniques have been 
approved in the United States for cervical cancer screening, and 
might, in the future, play a role for lung cancer screening. 
However, no prospective clinical studies has evaluated this 
technology in a larger lung cancer screening setting: 

High Throughput Technology. With future advances in 
^gene chip technology, it might become feasible to. probe for 
expression of multiple genes in sputum specimens of asymp- 
. tomatic individuals. However, this requires a large amount of 
undegraded RNA from respiratory tract cells. With the high 
throughput technology, a higher sensitivity might be achieved 
by using multiple markers at the cost of achieving a lower 
specificity, which would be undesirable for a screening study. 

In conclusion, we need to reevaluate the role of sputum 
cytology for screening and early, detection of lung cancer be- 
cause of advances in biomarkers and technology. Ongoing stud- 
ies with standard and bjomarker analysis in high-risk groups 
might change the previous negative attitude and provide a new 
perspective on sputum cytology as □ mass screening tool when 
applied in a high-risk population. Adding different molecular 
diagnostic tests gives the possibility foj early diagnosis far in 
advance of clinical presentation. However, validation of the 
tests in larger prospective studies is necessary, and the individ- 
ual tests have to be compared with each other to define the role 
of early diagnosis in the overall management of high-risk sub- ' 
jeets. Furthermore, health economic issues have to be consid- 
ered. 

Specimens for Clinical Testing: UAL 

BAL involves the. infusion rtnd rcaspiration of a sterile 
saline solution in distal segments of the lung via a frbciopiic 
bronchoscope. Ahrendt ?t al.. (76) examined a series of 50 
resected non-.SCLC tumor patients and compared the rumor and 
BAL with regard to molecular mat kcrs including p53 mutations, 
K-ias mutation, the mcthvfntion status of the CpG island of the 
pi 6 gene, and min osatellhe alteration (Tables ] and 2). With 
the possible exception of 'the test foi rnicrosaielliie alteration, all 
of the tests had relatively high sensitivity and could detect 1 
mutant cells in the presence of :i large excess of normal cells. 
The frequencies of these changes in the rumors ranged from 
27% (for K-tas mutations) to :•(■.% (|' 0 > p53 mutations). As 
expected. p33 mutation* uerr mote frequent in rem ml (predom- 
inantly -squamous cell) tumois. and K-ias mm:) lions were more 
frequent in peripheral (predominantly adenocarcinoma) tumors. 
The specificity was high f nearly 100%) because, with the ex- 
ception of mirfosatcl'ite a briar ions, ihe same genetic change in 
BAL sample as in tunmi> was always found, bm ihe sensitivity 
was low, and in only .*v% of unnois that contained molecular 
lesions were the same abnormalities detected in corresponding 
BAL fluid: Specifically, the icms were least helpful in the 



group of patients in whom improved diagnostic abilities are 
most needed, those with small, peripherally located rumors (77). 
Unfortunately, the investigators were not able to compare the 
molecular tests with routine cytopathological analysis of the 
BAL specimens. The sensitivity of the molecular tests in BAL 
specimens has to be improved, and we need to know the results 
from subjects at increased risk (current and former smokers 
without lung cancer and survivors of previous cancer of the 
upper respiratory tract) and subjects with chronic lung diseases 
as well as results from healthy never smokers. 

A European group has pieviousry shown that genetic al- 
terations detected in DNA from bronchial lavage of individuals 
with lung cancer were also found in individuals with no evi- 
dence of malignant disease (78), which raises the question about 
the specificity of such molecular damage in neoplastic condi- 
tions. To improve the sensitivity and specificity of detecting 
allelic imbalance in lung tumors, high throughput PCR : based 
microsatellite assays have been established (79). In a recent 
study by Fielding et al (80), the up-regulation of hnRNP A2/BI 
was found to be a promising marker in BAL for the detection of 
premalignant and malignant bronchial lesions with a diagnostic 
sensitivity of 96% and a specificity of 82%. 

It is too early yet to make conclusions as to whether BAL 
examinations will add to other pathological/molecular biologi- 
cal clinical studies. To obtain diagnostic material for BAL 
bronchoscopy is required, and we do not have any data that 
compare BAL examinations with biopsies. Thus, we do not 
know whether BAL is a valuable adjunct to the biopsies taken 
under the same bronchoscopy procedure. 

Specimens for Clinical Testing: Peripheral Blood 

Fot many years scientists have searched for a Jung cancer- 
specific rumor marker that could be detected in peripheral blood. 
Optimism was raised in the "early" immunocytochemistry era 
by the use of monoclonal antibodies raised against more-or-less 
specific epithelial epitopes. In Ihe search for epithelial cells in 
peripheral blood and bone marrow, monoclonal antibodies 
against cytokeialin have been used. However, these reactions 
are clearly not cancer-specific, and some antibodies have been 
shown to cross- react with normal blood or bone marrow ele- 
ments (81, 82). Another explanation could be that celts from the 
macrophage/monocyte system may contain proteins derived 
from the primary tumor that have undergone . necrosis and 
a p opt os is and that these processed proteins are recognized by 
ihe antibodies (S2). On the basis of "traditional" immunocyio- 
ehemistry, no markers have been able to delect premalignant or 
early- malignant disorders based on a peripheral blood sample. 
However, with the development of DNA technologies, new- 
possibilities have been raised, and. with the use of PGR tech- 
niques, some promising reports have been published. 

Nanogram quantities of DNA circulating in blood arc pres- 
ent in healthy individuals (83 ; 8-1). Turnoi DNA is also released 
into the plasma component in increased quantities (83, J>o). 
Thus, die plasma and scrum of cancet patients is enriched in 
DNA, an aveiage four times the amount of free DNA as com 
pared with normal coniiols 1 87). In a study by Chen et a/. fSS'}. 
a comparison of miciosaiellitc alterations in rumor and plasma 
DNA was done in SCXC patients, and 93% of the patients \mh 
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Table J Tissues and other resources for the sludy of molecular markers 



Specimen 



Ref. 



Comments 



Tumor tissue 
Sputum 

Sunogate organ 

Bronchial brush/wash 
Bronchial tissues 

BAL Iluitls " ■ 
Blood components 

Tissue for molecular singing 
Tumor cell lines 

Cultures of nonmaltgnani (issues 



Numerous 



65, 68,71, 74 



42, 43, 45, 144, 145 



76, 78, 146, 147, ]48 



Noninalignant tissue fiom 
and from enne er- fiee n 



patients 1 56, 
latives 



Mixture of cell types, may require microdissection (139). Extensively used 
for most, studies. Alcohol- fixed fine- needle aspirates may be used for 
mutational arid other studies. 

Respiratory cells usually in small minority. Most samples fixed in . 
'Saccomanno's fixative. Several studies have been performed on these 
specimens many years later. 

Predominantly squamous epithelial cells. Buccal smears, brushihgs of 
tongue or tonsil may be explored as potential surrogate organs resulting 
from the field effect of tobacco damage of the entire upper ., 
aerodigestive tract. This concept needs to be confirmed. 

Predominantly respiratory cells. Fresh, frozen, or alcohol- fixed samples are 
suitable for multiple studies including FISH.* 

Usually from bronchial biopsies, but may be obtained from surgical 
resection specimens. Formalin fixation and paraffin embedding required 
for histological diagnosis, although EAS) preps may permit 
identification and isolation of subpopulations. Paraffin sections may be 
used for genoryping polymorphisms," for alleloryping, and for in situ 
hybridization. 

BAL fluids are useful for examining the peripheral airway cells, which are 
the precursor cells of most adenocarcinomas. Numerous mononuclear 
cells present. Enrichment of epithelial cells desirable. 

Analysis of circulating rumor cells and genetic material shed by dying 
tumor cells into the plasma component may yield useful biological and 
diagnostic information. Gene mutations and presence of epithelial eel) 
markers have been used to detect circulating rumor cells. Gene 
mutations, allelic loss, mictosatellite alterations, and aberrant 
methylalion have been used 10 identify tumor cell DNA released into 
the fluid compartment. 

Although little data exists for Jung cancers, regional lymph nodes, sentinel 
lymph nodes, and surgical icseclion margins- have been used in other 
rumor types for molecular staging. 

Provide an unlimited self-replicating source of high-quality molecular 
reagents and have been used for numerous studies. Cell Jines may oi 
may not leflect the properties of the tumors horn which they were 
derived (26). although they piobabiy represent cellular subpopulations 
(27). Aggressive metastatic rumors aie more likely to be successfully 
cultured (28) resulting in skewed data. 

Epithelial cultures may be useful for studying molecular changes during 
multistage pathogenesis. Temporary as well as a few immortalized 
cultures from nonmalignanl epithelial cells have been established. 
B-iyrnphoblastoid cultures are useful for linkage analysis, for genetic 
suspect tbiliry studies, and for alicloryping corresponding rumors. 

Tissues such as buccal smears, tumor free lymph nodes, and peripheral 
blood mononuclear cells arc useful as controls for linkage analysis, for 
genetic susceptibility studies, and for allelotypiog corresponding tumors. 



" ITSH, fluorescence in jitu hybridization; EASi. rpithrtiat aggregate separation and isolation. 



miciosatclJite alterations in tumor DNA also had modifications 
in the plasma DNA. However, some patients had LOH oniv in 
(he rumor DNA. Because most of the inicrosatellirc alterations 
wen* similar in rumor DNA and plasma DNA. they ton elude J 
that some of the DNA circulating in the blood comes from the 
tumor. Thus, modifications of circulating DNA can be used as 
an early detection marker. Detection of aberrant DNA methyl- 
ation tn scrum DNA in patients with non-SCI.C lias been 
u-poned i '■?.). A hhottgli the number of patients was small and 
the hyper methylated DNA was I omul in all stages, n opens up 
for the possibility to be used as an eajly lung cancel detection 
rnaiker. fuiihrirnoir. p5J and ras gene mutations have been 



detected in the plasma and serum of patients with colorectal 
cancers (89-91). pane tea tic carcinomas (92. 93). and hemato- 
logical malignancies (94). 

In conclusion, the limited ditect accessibility of lung car- 
cinomas has led lo efforts to identify tumor- associated soluble 
maikers in serum or plasma. Many of the currently recognized 
soluble markers were lust identified as "tumor v markets bui r 
when evaluated in nonneoplastic tissue, have often been found 
in nomia] cells as well as in lumOis. Foi eaily detection ol Jung 
cancel, we need mote clinical data evaluating these new molec- 
u!ai biological maikeis from multiple sites, especially in high- 
risk groups 
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Table 2 Molecular approaches for lung cancer investigation 



Specimen 



Ref. 



Comment 



Gene mutations 



AJlelotyping 



Gene expression at RNA 
and protein level 



Molecular cytogenetics 



Comparative genomic 
hybridization 

Morphomciric studies 



159, 160, 161 



18, 158 



145, 16?, 163, 164, 165, 166 



40, 167.. 16S : 169, 170 

171, 172 

7-1, 173. 174 



Widely used technique, especially for p53 and ras genes: Often used 
to determine the r6)e of a newly discovered gene in tbe 
pathogenesis of hmg cancer. May be of diagnostic and prognostic 
significance. Multiple methodologies available. 

Useful as a partial substitute for mutational analysis and for. 
determining the chromosomal locations of putative tumor 
suppressor genes. Widely used to study multistage pathogenesis. 
Readily performed on formalin-fixed.and microdissectcd tissues. 
Increasing use of genotyping using automatic sequencers. 

Northern blotting and reverse transcription- PCR are widely used to 
investigate gene expression. Western blotting often used for 
detection of protein expression. In situ hybridization for message 
expression can be performed on paraffin-embedded tissues and, 
thus, can be used to investigate multistage pathogenesis. 
Microarray techniques offer promise of examining all or most of 
the genome but currently require relatively large amount of high-, 
quality RNA from purified cell populations Sage technique useful 
for investigation and identification of expressed genes. Similarly, 
advances in proteomics will permit simultaneous detection of 
multiple proteins. Numerous immunohislochemical studies of 
. oncogene expression have been used to study multistage 
pathogenesis. Of particular interest, hnRNP expression on 
exfoliated epithelial cells in sputum samples may predict for 
development of cancer. 

In situ hybridization studies of fixed materials or using smears has 
provided considerable information 3bout numerical and structural 
changes. 

Useful for detection of gene amplifications. Less sensitive for the 
detection of regions of allelic loss. 

May be applied to paiaffin-em bedded tissues. Useful for determining 
aneuploidy and for measuring a number of nuclear and 
cytoplasmic parameters. " 



Specimens for Clinical Testing: Bronchoscopy 

WLB is the most commonly used diagnostic loo) for ob- 
taining a definite histological diagnosis of lung cancer. Bron- 
choscopy has major diagnostic limitations for premalignant le- 
sions. Because these lesions are only a few cells thick (0.2- J 
mm) and have a surface diameter of only a few millimeters, they 
rarely are observed as visual abnormalities. Woolner (95) re- 
ported thai squamous cell CIS was visible to experienced bron- 
choscopists in only 29% of cases. To address this limitation, 
fluorescence bronchoscopy was developed.- Early studies of 
fluoiesccncc bronchoscopy entailed the use of fluorescent drugs 
(hematoporphyrin dyes) that were preferentially retained in ma- 
lignant tissue (96). Although, studies evaluating ibis approach 
did. in fact, show that early invasive and in .wm cancers could be 
localized, the detection of dysplasia remained problematic (97- 
J 00). Furthermore, the development of phmodynnmir diagnos- 
tic systems was hampered by problems including skin photo- 
sensitizing and interference with tissur auloflunrescence. To 
overcome these problems, a new laser photodynamic diagnostic 
system was developed (101). This svstem detected rumor- 
specific drug fluoresce nee at 630 nm wavelength, which is far 
from normal tissue a mo floor esc encr (.son -o$0 nm). and inter- 
ference bv antolluorcsccnce horn norma! tissue should then 
have been eliminated, bur it remained a srg nilicant problem 
(102). 



Another approach .was developed by Palcic et a!. (103), 
who noticed the lack of autofmoicscence irt the tumor lesions by 
using blue light (442 nm) rather than white light to illuminate 
the bronchial surface. They amplified the difference in autofiuo- 
rescencc between normal, premalignant, and tumor tissue for 
clinical use (103, 10-1). Using a high-quality-charge coupled 
device and special algorithm, the LIFE was developed, taking 
advantage of the principle that dysplastic and malignant tissues 
reduce 3Ulofluorcscent signals compared with normal tissue 
(Fig- 3). 

Several studies have been performed comparing the diag- 
nostic specificity and sensitivity of LIFE bronchoscopy wrsus 
WEB in diagnosing preinvasive and early- invasive lesions 
(J05-I09; Table 3). Most of the studies reported a higher 
diagnostic sensitivity of LJFJE bronchoscopy in the detection 
piemahgnant and early-malignant lesions at the cost of lower 
specificity [i.e., more false-positive results). In most of these 
studies, lesions with moderate dysplasia or worse were the target 
of the study and rated as "positive."* The prevalence of prein- 
vasive nnd carlv lung cancer varies widely from one study to 
another, from 20.?% (Kb) to f.O 8% (1 02). The explanation 
might be beyond the tisk profile of genetic variations or differ- 
ent levels of experience among the endoscopists as well as (he 
pathologists involved. Furthermore, there seems lobe a training 
cifeci in using the I. IFF: bronchoscope. " Inch has been demon- 
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Fig. 3 A, normal WLB and normal 1_IFE bronchoscopy. B, WLB shows infbmmaiorv changes in ihc bronchial mucosa but no suspicion of 
-malignancy [left). LIFE bronchoscopy shows diffuse reduced auloOuorcsccncc (visualized by diffuse reJ-hranmsfi chtonzonorr urrow) Biopsy 
Uemonsiraied diffuse severe dysplasia. " 



slrated by Yenmans a al. (J 07). In their study, ihc diagnostic 
sensiiivity increased from 67 to 80% when comparing the fusi 
and fhe second half of the study. The use of the LIFE device in 
conjunction with WLB improved ihe detection rate of preneo- 
plastic lesions and OS significantly (Table 3). Kwrie ei «/. "f 106) 
looked for more subtle tissue transformation, but their studv 
included few patients - with moderate dysplasia or worse. No 
impiovemem in the evaluation of metaplasia index was ob- 
served by the use of LIFE bronchoscopy. Thus, differences in 
the srudy population might explain the diffeicnt conclusion. 
There are still no clinical studies with sufficient long term data 
showing that moderate dysplasia is the most jelevam clinical 
predictor of eventual malignancy. Limitations in making con- 
clusions from the twisting studies ate also the potential meth- 
odological bias i elated to the order m winch the different bron- 
choscopy procedures aie done and whether the same examine! 
has performed both pioredmes. To address these issues ;i 



ptospetiive randomized study between LIFE bronchoscopy and 
WLB was done at the University of Colorado Cancel Center. 
The study design included a randomization with regard to the 
order of procedure as well as the order of the individual bron- 
choscopist 1 1 Of)). The order of the procednie and of the indi- 
vidual btonchoscopist did not affect the results 'Ihe snidv also 
dcmonstiated a significantly higher sensitivity m delecting pie- 
malignani lesions visual rzxd by the LIFE, but ;tt the cost of a 
lower specificity (109). The reason for the low diagnostic spec 
rhcity found with the LIFE bronchoscopy in the different studies 
might be attributable to the visualization of moie abnoimal foci 
w ith the LIFE bronchoscope, with the consequence that a hugei 
nmrtbci ol biopsies were taken and. thus, rheie v. as a higher nsl: 
of mote false-positive results. The use of 1. 1 IT: bi one boscopv 
has led to the identification of a new moiphologjc vt\ eniitv. ihe 
ASL>. which is described above. In a recent morphological Mudv 
arigiodysplnsiie changes were treqnentiv bumd m pie neoplastic 
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Fig. 4 Seventy-one-year-old man with a spicular nodule in upper (eft 
Jobe demonsirared on low-dose helical CT (picture), but nol visible on 
chest X- radiography. CT- guided biopsy showed adenocarcinoma. 



and early- malignant lesions in the bronchi (26). The morpho- 
logical entity has been confirmed in preneoplasias among smok- 
ers, and the perspectives of this finding have been extensively 
discussed ( 1 10). The prognostic significance of this morpholog- 
ical entity is currently studied in ongoing long- term follow-up 
studies. Future studies have to evaluate the role of ASD as a 
biomarker for early lesions and whether it can be used as a 
marker for treatment effect or therapeutic target for chemopre- 
vention. 

The LIFE bronchoscope may piny an important role in the 
screening and follow-up of subjects at high risk of developing 
lung cancer. At this stage, however, it is unknown whether (he 
LIFE bronchoscope will lead to a reduction in lung cancer 
mortality. There are also no data on cost-effectiveness and 
cost-benefit analyses available for this new diagnostic proce- 
dure. The use of the LIFE bronchoscope may also in the future 
be extended lo other indications, eg., patients staged as having 
resectable lung cancer on one side. Whether LIFE bronchoscopy 
of the comralaleral lung will disclose abnormalities, which 
would change the therapeutic decision, is not yet reported. 

ftecrnl Advances in Kadiology 

The previous NCLsponsored screening trials failed to don* 
onsnate any reduction in the lung cancer mortality by sputum 
cytology and yearly chest radiography as mass screening tools 
lor lung cancer screening. Limitations of design and execution 
of the studies, however, have been discussed extensively (8, 
HI. 112). An extended follow-up (median. 20.5 years) of the 
Mayo Lung Project was recently published ( )J3). There was 
still no diffeience in lung cancer molality between the inter- 
vention arm and i he contiol arm (4.1 x e>.?it.i 3.9 deaths per 1000 
person yeais). However, the median survival for patients wilb 
resected early. 5 race disease was !<>.(> yeais in the inicr.ention 
arm ivm/o 5.0 years in the usual -0 are arm [P < P.OM, 'I he latter 
tmdines haw raised ihe question :is 10 whether some small 
lesions with limited c finical relevance mav have been identified 
m the inten cniion ami. and the rjuestion el "overdiapripsis" was 
discussed in accompanying editorials 111*1 



Mass screening for lung cancer has been performed in 
Japan for many years and has been performed in over 500,000 
. people in about 80% of the local communities (J )5). Sobue et 
al. (IJ6) observed that annual clinic-based chest X-ray screen- 
ing for lung cancer in Japan showed reduced hing cancer mor- 
tality by about one- fourth among individuals who. underwent 
screening once a year. In this screening program, ihe relative 
odds ratio of dying from rung cancer within 12 months was 
0.535 and in the J 2-24-month period was 0.638 (117). How- 
ever, many studies have focused on the pitfalls in the detection 
of abnormalities by radiography (1)8-122). The limit of chest 
radiographic sensitivity for nodule detection is roughly I cm in 
diameter, by which time the rumor has over JO 9 cells and may 
already have violated bronchial epithelium and vascular epithe- 
lium. CT has been shown. to be more effective in the detection 
of peripheral rung lesions compared with plain radiography or 
conventional tomography of the whole lung (J 23, 124). • 

Spiral CT scan is a relatively new technology with the 
. ability to continuously acquire data resulting in a shorter scan- 
ning lime, a lower radiation exposure, and improved diagnostic 
accuracy compared with those of plain radiography (125-127). 
Spiral CT allows Ihe whole chest lb be imaged in one or two 
breath-holds, reducing molion "artifacts and eliminating respira- 
tory misregistration or missing nodules. Although there is 
greater radiation exposure wilh CT than with chest radiography, 
low-dose techniques (lower mA of 30-50 compared with 200 
for conventional CT) have achieved calculated exposure doses 
that are 17% that of conventional CT and 10 times that of chest 
radiographs. Further reduction in radiation dose while maintain- 
ing diagnostic accuracy is a topic of current research. Further- 
more, for the baseline screening, low- dose spiral- CT- scan i.v. 
contrast is not adminislered. Nodules as small as 1-5 mm can be 
shown with modern spiral CT technology (25. 128). The obvi- 
ous advantages with this new technology led some groups in 
Japan and in the United States to look to low-dose spiral CT as 
a tool for screening (Refs. 129- J 3 1: Tables 4 and 5). 

•In a Japanese report, spiral CT scans and chest radiographs 
were done twice a year in 1369 individuals (J 29). Peripheral 
lung cancer w.is detected in 15 (0.3%) of 3*157 examinations, 
and, among the 15 lung cancer cases delected, the results of 
chest X -fay were negative in 11(73%), and the tumors were 
delected only by low-dose spiral CT. The detection rates of 
low-dose spiral CT and chest X-ray were 0.43% (15 of 3457 
examinations) and 0.12% I 'I of 3457 examinations), respec- 
tively. Furthermore, 14 (93%) of the J 5 lung cancers were stage 
I disease. The histology showed that II of the 15 lung cancer 
cases were adenocarcinoma, anil <1 had squamous cell carci- 
noma. The effective exposure dose with spiral CT scan in that 
study was calculated to about one-sixth thai of conventional CT. 

The ELCAP in New York was designed to determine: [n) 
the frequency with which nodules were detected; (M the (re* 
qnency with which detected nodules r epic sent malignant dis- 
ease; and (r) the frequency wiih which malignant nodules are 
curable (131). In the LLCAP study. 71 hmp eanceis were found 
among 1000 subjects screened. Among ihe 27 patients with 
cancer. 85% had stage I disease ( fable M 

Anothei population- ba;:ed study on low- dose C I screening 
has been published by Sone a nl tl30"). using a mobile low- 
dose spiral CT scanner. 'I be deirciion rale was 0.-?S% H r , 4-> 
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Table 3 Bronchoscopy versus WLB in diagnosing preroaligriant and early-malignant lesions 



Sensitivity Specificity Predictive values 



Relative Relative PPV NPV 

No of LJFE+ , sensitivity LIFE+ specificity LIFE+ L1FE+ PPV NPV PPV NPV 

Author biopsies WLB LIFE WLB LIFE+WLB WLB LIFE WLB LIFE+WLB WLB WLB LIFE LIFE WLB WLB 

Lam etal (105) 700 0.67 NR 0.25 6.3(2.7/ 0.66 NR 0.90 NR 0.33 0.89 NR NR 0.3? 0.83 

Kuriee/^006) 234 NR 0.38 NR NR NR 0.56 NR NR NR NR 0.16 0.81 NR NR 

Venmans^UIO?) 139 NR 0.89 0.78 i;43 NR 0.61 0.88 NR 0.20 NR 0.14 0.99 0.32 0.98 

Vermulenwa/.UOS) 172 0.93 NR 0.25 3.75 0.21 NR 0.87 NR 0.13 0.96 NR NR 0.19 0.90 

Kennedy et at. {109) 394 0.79 0-72 018 4.4 0.3 0.43 0.78 0.38 0.21 0.85 0.25 0.87 . 0.17 0.80 
" PPV, positive predictive value; NPV, negative predictive value; NR, not reported. 
b Based on reference pathologist. 
r Jf invasive carcinoma is included. 



Table 4 Results from three population-based screening studies with low-dose spiral CT (LDCT) 





No. of individuals 
studied 


True 
positive n 


False 
positive" % 


Predictive 
value % 


Deiection rate % 




Age incl. 
yr 


Authors 


LDCT 


X-ray 


Pack-yr 


Kaneko et nl. (129) 
Sone et al. (130) 
Henschke et al (131) 


1369 - 
•3967 
H>00 


15 
19 
27 


15.6 
5.0 
20. J 


6.6 
8.8 
J 1.6 


0.43 
0.46-0.5 
2.7 


0.12 
0.70 


>20 
>30* 
. >lf/ 


>50 
40-74 
>60 



* Defined as individuals with "test-positive," m whom further workup gave no suspicion of malignancy. 
h The study also included a group of nonsmokers. 
r Average - 45 {not reported in the other studies). 



Table 5 Histology, stage, and size of primary lung cancer detected by low- dose spiral CT 

Histology % ■ TNM % Size (mm) 

No. of cancets/ — : — - — — 

Authoi No. screened Adeno* Squam. Other I II III IV Average Range <10 11-20 >21 

Kaneko. er al (129) 15/1369 (1.1%) 73 ' 17. 93 7 12 8-18 

■Sone et at. (130) 19/5483 (0.3%) "63 5 32 84 16 17 6-^7 4 J 4 3 
Henschk e et ul ( 13 )) 27/1000 (2.7%) 67 3 30 85 -1 II 15 S 4 

" Adcno, adenocarcinoma; S<juam. : squamous cell carcinoma; TNM, tumor- node -me la stasis. 



cases per 1000 examinations). Surprisingly, there was no dif- 
ference in the detection rale among smokers (0.52%) versus 
nonsmokers (0.46%). The results fiom the Ihree population- 
based studies are summarized in Tables 4 and 5. The conclusion 
from these studies is thai 85% of the lung cancers detected by 
low-do.se CT were in Mage I, offering impiovei) possibility for 
curative treatment and bettcj prognosis in general. However, the 
issue of "false- positive" scans has to be taken into consideration. 
Thus far., up to 20% of the paiiicipanis with nodules on the scan 
had no malignancy during the follow- up period. The. possibility 
that the cancers found represent incidental cancers as in the 
Mayo Lung Project must also be considered (114). The results 
from these studies confirm the e.\ pee ration that low- dose CT 
increases the detection of small none a lei ft ed nodules and, that 
June cancer at an rarlier and more curable stage are detected. 
The mobile O screening study by Sone et nl. (130) showed that 
low-dose CI, increased the likelihood of detection of malignant 
disease 10 times r;s compared with radiogiaphy. The overall late 
of malignant disease was lower m the Japanese studies (129. 
130) compared with the tl.CAP Mudy (Kef. I? I; deiection rates 
0.43-0.4S% vr-/;wo ?.7%>. this could be because the Japanese 
studies screened individuals from die genera) population ages 



40-74, whereas ELCAP screened people at high risk, ages £60. 
with a tobacco history of at least 10 pack- years. Thus, as 
expected, the risk of the population to be screened affects the 
rale of cancer detection. 

Questions remaining to be answered include: (a) what are 
the diagnostic sensitivity and specificity of this procedure: and 
(6) docs screening reduce lung cancer mortality? The spiral CT 
has not been as sensitive for small centra) cancers as it is for 
small peripheral cancers (129, 131). Minute nodules of lung 
cancer that are near the threshold of.dcteciabiliry may be over- 
looked at spiral CT scieening (132). A prospective study of the 
diagnostic sensitivity of spiral CT has recently shown thai the 
diagnostic sensitivity exceeded the sensitivity of conventional 
CT in previous reports (23). However, there were limitations in 
the detection of mtiapnlmonary nodules smaller than 6 mm and 
of pleural lesions. Compared with surgery (thoracotomy with 
palpation of deflated lung, resection, and histology), the sensi- 
tivity of spiral CT was 60% for inirapulmonary nodules o( <(> 
mm and 93% fot nodules of 5 6 mm and was 100% for neo- 
plastic lesions >6 nun. fur ihei mote, a marked difference in the 
sensitivities of two independent observers was found for nod- 
ules smaller than 6 mm, whereas agreement w:is much beiu-r (or 
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6- t )0-mm nodules (25). Given these promising preliminary clin- 
ical results, further research is needed to determine ihe optimal 
technique for spiral CT screening, which includes collimation, 
reconstruction interval, pitch, and viewing methods. Decreasing 
the slice thickness to 3 mm, monitoring the viewing of exami- 
nations, and computer-aided diagnosis have been used to im- 
prove the diagnostic capability of spiral CT in the detection of 
pulmonary nodules (133-136). 

Future large scale randomized studies have to confirm 
whether in fact spiral CT screening will lead to a reduction in 
lung cancer mortality. In a randomized study, the following 
questions arise: (a) what is the optimal high-risk group to study 
and what should be the control arm? (b) what should be the end 
points (goals) of the studies? The ultimate goal is to reduce the 
Jung cancer mortality. However, although this is a long-term 
goal, intermediate end points from such studies should be eval- 
uated. The change to more curable stages at diagnosis for the 
lung cancer patients is one such immediate goal; (c) what is the 
optimal workup and the morbidity of this program? (d) what is 
the cost of such a screening program? and (e) what is the 
false- positive rate of the screening findings? Incorporation of 
smoking cessation programs should be included in the future 
design of screening studies because it has been shown that 
screening with low- dose CT in participants who are still smok- 
ing provides substantial motivation for smoking cessation (J37). 

The studies with spiral CT-scan have demonstrated the 
superior diagnostic ability in (he detection of small peripherally 
located tumors, most of the malignant ones of adenocarcinoma 
type of histology. The diagnostic sensitivity of spiral CT for 
more centrally located himors (mostly squamous eel! carci- 
noma) is significantly lower than for the peripherally located 
ones. Through these spiral CT studies, we will leam about the 
biology, pathology, and clinical course of these small tumors, 
which might be different from what we know about clinically 
more evident himors detected routinely in previous studies. 

Because lung C3nccr is so common, the introduction of any 
new screening technique in this area has to be underpinned by 
careful definition of the cost implications and must be justified 
by compelling evidence. The cost-effectiveness of the spiral 
CT approach should be assessed by evaluating the rale of 
over-diagnosing nonmalignant, relatively common abnormali- 
ties and comparing CT imagine to other diagnostic technologies. 

PET with FOG has recently emerged as a practical and 
useful imaging modality in the preoperative staging of patients 
with lung cancer However, whereas CT is most frequently used 
to provide additional anatomical and morphological information 
about lesions, the FDG PET imaging provides physiological and 
metabolic information that characterizes lesions that me inde- 
terminate by CT. | r DG PET imagine lakes advantage of (he 
increased accumulation of l : DG in transformed cells and is 
sensitive l~9>%> for the detection of cancer in patients who 
have indeterminate lesions on CT ( 138). The specificity ( - S.S%) 
of PET tmaeine is slightly lc<s than its sensitivity because i.omc 
inlhrnmairrv piorr;,.'C5 avidly. accumulate FDG. The high neg- 
ative piedMiw v-.ijue of Pl;1 suggests that lesions considered 
negative on the study ::re benign, biopsy is not needed. :md 
radiographic !»dk>%-. - nj> is recommended. Several studies have 
documented die increased accuracy of PET compared wiih CT 
in the i-o!u:tiio» v t the hilar and mediastinal lymph node Mams 



in patients with lung cancer (138). However, the PET resolution 
is sufficient only for nodules ^6 cm and will not be helpful in 
detecting the very small nodules. Compared with low-dose 
spiral CT, the FDG PET scan is more expensive and time 
consuming. The role of PET scan in early diagnosis of lung 
cancer in an asymptomatic high-risk population is not yet eval- 
uated. However, future studies have to include PET evaluation 
to define its role in a population screening setting. 

Conclusion 

Recent advances in molecular biology and pathology have 
led to a better understanding and documentation of morpholog- 
ical changes in Ihe bronchial epithelium before development of 
clinical evident lung carcinomas. Combined with technical de- 
velopments in radiological and bronchoscope techniques, these 
procedures offer great promise in diagnosing lung cancer far in 
advance of clinical presentation. Any of these individual proce- 
dures could be incorporated into the routine management of 
individuals at risk for developing primary or secondary lung 
cancer, and for several of these methods, clinical srudies are 
under way. Preliminary reported data are very promising for the 
early detection of lung cancer. Future studies must incorporate 
Ihe different methods in a multidisciplinary scientific setting to 
evaluate the role of the individual method in the overall man- 
agement for individuals at high risk for developing lung cancer. 
Several of these tests might diagnose the disease nt the stage of 
clonal expansion before invasive carcinoma has developed. A 
management and intervention strategy appropriate to that stage 
of disease have to be developed. Preliminary studies of chemo- 
prevention agents are reported, and new agents based on other 
biological mechanisms are under development and ready for 
clinical trials. It is now time to plan clinical trials that evaluate 
both diagnostic and therapeutic approaches to access iheir im- 
pact on Ihe incidence of clinical lung cancel. 

Acknowledgments 

We thank Drs. Stephen Lam, Vancouver, British Columbia, Can- 
ada, and Kavita Gatg. University of Colorado Health Sciences Center. 
Denver. Colorado, for a critical review of the manuscript and Drs. 
Timothy Kennedy and York Miller for submitting illustrations for 
white- lichi and LIFE bronchoscopy, 

References 

I. Greenlee. R. T.. Munay, T., Bolden. S., and YVingo, P. A. Cancer 
Statistics. ?000. CA Cancer J. Clin., 50: 7-30. 7000. 
?. Mountain. C. T. Revision in the international ivMfm foi Ma pint.* of 
lung cancer. Chest. II); 1710-1717, J 997. 

Ihde, D. C. Chemotherapy of lung cancer N. Until. J Med. .?//: 
M3-t-)44l. J992. ■ 

4. Melamed, M. R.. f lehiiiger, B. J., Zaman. M. B . Hecbn. R. T.. 
Par(hick : W. A., and Martini, N. Screening for lung rancor results of 
the Memorial Sloan-Keiirring study in New York. CheM. $6' 44- S?. 
I ?S4 

5. rontana. R. S.. Sanderson. D. R , Woolmr. 1., B . Taylor. W 1" . 
Miller. W. and Muhrn. J. R. Lung Canrei Seieenini: "Ihe Mavr, 
program. J. Oreup. Mcd. f ?S: 746-750, l9Po 

(> 'loci: man. M, S. Survival and monalitv fn>in hinf enu'er in :t 
screened popular ion: Ihe John? Hopkins Study. 0>r.M. .'i.v V'-*S-3?.' S. 
I9S6 



Review: Advances in Early Detection of Lung Cancer 



■7. Kubik, A., Parkin, D. M. t Khlat, M, Erban, J., Polak, L, and 
Adfamec, M. Lack of benefit from semi-annual screening for cancer of 
the lung: follow-up report of a randomized controlled trial on a popu- 
lation of high-risk males in Czechoslovakia. Int. J. Cancer, 45: 26-33, 
1990. " ' 

8. Fontana, R. S., Sanderson, D. R., Woolncr, L. B., Taylor, W. F., 
Miller, W. E... Muhm, J. R., Bernatz, P. E., Payne, W. S., Pairolero, 
P.. C, and Bergstralh, E. I Screening of lung cancer. A critique of the 
Mayo Lung Project. Cancer (Phila), 67: 1155-1164, 1991. 

9. Strauss, G. M, Gleason, R. E., and Sugarbaker, D. J. Screening for 
lung cancer. Another look: a different view. Chest, J J J: 754-768, 1997. 

10. Hirsch, F. R., Brambilla, E, Gray, N., Gritz, E., Kelloff, G. J., 
llinnoila, 1., Pastorino, U, and Mulshine, J. L. Prevention and early 
detection of rung cancer— clinical aspects. Lung Cancer (Limerick), 17: 
163-174, 1997. 

1 1. Hong, W. K. Chemoprevention of Lung Cancer. Oncology (Basel), 
J 3 (Suppf 5): 1 35- HI, 1999- 

12. Mulshine, J. L., and llenschke, C. I. Prospects for lung-cancer 
screening. Lancet, 355: 592-593.. 2000. 

13. Travis, W. D., Colby, T. V., Corrin, B., Shimosaio. Y., and Bram- 
billa, E. Histological typing of tumours of lung and pleura. In: L. IL 
Sobin (ed.). World Health Organization International Classification of 
Tumours, Ed. 3. New York: Springer-Verlag, 1999. 

14. Franklin, W. A. Pathology of lung cancer. J. Thorac. Imaging, J 5: 
3-12, 2000. 

15. Colby, T. V. Precursor lesions to pulmonary neoplasia. In: C. 
BrambiHa and E. Brambilla (eds.), Lung Tumors. Fundamental Biology 
and Clinical Management, pp. 61-87. New York: Marcel Dekker Inc.; 
1999. 

J 6. Peters, E. ) . Morice. R., Benner, S. E., Lippman, S., Lukeman, J., 
Lee, J. S.. Ro, ). L., and Hong, W. K. Squamous metaplasia of the 
bronchial mucosa and its relationship to smoking. Chest. 103: 1429- 
M32, 1993. 

17. Aueibach, O., Geie, B., Fomiiin, J. B.. Petrick, T. G. ; SmoJin. H. )., 
Muchsam, G.E., Kassouny, D. Y. r and Stout, A. P. Changes in bron- 
chial epirhelium in relation to smoking and cancer of the lung. R Engl. 
J. Med,. 256: 97-) 04, 1957. 

18. Wistuba, 1. 1., Behtens C. Milchgrub, S., Bryant, D., Hung, J., 
Minna. J. D., and Gazdar, A. F. Sequential molecular abnormalities arc 
involved in the multistage development of squamous cell lung carci- 
noma. Oncogene, 18: 643-650, 1999. 

19. Lam, S.. LcRiche, J. C, Zheng, Y„ Goldman. A.. MacAulay, C, 
Hawk. E.. KeUoff. G.. and Gazdar, A. F. Sex-related differences in 
bronchial cpiihelial changes associated with lobncco smoking. _L Nail. 
Cancer )iVm., 91: 691-696, 1999. 

20. Saccomanno G. Archer, V. E.. Aueibach. O., Snundeis. P. P.. ;md 
Bicnnan. 1.. M. Development of carcinoma of the tune as reflected in 
exfoliated cells. Cancer (Phila.), 32: 256-270, 1974. 

21. Frost. J. K.. Ball. W. C. ; Ji., Levin, M. L.. Tockman. M. S.. F.ro2an, 
Y. S., Gupia. K. : F.ggleston. J. C. Pressman. N. J.. Donilhan, M. P. : and 
Kimball, A. W. Sputum cytopathology; use and potential in monitoring 
the workplace environment by screening for biological effects of e\po 
sure. J. Or cup. Med, ?$: 692-703, 1986 

2?. Venman, B. J. W.. van Boxem. T. ). M.. Smit, 1;. F.. Posimus. P. E.. 
and Suiedja. 7. f~i. Outcome of bicnehial carcinoma in um. Chest. JJ7: 
157?- 157*6. ?()t)0. 

23. Sir bos. R Boas. I. 0.. Clement. M. ) . Otfrfhatis. G. L, Ask in. 
f. 0. Hruban. I(. H.. and WcsttJ. W. H. p53 alieiaric-ns in atypical 
aKeobi hyperplasia ot the human lung. Hum. Pathol. 79: S'OI- SOS. 
I9?S 

?4. Kitamina. )L. Kameda. V.. ho. '!.. ami llayashi. H Atypical ade- 
nomatous hyperplasia ot the limp Imphrai inns tot the pathogenf MS of 
periplie:al !i.'ni? :iJi-noi aictnom.'! Anv .1 Clin PailioL }>} 610-622. 
(999 

25. Oiedurh. ,S . Semik. M . l.rnischig. M. G. Wmiti. F. Svheld. 
H It. R i - '(>>. N , and Boneanv. G Hehial C7 ol poJmonary nodulfs in 



patients with extratboracic malignancy; CT-surgtcal correlation. Am. I 
Roentgenol., /72: .353-360, 1999. 

26. Keith, R. L., Miller, Y. ,E., GemmiJK R. M.", Drabkin, H. A., 
Dempsey, R C, Kennedy, T. C, Prindiville, S., and Franklin, W. A. 
Angiogenic squamous dysplasia in bronchi of individuals at high risk for 
lung cancer. Clin. Cancer R«. ; 6: 1616-1625, 2000. 

27. Muller r X. M., and Muller, G. Jhe ultrastrvcture of preneoplastic 
changes in bronchial mucosa. Curr. Top: Pathol., 73; 233-263, 1983.. 

28. Hirsch, F. R., Gazdar, A. F., Gabrielson, E. r Urn, S., and Franklin, 
W. A histopathologic evaluation of premalignant and early malignant 
bronchial lesions; an interactive program based on internet digital im- 
ages to improve WHO criteria for early diagnosis of lung cancer and for 
monitoring chemoprevention studies— a SPORE collaborative project. 
Lung Cancer (Limerick), 29 (Suppl. 2); 209, 2000. 

29. Lengauer, C, Kinzler, K. W., and Vogelstein, B. Genetic instabil- 
ities in human cancers. Nature (Lond.), 3Pr5: 643-649, 1998. 

30. Fon& K. M., Sekido, Y., and Minna, J. D. Molecular pathogenesis 
of lung cancer. ). Thorac. Cardiovasc. Surg., 118: 1 136-1152, 1999- 

31. Hirano.T., Franzcn, B., Kato, U., Ffbibara, Y., and Auer,G. Genesis 
of squamous cell lung carcinoma; sequential changes of proliferation, 
DNA ploidy, and p53 expression. Am. ). Pathol., U4: 296-302, 1994. 

32. Betticher, D. C, Heighway, J., Thatcher. N., and Hasleton, P. S. 
Abnormal expression of CCNDI and RBI in resection margin epithelia 
of lung cancer patients. Br. J. Cancer, 75: 1761- 1 768, 1997. 

33. Satoh, Y., Ishikawa, Y., Nakagawa,K.- Hirano, T., and Tsuchiya, E. 
A follow- up study of progression from dysplasia to squamous cell 
carcinoma with immunohistochemical examination of p53 protein over- 
expression in the bronchi of e.x-chr ornate workers. Br. J. Cancer, 75: 
.678^683, 1997. 

34. Li. Z. H., Zheng, J., Weiss, L. M., and Shtbata D. c-k rtts, and p53 
mutations occur very early in adenocarcinoma of the lung. Am. ). 
Pathol., 144: 303-309, 1994. 

35. Brambilla, E., Cazzeii, S., Lantue|oul, S. : Coll, J- L., Moro, D., 
Negoescu, A., and Brambilla, C. p53 mutant immunopbenotype and 
deregulation of p53 transcript ion pathway (Bcl2, Bax, and Wafl) in 
precmsor bronchial lesions of lung cancer. Clin. Cancer Res., 4: 1609- 
161$, 1998. 

36. Hung, J., Kishimoto, Y.. Sugio/K., Virmani, A., Mclntire, D. D., 
Minna. J. D., and Gazdar, A. F. ADelc-spectlic chromosome 3p dele- 
tions occur at an early stage in the pathogenesis of lung carcinoma. 
J. Am. Med. Assoc., 273: 558-563, 1995. 

37. Kishimoto, Y., Sugio, K., Hung. J. Y., Virmani, A. K., Mclntire, 
D. D.. Minna. J. D. f and Gazdat. A. 1 : . Allele- specific loss in chromo- 
some 9p loci in pteneoplastic lesions accompanying non-small cell lung 
cancers. ). Natl. Cancer Inst., ,V: 1224-1229. 1995. 

38. Sugio, K., Kishimoto, V.. Viimnni, A. K., Hung, ). Y., and Gazdar, 
A. F. K-ros mutanons ate a i datively hue event in the pathogenesis of 
lung carcinomas Cancer Res., 5811-5815, 1994. 

39. Wistuba, 1. I.. Behrens, C, Vinnant, A. K._. Milchgrub, S.. Syed, S., 
Lam, S., Mackayl B., Minna, J. D., and Gazdar, A. F. Allelic losses at 
chromosome 8p21-23 arc emly and heqtienl events in the pathogenesis 
of lung cancer. Cancer Res.. 59: 1973-1979, 1999. 

40. Vaiella-Gaicia, M., Gemini II. R. M., Rabenhorst, S, H., Lotto. A., 
Drabkin. H. A.. Archer. P. A . and Fiat»klin T W. A. Chromosomal 
dupliealion accompanies allelic loss in non-small cell lung carcinoma. 
Canca Res., 5S: 4701- -1707, IP9S. 

41. Wcstr a. W. I)., Baas, I. O . Iliuban. R. ().. Ask in, F, B.. Witson, K. r 
Offeih:ms, G. J. : Slebos. R. J K-iys ciKO^one activation in atypical 
alveolar hyperplasia:* of the human lung Cancer Res.. b6: 2224-2228, 
1 996. 

-t?. Wistuba, 1. ).. Lam. S . Bthrrtts. C . Virmani. A. K.. Fung, K. M., 
LeRjthe. ) . Samrt. .1. M.. SiivaMava. ? . Minna. J. U.. anttGazd3r. A. F. 
Molecular ibmrigc in ihe hroiuhial i pnhchnm ol oincnt r»nd former 
smck cr s. J. Nail Cai'i et In: t . . I 6''- -1377. 1 997 . 
43, Mao, L.. fee. J. S . Kune. J M . Fan. V. IF. Lippman, S. M., Lee, 
J. ).. Ro. J. V.. Idoson. A. : Yii. R . Moike. R C , Knnp, B. L.. Khwi ; 
F. R . Walih : G. L . H:tic!m:m. W, N . sml Hong. VV. K. Clonal genetic 



Clinical Cancer Research 19 



alterations in the lungs of current and former smokers. J. Natl. Cancer 
}nst^89: 857-862, 1997. 

44. Smith, A. L., Hung, )., Walker, L-, Rogers, T. E., Vuitch, F., Lee, 
E ami Gazdar, A. P. Extensive areas of aneuploidy are present in the 
' respiratory epithelium of lung cancer patients. Br, J. Cancer, 73: 203- 
209, 1996- 

45 Franklin W. A., Gazdar, A. F., Haney, J., Wisruba, 1. L, La Rosa, 
F. a, Kennedy, T., Ritchey, D. M., and Miller, Y. E. Widely dispersed 
p53 mutation in respiratory epithelium: a novel mechanism for field 
carcinogenesis. J. Clin. Investig., 100: 2133-2137, 1997. 

46 Johnston, W. W., and Elson, C. E. Respiratory tract. Jn:M. Bibbo 
(ed.) ? Comprehensive Cytopathology, pp. 325-401. Philadelphia: Saun- 
ders, 1997. 

47. Travis, W. D., Travis, L. B., and Devesa, S. S. Lung cancer. Cancer 
{Phila.), 75 (Suppl. 1): 191-202, 1995. 

48. Wynder, E. L., and Muscat, J. E. The changing epidemiology of 
smoking and lung cancer histology. Environ. Health Perspect., 103 
(Suppl. 8); 143-148, 1995. 

49. Thun, M. J.. Lally, C. A., Flannery, J. T., Calle, E. E., Flanders, 
W. D., and Heath, C. W., Jr. Cigarette smoking and changes in the 
hisiopathology of rung cancer. J. Natl. Cancer Inst., 89: 1580-1586, 
1997. 

50. Lam, $., and Shibnya, H. Early diagnosis of lung cancer. Clin. 
Chest Med, 20; 53-61, 1999. 

51. Kennedy, T. C, Proudfoot. S., Piantadosi, S., Wu, L., Saccomanno; 
G * Petiy, T. L and Tockman, M. S. Efficacy of two sputum collection 
techniques in patients with air flow obstruction. Acta CytoL 43: 630- 
636, 1999. 

52. BrowmarK G. P., Arnold, A., Levine, M. K, and D'Souza, T. Use 
of screening phase data to evaluate observer variation of sputum cyto- 
diagnosis as an outcome measure in a chemoprevention trial. Cancer 
Res., 50: I216-I2I9, 1990. 

. 53. Canloboni. A.. Pe2zotta. M. 0,. Sitoni, M., and Porcellati, M. 
Quality assurance in pathology: cytologic and histologic correlation. 
Acta Cytol, 36: 717-721, (992. 

54. Holiday. D. B., McLany, J. W., Farley, M. L., Mabry, L. C. 
C02Cns, D , Roby, T.. Waldion. E., Underwood, R. D., Anderson, E., 
and Coibielh, W. Sputum cytology within and across laboratories. A 
reliability study. Acta CytoL 3!>:~l 95-206, 1995. 

55. Risse. E. K. Voorjs. G. P., and van') Hoff, M. A. Relationship 
between the cellular composition of sputum and the cytologic diagnosis 
of lung cancer. Acta Cylol... 3t: 170-176, 1987. 

56. MacDougall. B., and Wcinerrnan, B. The value of sputum cytology. 
J. Gen. Inlem. Med.. 7: 11-12, 1992. 

57. Tockman, M. S.. Erozan. Y. S.. Gupia. P., Piantadosi, S,, Mulshine. 
J. L and Ruckdeschel. J. C. The early detection of second primary lung 

' cancers bv sputum immunosiaming. Chest. 106 (Suppl.): 385S-390S, 
1994. 

5S. Tnckman ; M. S. ; Gupta, P. K.. Myers, J. D... Frost, J. K. r Baytin, 
S. B. Gold. E. B.. Chase, A. M-. Wilkinson. P. H., and Mulshine, J. L. 
Sensitive and specific monoclonal antibody recognition of human lung 
cancer antigen on pieserved fputwn cells: a new approach to early lung 
cancer deter iron. .1. Clin. Oncol. ; 0: 1 6*5-1693. 1983. 

59. Zhou. A. Mulshine. .). t... Unswoith. t. J., Scott. F. M., Avis. 1. M. r 
Vos. M. D.. and Trerton, A. M. Purification and characieriz.-wion of a 
protein lhat permits caily tierce lion of tune cancel. J. Biol. Chcm. r 27 1 : 
10760-10766, 1996. 

60. Qiao Y-L, Tockman. M $.. Li. L .. Ero2an. Y. S., Yao, S. X, : 
Barren. M A. Zhou. W. II.. Giffrn. C. A.. I.uo. X C, and Taylor, P. R. 
A case- cohort study o( .tn early bt^rmukcr of lunj.' cancer in a m iccntng 
rohon of Yunnan tin mnici> in China Omar Epidemiol Biomark. 
Piev, 6. S93--900, 1997 

t-l Oiovn. F. L.. and Pi:»iiiad.-jM. S Kivuncm-e and Mirvnal following 
rejection of hronchtoli-alvcolai i':u» mom a of iht tun-: the Lung Camo 
Study Gioupr.xpcrj.-ner Ann Surg.. ?09: 779-790. I9S9. 
iP Tockman M. S. Adv:mn-> in :pulum analysts lor M'le.cnmg ^nd 
e-;„"ly deitciiim ot h.np v. we. Gmct. Control, 7: I9-M ; T000. 



63. Sueoka, E., Goto, Y. f Sueoka, N., Kai, Y., Kozu, T., and Fujiki, H. 
Heterogeneous nuclear ribonucleoprotein Bl as a new marker of early 
detection for human lung cancers. Cancer Res., 59: 1404-1407, J 999. 

64. Mulshine, J. U Reducing lung cancer risk. Early detection. Chest, 
//6V493S-496S, i999. 

65. Mao, U ; Hruban, R. R, Boyle, J. O., Tockman, M., and Sidransky, 
D. Detection of oncogene mutations in sputum precedes diagnosis of 
lung cancer. Cancer Res., 54: 1634-1637, 1994. 

66. Ruppert, J. M. t Tokino, K., and Sidransky, D. Evidence for two 
bladder cancer suppressor loci on chromosome 9. CanceT Res., 5J. 
5093-5095, 1993. 

67. Nawroz, H., van der Riet, P., Hruban, R. H., Koch, W, Rupperl, 
J. M., and Sidransky, D. Allelotype of head and neck squamous cell 
carcinoma. Cancer Res, 54: 1 1 52- J 155, 1994. 

68. Miozzo, M„ Sozzi. G., Musso, K., Pilotti, S., Incaibone, M, Pas- 
torino, U, and Pierotti, M. A. Microsatellite alterations in bronchial and 
sputum specimens of lung cancer patients. Cancer Res., 56: 2285-2288, 
1996. 

69. Shapiro, G. 1, Park, J. E, Edwards, C. D, Mao, L, Meilo, A., 
Sidransky, D, Ewen, M. E. 3 and Rollins, B. J. Multiple mechanisms of 
pI6 JNK4A 'inaciivation in non-small cell lung cancer cell lines. Cancer 
Res, 55/ 6200-6209, 1995. 

70. Hamada, K, Kolmo. T., Kawanishi, M, Ohwada, S, and Yokota, J. 
Association of CDKN2A(pl6)/CDKN2B9pl5) alterations and homozy- 
gous chromosome arm 9p deletions in human lung carcinoma. Genes 
Chromosomes Cancer, 22: 232-240, 1998. 

71. Belinsky, S. A. : Nikula, K. J, Palmisano, W. A, Michels, R, 
Saccomanno, G, Gabrielson, E, Baylin, S. B, and Herman, J. G. 
Aberrant methylation of pl6 inh4 * is an early event in lung cancer and a 
potential biomatker for caily diagnosis. Proc. Natl. Acad. Sci. USA, 95: 
11891-1 1896 ; 1998. 

72. Esteller. M, Sanchez-Cespedes, M, RoseJI. R, Sidransky, D , 
Baylin. S. B, and Herman, J. G. Detection of aberrant promoter byper- 
meihylation of tumor suppiessor genes in serum DNA from non-small 
cell lung cancer patients. Cancer Res., 59: 67-70, 1999. 

73. Nieburgs, H. E. Recent progress in the interpretation of malignancy 
associated changes (MAC). Acta Cyiol, 12: 445-453, 1968. 

74. Payne, P. W., Sebo, T. A, Doudkine, A, Garnei.. D., MacAulay, C, 
Lam, S., LeRiche, i. C... and Palcic, B. Sputum screening by quantitative 
microscopy: a reexamination of a portion of the National Cancer Instt- 
rute. Cooperative Early Lung Cancer Study. Mayo Clin. Pioc, 72: 
697-704, 1997. 

75. Lam, S, Palcic. B., Garner, D, Beveridge, A, MacAulay, C. 
LeRiche, A, and Coldman, A. Lung Cancer Control Strategy in the New 
Millennium. Lung Cancer, ?9 (Suppl. 2); 145. 2000. 

76. Ah.endt, S. A., Chow. A.T.. Xu, L. H. : Yang. S. C, Eisenbetger, 
C. F. : Esteller. M. : Herman, J. G, W» ; L., Decker, P. A.. Jen, ).. and 
Sidiansky, D Molecular detection of tumor cells in bronchoalveolat 
lavage fluid from patients with early stage lung canter. J. Nail. Cancer 
InM.,'*?;. 332-339, 1999. 

77. Gn^dar. A. P.. and Minna, A D. Molecular detection of early lung 
cancer. A Nail. Cancer Inst.. 91: 299-301, 1999. 

7Sv Field. A K.. Liloylou : 7.. Xinarianos, G., Prime. \V, Fielding, P.. 
Walshaw. M. J., und Turiipitll, L. Generic alterations in bronchial lavage 
as a potential maiker foi individuals with a high risk of developing lung 
cancer. Come. Res . 59: 2690-2695, 1999. - 

79. LiJogrW \. r Moloney, P., Xinananos. G.. Fear, S,. and Field, A K. 
Sens'niviry tu.J limitations of high throughput fluorescent mirtoyaielliie 
analysis for ihf deieetion o( allelic imbalance. Application in lung 
u.mois. Int. J. Oncol. 10. i-l-A 2000. 

Fii-Minf. I' . "luinhuH. I . Piime. W, W'3hhow : M. : and Field. A K. 
iletttogonron:, imcleai >ibor;iu leopiorrin A7/B! up rcgulation in bron- 
chial lavace -,|-e< inVTiifa clinical maiker of early lung cancer Jeteciion 
Clin. Cancer 5 I9V9. 

SI. f':mit'l, K. Schhrnok. C, AngMM-unn. M. Wecfcennaitn. D. 
Schmaus. W.. Oath. H.. I';:s:,!ick, B. : bbicki. A R.. and Kit iluntillei. G 
Med»odoloric:H fin:i!\;is immnnf-cytixhrmieal screening foi flissem- 



20 Revkw: Advances in Early Detection of Lung Cancer 



mated epithelial rumor cells in bone marrow. J. Hematother., 3: 165- 
173, 1994. 

82. Lambrechls, A. C, van't Veer, L. J., and Rodenfmis, S. The detec- 
tion of minimal numbers of contaminating epithelial tumor cells in 
blood or bone marrow: use, limitations and future, of RNA-based meth- 
ods. Ann. OncoL 9: 1269-1276, 1998. 

83. Steinman, C. R. Free DNA in serum and plasma from normal 
adults. V Clin. Investig., 66: 1391-1399. 1980. 

84. Rapris, L., and Menard, H. A. Quantitation and characterization of 
plasma DNA in normals and patients with lupus erythematosus. J. Clin. 
Investig., 66: 1391-1399, 1980. 

85. Leon, S. A., Shapiro, B., Sklaroff, D. M., and Yaros, M. ). Free 
DNA in the serum of cancer patients and the effect of therapy. Cancer 
Res., 37: 646-650, 1977. 

86. Stroun, M. ; Anker, P., Maurice, P., Lyautey, J., Lederrey, C, and 
Beljanski, M. Neoplastic characteristics of the DNA found in the plasma 
of cancer patients. Oncology, 46: 318-322, 1989. 

87. Shapiro, B., Chakrabary, M. r Cohn, E., and Leon, S. A. Determi- 
nation of circulating DNA levels in patients with benign or malignant 
gastrointestinal disease. Cancer (Phila ), 5/: 2 1 16-2 120, 1 983. 

88. Chen, X..Q., Stroun, M., Magnenat, J-L., Nicod, L. P., Kurt, A. M., 
Lyautey, J., Ledertey, C, and Anker, P. Microsaicllite alterations in 
plasma DNA of small cell lung cancer patients. Nat. Med., 2: 1033- 
1037,1996. 

89. Anker, P. ; Lefort. F., Vasioukhin, V., Lyautey, J., Lederrey, C, 
Chen, X. Q., Stroun, M., Mulcahy, H. E., and Farthing, M. ). K-ras 
mutations are found in DNA extracted from the plasma of colorectal 
cancer patienis. Gastroenterology, J 12: II I4-M-29, 1997. 

90. Kopreski, M. S., Benko, F. A., Kwcc, C, Leitzel. K. E. ; Eskander, 
E v Lrpton, A., and Cocke, C. D. Detection of mutant K-ras DNA in 
plasma or serum of patients with colorectal cancer. Dr. J. Cancer, 76: 

. 1293-1299, 1997. 

91. Hibi, K., Robinson, R., Wu, L, Hamilton, S. R.. Sidransky, D., and 
Jen, J. Molecular detection of genetic alterations in the scrum of colo- 
rectal cancer patients. Cancer Res... 5S: 1405-1407, 1998. 

92. Soreoson, G. D., Pribish, D. M„ Valone, F. II. , Mcmoli, V. A., 
B2ik, D. J., and Ydo. S. L. Soluble normal and mutated DNA sequences 
from single copy genes in human blood. Cancer Epidemiol. Biomark. 
Prev., J; 67-71, 1994. 

93. Mulcahy. H. E., Lyautey, J., Ledcney. C. qi Chen, X., Anker, P., 
A Is lead. E. M.. Ballinger : A., Farthing, M. J., and Stroun, M. A pro- 
spective study of K-ro.v mutations in the plasma of pancreatic cancer 
patients. Clin' Cancer Res., 4: 271-275, 1998. 

94. Vasioukhin. V.. Anker, P. Maurice. P.; Lyautey, 1, Ledeney, C, 
and Slronn, M. Point mutations of the N-ras in the blood plasma DNA 
of patienis with mvelodysplastic syndrome or acute myelogenous leu- 
kaemia. Br. J. Haematol., 86: 774-779, 1994. 

95. Woolner, L. B. Pathology of cancer detected ecologically, in: Atlas 
of Early Lung Cancer. National Cancer Institute, NIIL Untied Slates 
Department of Health and Human Services, pp. 107-203. Tokyo: Igoku- 
Shoin, 1983. 

96. Kato, It. and Conese, D. A. Eaily detection of Innj; cancer by 
means of brmatoporphyrin derivative fluorescence and laser phoio- 
radiation. Clin. Chest Med.. 6: 237-253, 1985. 

97. Piofio. A. E.. Doimn, D. R. : and King. E. G. L. Laser lluoiciccncc 
bronchoscopy loi localization of occult lung mmois. Med. Phy.v ("NY). 
6: 532-535. 1979. 

9S. Kinsey, J. H . and Conese, D. A. Endoscopic system f«-r simulta- 
neous visual oaminaiion and electronic detection ol fluorescence, Rev, 
Sci. )nstrum ; 5/: 1-103- NOd.. 1 980. 

99. Profio. A. f;.. Doiion. D R.. :md Saiuaik. J Fluutomettr f>-i 
endoscopic diagnosis of minors. Mr J. Phys. (NV'l. !J M6-520. 19S-J. 

100. Median. S.. Svanbcig. K.. and S\anbei(\ S Mnliicnlct imaging 
r,n<i contrast enhancement in cancer- tumm to( alr/ation using laser- 
induced fluorrseence in liematopornhw in- der iv;iti\e- bearine tissue. Op- 
tics I. nt.. 10 50-5?. I9S5. 



101. Kato, U., Imaizumi, T., Aizawa, K., Iwabuchi, H., Yamamolo, H_, 
Jkeda, N. f Tsuchida, T., Tamachi, Y., Ito, T. f and Hayata, Y. Photody- 
namic diagnosis in respiratory tract malignancy, using an excimer dye 
laser system. J. Photochero. Pholobiol. Biol., 67 189-196, 1990. 

102. Kato, H., and Ikeda, N. The role of fhioresceiKe diagnosis in the 
early detection of high-risk bronchial lesions. J. Broncho!., 5: 273-274, 
1998. 

103. Palcic, B., Lam, S., Hung, J., and MacAulay, C. Detection and 
localization of early lung cancer by imaging techniques. Chest, 99: 
742-743, 1991- 

104. Hung, K La"»> S. ; LeRrche, J. C, and Palcic, B. Auto fluorescence 
of normal and malignant bronchial tissue. Lasers Surg. Med., //: 
99-105, 1991. 

105. Lam, S., Kennedy, T., Unger, M., Miller, Y. E, Gelmonl, 
Rusch, V., Gipe, B, Howard, D., LeRichc, J. C, Coldman, A., and 
Gazdar, A. F. Localization of bronchial intraepithelial neoplastic lesions 
by fluorescence bronchoscopy. Chest, 7/3: 696-702, 1998. 

106. Kurie, J. M., Lee, J. S., Morice, R. C, Walsh, G. L, Khuri, F. R., 
Broxson, A., Ro JY. Franklin, W. A., Yu, R., and Hong, W. K. 
Autofluorcscence bronchoscopy in the detection of squamous metapla- 
sia and dysplasia in current and former smokers. L Natl. Cancer Inst., 
90: 99J-995, 1998. 

107. Vcnmans, B. J., van der Linden, J. C, van Boxem, A. J., Postmus, 
P. E., Smil, E. F., and Sutedja, G. Early detection of pre- invasive lesions 
in high risk patients. A comparison of conventional fiberoptic and 
fluorescence bronchoscopy. J. Bi one hoi., 5; 280-283. 1998. 

108. Vermylen, P., Pieraid, P., Roufosse, C, Bosschaerts, T., Verhest, 
A., ScuJier, J-P, and N inane, V. Detection of bronchial preneoplastic 
lesions and early lung cancer with fluorescence bronchoscopy: a study 
about its ambulatory feasibility under local anaesthesia. Lung Cancer 
(Limerick), 25: J61-168, 1999. 

J 09. Kennedy, T., Hirsch, F. R., Miller, Y., Prindiville, S., Bunn,P. A. r 
Jr.. and Franklin, W. A randomized snjdy of fluorescence bronchoscopy 
versus white- light bronchoscopy for early detection of lung cancer in 
high risk patients. Lung Cancer (Lirncrick) r 29 (Suppl. 2): 244, 2000. 

1 10. Gazdar. A. F., and Minna, J. D. Angiogenesis and the multistage 
development of lung cancers. Clin. Cancer Res., 6. J 611- 1 617. 2000. 

111. Henschke, C. 1... Miettinen, O. S. : Yankelevitz, D. F. f Libby, 
D. M., and Smith, ). P. Radiographic screening for cancer: proposed 
paradigm for requisite research. Clin. Imaging, }S: 6-20. 1994. 

1 12. Mieiiincn. O. S. Screening for lung cancer. Radiol. Clin. N. Am.. 
38: 479-486, 2000. 

1 13. Marcus, P. M., Bcrgsiralh, E. J., Fageisuorn, M.. Willliams. D. E., 
Fontana, R., Taylor. W. F.. and Prorok, P. C. Lung cancer mortality in 
the Mayo Lung Project: impact of extended follow-up. J. Nat). Cancer 
Inst... 97: I308"-I3I6, 2000. 

1 14. Black. W. C. Overdiagnosis. An iindrnecug nized cause of confu- 
sion and harm in cancer screening. .1. Nad. Cancer Jnst.. 97: 1280-1282, 
2000. 

I 15. The Health, and Welfare Statistics Foundation. 7 he current issue 
on the screening project for eldetly people by the Ministry of Health and 
Welfare Japan. J. Health Welfare. 47: 37 : 1995. 

116. Sobue. T.. Suzuki. 7.. and Natuke. 7. The Japanese L nn£- Cancel - 
Scrcenini; Research Group: a case-conuol study for evaluating lung 
cancer scieening in Japan. Int. J. Canrer. >0: 230-237. 199?. 

117. Oknmolo, N. ? Suzuki. 7.. Hasrgawp, II., Cotoh. 7 . f Hagiwart. S. r 
Sekimoto. M.. and Kanefco. M. Evahianon of a eljnic bascd soecning 
piociam lor lung canrti witli a ease-contiol dt sigi* ri> Kimagawa. Japan. 
Lung Cancer I Lime-rick). 75 77- S 5. 1999. 

US, Muhm, 1 R .- Mdlci. W. f . : Fomana. R S , Sandt ison, I). R_. and 
Uhlenhi>p]\ M A. Lunr rancei drircteJ dining 5 screening pio^ram 
u.sinp fout ■ month chrsi^adfopiaphs. Ra.liology. 609-615.. I9S3. 
»)') Hav;ihuclii. N . Russell. W J. ,mj Murakami. J Problems in 
ladio^rapbie dotc(in>n and di:i^n<">>ts of lung cjnen Ada Radiol. 
tCPfH. 30. 163- J 67. 1989 

K'0. Wnodimg. J H Pitfalls in ihe iadiclo{Mc dij-nr>: is ui Itmi: cancer. 
Am } Rot-nipcnoL I I fo- 1 1 ~- :"■ 1990 



Clinical Cancer Research 



121. Greene, R. E. Missed lung nodules: lost opportunities for cancer 
care. Radiology, 182: 8-9, 1992. 

122. Austin, J. H. M.. Rornney, B. M., Goldsmith, L. S. Missed bron- 
chogenic carcinoma: radiographic findings in 27 patients with a poten- 
tially resectable lesions evident in retrospect. Radiology, 182: 115-122, 
1992. 

123. Schaner, E. G., Chang, A. E.-, Doppan, J. L., Conic le, D. M., Flye, 
M. W., and Rosenberg. S. A. Comparison of computed and conventional 
whole lung tomography in detecting pulmonary nodules. Am. J. Roent- 
genol., 131: 51-54. 1978. 

124. Webb, W. R. Radiologic evaluation of solitary pulmonary nodule. 
Am. J, Roentgenol., J 54: 701-708, 1990. 

125. Kalender, W. A., Seissler, W.„ Klolz, E., and Vock, P. Spiral 
volumetric CT with single-breath- hold technique, continuous transport, 
and continuous scanner rotation. Radiology, 176: 181-183, 1990. 

126. Costello, P.. Anderson, W. A., and Blume, D. Pulmonary nodule: 
evaluation with spiral volumetric CT. Radiology, 179: 875-876, 1991. 

127. Remy-Jardin, M., Ciraud, F„ and Marquette, C. H. Pulmonary 
nodules: detection with thick section spiral CT versus conventional CT. 
Radiology, 187: 513-520, 1993. 

128. Davis, S. CT evaluation for pulmonary metastases In patients with 
exrrathoracic malignancy. Radiology, J 80: l-12 7 1 991. 

129. Kaneko, M., Egoebi, K., Obmarsu, 11., Kakinuma, R., Naruke, T., 
Suemasu, K., and Moriyama,N. Peripheral lung cancer: screening and 
detection wiih low-dose spiral CT versus radiography. Radiology, 201: 
798-802, 1996. 

130. Sone, S., Takashima, S., Li, F., Yang.. Z., Honda, T., Maruyama, 
Y., Hasegawa, M., Yamanda, T., Kubo, K„ Hanamura, K., and Asakura, 
K. Mass screening for lung cancer with mobile spiral computed tomog- 
raphy scanner. Lancet, 3M: 12-12-1245^ 1998. 

131. Henschke, C. I., McCauIey. D. I.. Yonkelcvitz, D. F., Naidich, 
D. P., McGuinness, G., Miettincn. O. S., Libby, D. M., Pasmantier, 
M. W. t Koizumi. I, Altoiki, N. K.. and Smith, J. P. Early Lung Cancer 
Action Project: overall design and findings from baseline screening. 
Lancet, 354 99-105, 1999. 

132. Kakinuma, R. f Ohmatsu : H., Kaneko, M., Eguchi. K. f Naruke, T., 
Nagai. K.. Nishiwaki. Y. f Suzuki, A., and Moriyama, N. Detection 
failures in spiral C7 screening for lung cancer: analysis of CT findings. 
Radiology, 217: 61-66, 1999. 

133. Paranjpr. D. V., and Eteigin. C. J. Spiral CT of the lungs: optimal 
technique and resolution compared with conventional CT. Am. J. Roent- 
genol., 167: 561-567.- 1994. 

134. Seller S. E.. Judy, P. F., Adams. D. F., .lacobson, F. L.- Stark, P.. 
Kikinis. R., Swensson. R. G., Hooion, S. f Head, B., and FcJdman, U. 
Spiral CT of ibe chest; comparison of cine and film-based viewing. 
Radiology. 197: 73-75, 1995. 

135. Cioisille. P.. Sou to. M., Cova : M., Wood, S., Afewoik. Y., Kuhl- 
man.'J. E.. and Zcrhouni. E. A. Pulmonary nodules: improved detection 
with vascular segmentation and extraction with spiral CT. Radiology. 
197: 397-401. 1995. 

136. Kanazawa. K.. Kawnta. V.. Niki. R. Satoh, R, Obmntsu, R, 
Kakinuma. R„ and Kaneko. M Computer- aided diagnosis for pulmo- 
nary nodules based on he tic a) CT im.-»<:cs. Comput. Med. Imaging 
Graph. 22; I57-Ib7. 199*. 

137. Buckshee, N.. Afmcllo. K.. Yankc)tvii7.. D. F., Mancuso. C and 
f leiisihke. C. I Smoking habits and overall satisfaction after early tune 
cancer screening usinc low- dose C7. Al.A'A7S International Confer- 
ence, 1999. 

J3K. Coleman. R. E. I'f.1 in lung cancer. J. Nucl. Med.. 40: SM-S2Q : 
1999. • 

139. Maitia A. WisMiiba. I 1.. Vinnnni. A K . Sakaguchi. M . Park. I.. 
Srurk>' : A.. Milthgmb. S . Gibbons. 0 . Minna. .1. D.. and Gnzdai, A F. 
Fnmhmeni of cpi.hclbl cells for molefubi studies. Nat. Med.. 5. 

HO. Kopclovwh. I . Ilcnson. 0. I: , C-:izdat. A J.. Dubb. 13. . Siiv;*- 
ia\3, S . Krlloff G. ) . and GrccrmTild. \' iMmo^ate jinaioinit/hincr tonal 



sites for evaluating cancer risk: an extension of the field effect. Clin. 
Cancer Res., 5: 3899-3905, 1999. 

141. OowelL R. E., Gilliland, F. D., Temes, R. T , Harms, H. J., NefL 
R. E. : Heaphy, E., Auckley, D. H., Crooks, L, A., Jordan, S. W., Samet, 
J. M^ ( Lechner, J. F., and Belinsky, S. A. Detection of trisomy 7 in 
oonmalignant bronchial epithelium from lung cancer patients and indi- 
viduals at risk for lung cancer. Cancer Epidemiol. Biomark. Prev., 5: 
631-637, 1996, • 

142. Somers, V. A., van Henlen, A. M., ten Velde, G. P., Arends, J. W., 
and Thunnissen, F. B. Additional value of K-ras point mutations in 
bronchial wash fluids for diagnosis of peripheral lung rumours. Eur. 
Respir. J., 13: 1 120-1 124. 1999. 

143. Yahata, N., Obyashiki, K., Obyashiki, J. H., Iwama, H., Hayashi, 
S., Ando, K., Hirano, T., Tsuchida, T., Kato, H., Shay, J. W„ and 
Toyama, K. Telbmerase activity in lung cancer cells obtained from 
bronchial washings. J. Natl. Cancer Inst., 90: 684-690, 1998. 

144. Wistuba, L 1., and Gazdar, A. F. Molecular abnormalities in the 
sequential development of lung carcinoma. In: S. Srivastava, D. E. 
I Jenson, and A. F. Gaidar (eds.), Molecular Pathology of Early Cancer, 
pp. 265-276. Amsterdam: IOS Press, 1999. 

145. Yashima, K., Lilzky, L. A., Kaiser, L., Rogers,. T., Lam, S., 
Wistuba, 1. 1., MilchgTub, S., Srivastava, S., Piatyszek, M. A., Shay, 
J. W. ; and Gazdar, A. F. f etomerase expression in respiratory epithe- 
lium during ibe multistage pathogenesis of lung carcinomas. Cancer 
Res., 57: 2373-2377, 1997. 

146. Scott, F. M., Treston, A. M., Shaw. G. L., Avis, L, Sorenson, J., 
Kelly, K. ( Dempsey, E. C, Cantor, A. B., Tockman, M., and Mulshine, 
J. L. Peptide amidaiing activity in human bronchoalveolar lavage fluid. 
Lung Cancer (Limerick), H: 239-251, 1996. 

147. Scott, F. M., Modali.. R., Lehman, T. A., Seddon, M., Kelly, K.. 
Dempsey, E. C, Wilson, V., Tockman, M. S., and Mulshine, 1. L. High 
frequency of K-ras codon 12 mutations in bronchoalveolar lavage fluid 
of patients at high risk for second primary lung cancer. CJin. Cancer 
Res., 3: 479-482, 1997. 

148. Mills, N. E.. Fishman. C. L., Rom. W. R, and Jacobson, D. R. Has 
oncogene detection in broncbioJoalveolar lavage fluid from patients 
with lung cancer. Lung Cancer (Limerick), J (Stippl.); II, 1994. 

149. Nawroz. 11., Koch ; W., Anker, P., Slroun. M.. and Sidransky, D. 
Microsalellite alterations in senim DNA of head and neck cancer pa- 
tients. Nat. Med. ; .2; 1035 -J 037, 1996. 

150. Brcnnan, J. A., Mao, L., llmban. R. H., Boyle. J. O., Eby, Y. J., 
Koch, W. M., Goodman, S. N., and Sidransky. D. Molecular assessment 
of histopathologicn! staging in squamous-ccll carcinoma of the head and 
neck. N. Engl. ). Med., JJ7; 429-435, 1995. 

151. Ahicndi, S. A.. Yang. S. C, Wu, L., Westra, W. H., Jen, J., 
Califano, J. A., and Sidransky, D. Comparison of oncogene mutation 
detection and lelomeiasc activity for the molecular staging of non-small 
cell lung cancer. Clin Cancer Res.. 3: 1207-1214, [997. 

157. Gazdar. A. F., ami Minna. J. D. NCI scries of cell lines: an 
historical perspective. J. Cell. Brochcm., 24 (Suppl): 1-11, 1996. 
153. Wistuba. I. )., Bryant. D.. Bcbrens. C, Milchgnib, S., Virmani, 
A. K.. A.shfa<v \\.. Minna. J. D.. and Gradar. A. F. Comparison of 
features of human lunji cancer cell line;; and their cor responding tumors. 
Clin. Cancer Re.v. b: 991-1000, 1999. 

I .VI. Amfimi, P, Reddrt. R. P.. Pfeifer, A.. Malan. S. L... Mark. G. D.. 
and H^rrir. C C. NfopbMii transformation of a human bronchial 
epithelial ctf) hne by :> recombinant icliovinis encoding virjl ilanry 
tai. Mol. Cnrr inL^.. ; I S J- J 60. I9SS 

155. franklin. W. A . lolkvoid, J. M.. Varelta-Gaicia, M., Kennedy,!., 
Pioudfoot. S.. Cook, R . Dempsey. E. C. Helm. K.. Bunn. P. A., and 
Miller. Y F r:,\p;inNton c| bronchial cptdielia! fell pt»pulaiions bv in 
\ iuo cuhuie of oplantN from dvsplasnc and histol'tpically normal sites. 
Am. J Re;.prr. Cell Mol I'tiol . i5 1*97-304. 1996.' 
!.'■{>. K();tv. K. Spriv M. 7 no. A . and Wiencke. J. Deletion of 
j-luirithj.-ni: J7-tt:;n:.tcr:i:.e r p :j:>d c ln?.s 0 rents inteiiitts to errhance 
Miseepimihiy m lunif csneei m nnni'niy populations Cancel Causes 
Conunl. \r'r- 



Review: Advances in Early Detection of Lung Cancer 



157. Wu, X., Zhao, Y., Honn, S. E., Tomlinson, G. E, Minna, J. D, 
Hong, W. K., and Spitz, M. R. Benzofajpyrene dioJ epoxide-induced 
3p21.3 aberrations and genetic predisposition to fung cancer. Cancer 
Res., 58: 1 605-1608, 1998. 

158.. Virmani, A. K., Fong, K. M., Kodagoda, EL, Mclntire, D., Hung, 
J., Tonk, V., Minna, J. D., and Gazdar,, A. F. AHelolyping demonstrates 
common and distinct patterns of chromosomal loss in human lung 
cancer types. Genes Chromosomes Cancer, 21: 308-319, 1998. 

159. Takahashi, T., Nau, M. M., Chtba, J.; Birrer, M. J., Rosenberg, 
R. K., Vinocour, M., Levitt, M., Pass, H., Gazdar, A. F., and Minna, 
J. D. p53: a frequent target for genetic abnormalities in rung cancer. 
Science (Washington DC), 246: 491-494, 1989. 

160. Forgacs, E„ Biesterveld, E. J., Sekido, Y-, Fong, K. M., Muneer, 
S., Wisruba, 1., Milcbgrub, S., Brezihscbek, R., Virmani, A., Gazdar, 
A. F., and Minna, 3. D. Mutation analysis of the PTEN/MMAC) gene in 
lung cancer. Oncogene, 17: 1557-1565, J 998. 

161. Milsudomi, T., Steinberg, S., Oie, R K., Mulshine, J. L., Phelps, 
R., Viallet, J., Pass, H., Minna, J. D., and Gazdar, A. F. ros gene 
mutations in non-small cell lung cancers are associated with shortened 
survival irrespective of treatment intent. Cancer Res., 51: 4999-5002, 
1991. 

162. Sozzi, G., Veronese. M. L. ( Negrini, M. t Baffa, R., Corticelli, 
M. G., Inoue,. H.. Tomielli, S., Pilotti, S.. Ohta, M., Huebner. K., and 
Croce, C. M. The FHIT%tm z\ 3pH.2 is abnormal in lung cancer. Cell, 
85: 17-26, 1996. 

" 163: Anbazhagan, R. r Tihan.T., Bomman. D. M.. Johnston. J. C, Saltz, 
J. H., Wcigering, A., Piantadosi, S., and Ga,brielson, E. Classification of 
small cell lung cancer and pulmonary, carcinoid by gene expression 
profiles. Cancer Res., 59: 5119-5122, 1999. 

J 64. Hibi. K„ Liu, Q.. Beaudry, G. A. f Madden, S. L., Westra, \V. H., 
Wehage, S. L., Yang, S. C, Hei'imiHer, R. F., Bertelsen, A. H., Sidran- 
sky. D., and Jen. J. Serial analysis of gene expression in non-small cell 
lung cancer." Cancer Res., 58: 5690-5694, 1998. 
165. Soz2i. G. t Pastor ino f U.. Moiraghi, L.. Tagliabue, E., Pezzella, F... 
Ghirelli, C, TornicIU. S., Sard, L., Huebner, K., Picroiri. M. A.. Croce.. 
C M., and Pilot li. S. Loss of FMIT function in lung cancer and 
preinvasive bionchial lesions. Cancer Res. r 58: 503 2-503 7 . 199$. 



166. Tockma% M. S., and Mulshine, J. L. Sputum screening by quan- 
titative microscopy: a new dawn for detection of rung cancer? Mayo 
Clin, Proc., 72: 788-790, 1997. 

167. Izzo, J., and Hittelman, W. N. Characterization of multislep ru- 
morigenesis by in situ hybridization. In: M. Andreeff and D. Pinkel 
(eds.), Introduction to FISH. In press, 2001. 

168. Neft, R. E., Crowell, R. E, Gilliland, F. D., Murphy, M. M., Lane, 
J. L., Harms, H., Coons, T., Ileapby, E.; Belinsky, S. A., and Lechner, 
J. F. Frequency of trisomy 20 in nonmalignanl bronchial epithelium 
from hing cancer patients and cancer- free former uranium miners and 
smokers. Cancer Epidemiol. Biomark. Prev, 7: 1051-1054, 1998, 
169- Heppell-Parton, A. C, Nacheva, E., Carter, N. P., and Rabbitts, 
P. H. A combined approach of conventional and molecular cytogenetics 
for detailed karyotypic analysis of the small cell rung carcinoma cell line 
U2020. Cancer Genet. Cytogenet., 108: 110-119, 1999. 

170. Lechner, I F., Neft, R., Gilliland, F. D., Crowell, R. E., Auckely, 

D. H., Temes, R. T., and Belinsky, S. A. Individuals at high risk for lung 
cancer have airway epithelial cells with chromosome aberrations fre- 
quently found in lung tumor cctts. In Vivo J 2: 23-26, 1998. 

17.1. Walch, A. K., Zitzelsberger. H. F., Aubele, M. M., Maitis, A. E., 
Bauchinger, M., Candidus, S., Prauer, H. W., Wemer, M., and Hofler, 
H. Typical and atypical carcinoid tumors of the lung aie characterized 
by llq deletions as delected by comparative genomic hybridization. 
Am. J. Pathol.,. 15 3: 1089-1098, 1998. 

1 72. Petersen, I., Bujard, M.. Petersen, S., Wolf, G., Goeze, A., Sch- 
wendel, A., Langreck, H v Geilcrt, K., Reichel, M., Just, K., du Manoir, 
S., Cremer, T., Dielel, M., and Ried, T. Patterns of chromosomal 
imbalances in adenocarcinoma .and squamous cell carcinoma of the 
lung. Cancer Res., 57: 2331-2335, 1997. 

173. Smith. A. L., Hiing, J., Walker, L., Rogers, T. E., Vuitch, F., Lee, . 

E. , and Gazdar, A. F. Extensive areas of aneuploidy are present in the 
respiratory epithelium of lung cancer patients. Br. J. Cancer, 73: 203- 
209, 1996'. 

174. MacAufay, C E., Lam, S., Kein- Parker, H., Gazdar, A., Guillaud, 
M., Payne, P., LeRiche, J., Dawe, C, Band, P., and Palcic, B. Interme- 
diate endpoint biornarkcis for lung C3nccr chemoprcvention. SPJE. 
3260: 207-21 1, 1998. 



Localization of tissue inhibitor of nictaUoproteinascs 1 (TDMP-1) in human 
colorectal adenoma and adenocarcinoma. ' 

S^-^^ fecaU, B™jV, NielscnHJ, L.cn.ann M Nie.sen BS. 
Tlie Finsen Laboratory, Rigshopitalet, Denmark. 

uvii is elevated in blood from colorecta cancer patients and that hiah rri/m i i i 
predict poor prognosis. To clarify the role of TIMP-Mn c h, & ■ L^* 
Expression nattfm nfTR/fP i • 1 • . 1 1UYU Mn colorectal tumongenesis, the 
all oT?4 OT T , ^ m bCmgn md mali &™« colorectal tumors was studied In 
all of 24 eases of colorectal adenocarcinoma T1MP-1 mRNA was detected 

eondus Ton TmpTI 1,,fl ™ ati - - epithehal-stromal interface. In 

ex ressTd b ^fibST" " 3 ^ in ^ '« -Ion tissue but is highly 
pressed by myofibroblasts .n assoaaUon with invading colon cancer cells. 
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Localisation of Tissue Inhibitor of Metalloproteinases 1 (TlMP-l) in Human 
Colorectal Adenoma and Adenocarcinoma ^ 

prevents cancer invannn TW & ™ -mm 1 . . ,,u nereD J 



prevents c * " J * ,,,lA *n«anoproteases a™ hereby 

trajrfonnabo,, and stunularion »t «b.^k ™. kuUd I controversial,^**, alternative fiSm^tS 

reDOitfid both ««. ttk>tt> i » . ^ ^ vc Deen 



transformation and S ttoS^SX^£™ 15eB ^ 
onsr, demonstrated thai TIMP-1 fc^MeTin T 

adenocarcinoma TTMP i m dwa > ; cases or colorectal 

TlMP^I mRNA^trj - snowed good correlation between 

muscle actio or CD68 showed TTI^PlT.^jf^ 8 ^?" a P? OIh 



b^ween high _ protein levels and poor prognosis is known for ^ 
t»e-] plasminogen activator inhibitor (PAI-1)>^ 
the protease inhibitinB function n t . . t L ^ n ? d ? 1D 8 



. A prerequisite for cancer cell invasion and metastasis k .h, 

genesis 6-wiTiuc h ° k " eai ^ sta S es of wraori- 

^?^^Z h ^ mm i ** WmOT levels of 

uitc.aj cancer. - Similarly, a strong correJauon 



reputed both for - TMM as i^i^^J^ 

In order to better understand the role of TIMP-1 in colorenal 

nTX'r .f^r^T^ 63 mi * P^de some McSSa 
number of studies of the localization of TTMP r in^T^L-, 

^Z^r^^-^ ^colleagnesirepS^ 
TMP- J mRNA was expressed both in invasive a'denoSorT 

ment of colorectal adenocarcinomas in spindle-shaned <-».>ic Jl 
rounding me invasive cancer cells. ^Strf £ 
chemical studies of TIMP-) m colon axe also ewflSKe^ 

™£?te£ 2r™ ** '■™ tL1 W3S «Pr«sedTn & U,e c»" 
S^TnH^ baSemeDt memblanc * »on„al mucosa, 
adenomas and adenocarcinomas with only little staining of the 
neoplastic ! epnhdium. On the other band, Tomita and^eW^ 

SSf cob ^ was «5 ressed » stro ™" -d 5sX 

ma/ m ,r ^ f d . aden ° m ^ « well as in adenocarcino- 

mas, ,n which the neoplastic cells were strongly immunoreacUvt 
hi order toresolve these inconsistencies; we undertook our study 
^ate^;^"^ 0 " ^ ^-oW^ocbemis^ demon^ 
Saave^nf T PrCSS , ed J iD ^^blasts in the'srroma a. 
we mvasive front of colorectal adenocarcinomas Because TTMP 1 

ZX^SF ^ ™ CO,ore ^ wTeTal- 
uated the possibility of using TEvDM as a diasnostic tool to 

2E5£r" adMoraas fron ! Duk ^' -ST^S 



Materia) and methods 

Tissue samples 

Pit* * Hvidovl^rr iDC "l ded WaS ° b,ained bom y^'sity Hos- 
pital of Hvidovre (Copenhagen, Denmark) in accordance with a 

0i ™W°Z by ' he ,. j0Cal Scientific ccomit.ee (KF 

eiS V T^i r^' 31 Sa ™P les < foraialiD paraffin 
cotetclr L C p 0, ' eC,ed fr0m 1989 ,0 1993 Eluded Dukes' stage A 
ade "° caJCln0maS ( " = 8 > and c °'°'ec la i aclenomolous 
poj>ps \n T 6. o were pedunculated (J with mild and 2- with 



mSSS^S s ''""*" 2 ™* - wik » i"»»<»« i— ■ 
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moderate dysplasia) and 3 were sessile (1 with moderate and 2 
with focal ly severe dysplasia)]. Samples from 16 colorectal ade- 
nocarcinomas (1 Dukes* stage A, 6 Dukes* stage B, 8 Dukes*, stage 
C and 1 Dukes* stage D), 1 villous adenoma and 1 malignant colon 
lymphoma were prospectively collected during 1999-2000. These 
prospectively collected, tissue specimens were dissected so thai 
samples contained both normal mucosa and tumor tissue and were 
obtained within 30 min following surgical bowel resection. The 
specimens were immediately fixed in 4% neutral buffered formalin 
for 20^24 hr and then paraffin embedded. The 14 archival samples 
had also been formalin fixed and paraffin embedded. ' 

Generation of nonoverlapping TJMP-1 cDNA fragments by PCR 
The full length TTMP-1 cDNA (GenBank NM_0Q3254) cioned 
in pSP64 vector 29 was used as template to generate 2 nonoveriap- 
ping PCR fragments /or in vitro transcription, and named f 104 (bp 
56-378) and f 106 (bp 398-680). First, the whole insert (-780 bp) 



was cut out by digestion with Hind 111 and BamHX and punned 
after agarose gel electrophoresis using the Qiaex H gel extraction 
kit (Qiagen, Crawley, pnited Kingdom). To generate nonoverlap- 
ping antisense probes and the corresponding sense probe* 2 PCR 
fragments were generated using. Upstream' primers flanked by a 
linker sequence containing an EcoBl restriction enzyme site (un- 
derlined nucleotides) and a T3 polymerase binding sequence 
(boldface) 5 f - (oagaatt cattaaccctcactaaa£figasa)-3', and down- 
stream primers flanked by a linker sequence containing a Bamrll 
restriction enzyme site and a T7 polymerase binding sequence 
5'- (ggatcc taatacgactcact3tagEgag)-3' 9. The TIMP-1 specific up- 
stream primers were 5'- acccaccatggccccctttg -3' for fl04 and 5'- 
(linker) - gcaggatggactettgcaca -3' for fl06, and the downstream 
primers were 5 r - linker- actcctcgctgcggttgtgg -3' for f!04 and 5' - 
(linker) - tatctgggaccgcagggact -3; for fl06. PCR using the 2 fl 04 
primers of the 2 f 106 primers was done as previously described. 30 
The PCR products were purified by. column chromatography 
using S-200HR microspin columns (Amersham Pharmacia Bio- 
tech, Ina, Piscatway, HF), and their size tested by agarose gel 
electrophoresis. Both migrated as —300 bp fragments in accor- 
dance with the predicted size (322 and 282 bp, respectively). An 
ABI. PRISM 310 genetic analyzer was employed for DNA se- 
quencing analysis and was performed according to the manufac- 
turer's instructions (Perkin ElmeT, Applied Biosystems, Foster. 
City, CA) using the primers specified above. The DNA sequences 
obtained were confirmed by comparison with the specific TIMP-1 
cDNA nucleic acid sequence (GenBank NMJXB254). 

Plasmids containing human MMP-2 cDNA (pCo!720l , bp 647- 
1284) and human MMP-9 cDNA (pCol9202, bp 175 1-2326) have 
been described elsewhere. 31 

In vitro Transcription 

Antisense and sense riboprobes were labeled with ^ 5 S UTP 
(NEN, Boston, MA) by in vitro transcript ion using T7 and. T3 
RNA polymerases. (Roche, Basel, Switzerland). The DNA tem- 
plate was digested with DNase (Proraega, Madison, WI). Nonin- 
corporated 5;> S UTP and DNA was removed by column chroma- 
tography using S-200HR microspin columns (Amersham 
Pharmacia Biotech, Inc., Piscatway, NJ). The " 5 S activity was 
adjusted for every probe by dilution to 500,000 cpm/pJ. 

In situ hybridization 

Jn situ hybridization was performed essentially as described 
previously. 32 In brief, 3 \im paraffin sections were deparaffinued 
in xylene, by drat ed with graded ethanol and boiled in a microwave 
oven for 10-12 min in 10 mM citrate buffer, p)i 6.0. After 
additional 20 min at room temperature, the sections were dehy- 
drated with graded ethanol and die 35 S labeled probes (2X 10 6 cpm 
in 20 \l] hybridization mixture 3 ' per slide) incubated overnight at 
55°C in a humidified chamber. Sections were washed in I-Jellen- 
dahl chambers whh SSC buffejs containing 0.] % SDS and 1 0 mM 
DTT at 150 rpm at 55°C using a Buhler incubation shaker (Jo- 
hanna Quo GmbH, Hechingen : Germany) for 10 min in 2XSSC, 



10 min in 0.5 >< SSC, and 10 min in 0.2XSSC Sections were then 
RNase A treated for 10 min to remove nonspecifically bound 
ribpprobe. Subsequent wash* was performed in 0.2 X SSC as spec- 
ified above. Sections were dehydrated and soaked" into an autora- • 
diographic emulsion (Bfoxd), exposed for 5t7 days if not otherwise 
stated and finally developed- Sections were countersinked with 
■ haematoxylin and eosin. 

Jmmunoperoxidase staining 

Immuriohistochernistry was performed essentially, as described 
previously.^ 2 Five . micrometer paraffin sections were deparafr 
finized with xylene and hydrated through ethanol/watex dilutions. 
Tissue pretreatment was performed with protease-K (5 pg/ml) 
digestion for 20 min. Sections were blocked for endogenous per- 
oxidase activity by treatment with 19E» hydrogen peroxide for 
15 min. The sections were washed in 50 mM TriS J 50 mM Nad, 
pH 7.6, contaming 0.5% Triton lX-1 00 (TBS-T). Incubation with 

-anftBpdSe^-w^^Grte-oyernigbt at 4°C. Sheep polyclonal antibodies 
(pAb) against TTMP-1 and nonimmune goal IgG were used at a 
final concentration of 41) u.g/ml. Two monoclonal antibodies 
(MAb) against TIMP-1 , 33 NM4 (clone TTTX6A, NeoMarkersv Fre- 
mont, CA) and CalB2 (clone 147^601 1, CalBioheni Oncogene 
Res. Products, Cambridge, MA), and a JylAb against trinitxo- 
phenyl (TOP) 3 * were all incubated at 1.0 p.g/ml (all 3 MAbs are 
lgGl). CalB2 MAb recognizes both free TIMP-1 and TTMP-1 
in complex with MMPs. 33 NM4 MAb only recognizes free 
TIMP-1- 33 According to the manufacturer's descriptions, bpth 
MAbs are raised using recombinant human TTMP-1. The. sheep 

. polyclonal antibodies were raised by immunization with TTMP-1 
purified from human dermal fibroblasts. The IgG was obtained by 
triple precipitation using ammonium-surf ate and characterized by 
immunodiffusion and rocket immunoelectrophoresis. 35 In addir 
tion, we have shown that the pAb recognize both free and MMP- 
coraplexed TTMP-1 - 36 Furthermore, the specificity of the antibod- 
ies was analyzed by Western blotting analysis against recombinant 
human TTMP-1 expressed in NSO mouse myeloma cells. Here, the 
antibody preparation recognizes a band of approximately 28 kDa 
in accordance- with the molecular weight of TTMP-1. To certify 
that the pAb recognize TTMP-1 in colon tumors, the antibodies 
were immobilized on a sepharose column. Total protein extracted 
from 3 colon adenocarcinomas was passed through the column 5 
times and the bound and subsequently eluted protein analyzed in a 
Western blot using a TTMP-1 monoclonal antibody (MAC 15). A 
single band of approximately 28 kDa was revealed in accordance 
with the molecular weight of TTMP-1 (results not shown). In 
immunohistochemistry, the sheep pAb were detected with biotin- 
ylated rabbi t-anti-goat IgG, which* cross-react with sheep IgG 
(1:100, code E466, DakoCytomation) followed by horseradish 
peroxidase in complex with streptavidin (code K377, DakoCyto- 
mation). The MAbs were detected with the Envision-mouse re- 
agent (En Vision reagent, K4003, DakoCytomation), followed by 
tyraroine amplification, using biotinyl tyr amine substrate as- spec- 
ified by the manufacturer (Nen, Boston, MA). Sections were - 
developed with NoyaRed substrate as specified by the manufac- 
turer (Vector Laboratories, Burlingname, CA) for 15 min. Finally, 
sections were counterstained in Mayers haematoxyliri, dehydrated 
in ethanol and mounted. 

Combined in siru hybridization (\nd immunohistochemistry 

Double labeling by combining in situ hybridization and immu- 
nohistochemistry on paraffin sections has been described previ- 
ously. 32 In brief, using MAb against a-sm-aclin (clone 1A4) 
diluted 1:1000. against cylokeratin (cloDe AE1/AE3) dijuted 
■1:1000, or against CD68 (clone PGM1) diluted 1:200, sections 
were incubaied for 2 hours at room temperature and then delected 
with anti-mouse-lgG/horse radish peroxidase-conjugated polymers 
(Envision-mouse reagent, DakoCytomation, Glostrup. Denmark). 
Sections were developed with di ami nobenzi dine (DAB) for 
7-10 min, and immediately dehydrated for in situ hybridization, 
which was performed as described above using the antisense 
probes of O04. Sections were counterstained with hacrnatoxyhn. 
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Results 

Analysis oj TIMP-1 probes and antibodies for in situ 
hybridization and immunohislochemistry 

Hktopatholoeical diagnosis of F ospectively collected speci- 
^JT, 'l^ 0 ^ lesi <™ revealed 16 colorectal aoenocarl 
SEES h vll . ,0BS / den °™ *> d 1 malignant lympho™. «s. 
^^~and sense RNA probes were generated byinym-,, 

SSjf ,eSed by , <n ^dization on adjacent 
sectors from 5 of the colorectal adenocarcinomas. The 2 anrisei.se 
iS t° W *1 t/*"?* W-Mb-to pattern in all the 5 
S eel !h ! l c °?f*m>em surrounding the invading 

a* n ^•f°. spe , c,fi K C SIgDaI was S?eB with "»» 2 
Danied bv^TIMpT whether the TIMP-1 mRNA was accom- 
parued by TIMP-1 protean expression, itnmur«>hkochernistry was 
prfonned on 8 of the adenocarcinomas (including teTm™ 
-J^^^t^S^m-hL , using ™ 

TIMP1 HT 3 1 an6bdd,es on *^ons adjacent ,o 

ilMP-1 .» s tu bybndrzed sections. The TIMP-I mRNA and 

°T f*,™*"** «* nialignan, lymphoma. Ue 
ant.-TIMP-l polyclonal antibodies did not react with other cell 

tZt' b0 "Vl ° f 8 ^^-as and the rTaligZuym- 

^JST" apical iurface - Two MAbs 
against JJMP-] (CalB2 and NM4) required stron° signal amolifi 

S ^ftf™**. FfW- ™* only exception^, 
neiuier.of the 2 MAbs stained the luminal apical surface of the 
norma! and malignant epithelium; No sign/w£ obtirfed with 

<hrcc,ed a ° a,ns, *e synthetic hapten trinitrophenyl (TNP). 
m/W/i expression patterns in colon cancer 

" ■ co,oreclal lemons by m am hybridization. TIMP-1 mRNA 



expression was in all the cases of colon adenocardnorna (includin, 

cells located at the invasive front (Fig. 3a,<i). TIMP-1 mRNA 
signal Was also observed in fibroblast cells ^tocated^ 
omor stroma towards the colonic lumen in 8 of iSSS, whe^ 
A» uooe sttucture was present (data not shown). N Tor & 
"f T m * E ^°«nU part of the carcSonT 

™pma * B ^ ?* reettI ^ocarcinomas, we observeTrK 
mRNA S1 gnal in some fibroblast-lflce cells located aroundfce 

uZ L y " " SOme ^ ,eries ,ocated « the submucosa dTstaw 
.ZL C3I,Ca * ta - » om ">' clonic mucosa, includino^ 
lamina propria that was presentin all samples tested was »enerX 

was detected in stroma] /ibrobJast-like cells surrounding one S a 
nonnaI W (R& 3c/) in 3 out of 6 cases m££ 
extended exposure time (10 days vs. usually 5 days). I„ the vilC 

y^rwnintKsr^ (data hot shown). In the malignant 

cells were, different from the adenocarcinomas, located in a dif 
pattern throughout the whole tumor. No.TIMW mRNA 

SSSh C r* m 2 d " "» ° f ^ 18 » *e cancels 
smooth muscle cells or vascular cells. - 





bndrzanon signal is idcntificA ^ w,JE • ' ( ^ J ' n ' hy- 
The 2- mtisense nl cs ,^T" tm '» illumination (M). 

whereas no sperific si»nal k S ^ ,e , ceiis ( a,TOWS ™ b and ,/). 
St) surrounding the jVaVrti 'w » '"^ SUOaa (»' di "'«» bv 



co^idte T m ^fiKl be h , TIMP:i mRNA "^blast-like cells . 

could be (myo)fibroblasts and/or macrophages, sections from 4 
colorecuU adenocarcinomas and the maBgn°an. lymphla were 
6rst .mmunohistochemicaJly stained with, antib^es directed 

• V - T^ 8 (fOT de,ec ««> n of macrophages) and subse- 
quently incubated with a TIMP-1 mRNA antisense probe h 
normal co on ussue a -sm-acUn is expressed by vascuWsmooA 

colon^mo^ ■ - S We " aS ^ofibroblasu A 

" n« a ; Sm - a = ,ln ' s e3t P««ed by tumor-associated Hbro- 
h G h ce ! ,s 1""'^ thipnghput the tumor stroma, which are 
definedas myofibroblasts. No TIMP-1 mRNA was detected in a^y 

ve"Sis^h"e EST Smt *, ,h . raus <* ! .inehding uiose of d 
T„ ^ h ,amm » """sculans mucosae and the tunica mnsculafis 
h addrnon, no TIMP-] mRNA was detected in the a-sm-aSn 
I P^f^W myofibroblasts of the lamina propria in any of . 
a sm i, " 00S : TiMp - i ,B RNA signal, was- in contrast seen in 
.nv^ JL ,n T aSSOCia,ed myofibroblasts located at rhe 
more m^To9 rt^l '"^T 3 ° f ^ adenocarcinomas, 
^ in^- °f TIMP-1 mRNA-positive cells located close to 
™T ^i. CanCer Ce " S werc «-sn>acUn-posi.ive (Rg. 4) 
TrMP-1 1 mRNA positive fibroblast-Vike cells located more dis.an 

IME^rnr T C V C , e,,S ; ,0Wards ^ »*™*«.. expressed 
a " s ni-act.n. In 1 adenocarcinoma and in the malignant 

• STd aa P prox,ma,e jy 50 *of ««« T1MP1 mRNA positive^ells- 
coSurute a r 3 ^ V be 71MP - J mRNA «^»S cells 
caTd aTL • P ° P0) < a "°" 0/ ,um °">"ociated myoLobiws Jo- 
could t % A0 "' ° f ^ n,m0r - No ™ 1I> - 1 ™RNA s,gna. 
could be identified .n any of the CD68- P osirive cells (Fig. 4) 

Expression cf T1MP-J and MMP-2 and 9 in colon cancer 

invas^rVw Mh 7' 9 m2f yt*™ collagenases expressed in .be 
UTvas.ve cancer „ssue of colorec.a] adenocarcinomas. Thus 

»^ cZt Z rep0 ',^ '° "* CXpieSSed ^ fibioblast-lilce cells 
leaden. eX nfT 1 - - 3 " d MMP ' 9 * ms ""p^ Ses a. d,e 
exnres S ,il ° ,,? .TT To *rec.ly F compare the • 

expression patterns of MMP-2 and MMP-9 with thai of TIMP- 1 
adjacent section* from .5 colorectol adenocarcinomas were Zb,id- 
ZSZT" f °' TO1P1 - MMP 2 - d MMP-9 mRNAs. We 
actenshcaUy at Ae invasive front of the growin" lumo, whereas 
° MAJP - 2 " ,RNA W3S m « -'ense rhetemral 
(M = . MMP-9 „ lRNA expi . cssirig cells were found m the 
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Figure 2 -in situ hybridization and irnmunohistocbermstry for TIMP- ] in human colon cancer. (^) Two.adjacem sections from a human "J?" 
adenocarcinoma were incuba!ed with polyclonal antibodies against TIMP-1 (a,c) and a TTMP-1 mRNA anrisense probe (fr). The TIMP-1 
immunoreacrivity (red-brown color, arrows in a and c) and the TIMP-1 mRNA (silver grains, arrows in b) are identified in the same cells (arrows 
in o.b). Immunoperoxidase staining with the TIMP- 1 pAb reveals the TIMP- 1 -positive cells as fibroblast- like cells (arrows in c) located in the 
stroma (St). No TIMP-1 immDnoreactivity is seen in cancer, cells (Ca). a,b: bars = 50 p.m; c: bars =13 p.m. {B) Four consecutive adjacent 
sections were incubated with CalB2 MAb anti-TlMP-1 (a) ; NM4 MAb (b\ sheep ami TIMP-1 pAb (c) or mouse ami TNP (t/)- The 3 MAbs 
were detected with Envision reagent followed by TS amplification and the sheep pAb with biotinylated rabbit anti-goat followed by 
HRP-conjugated stieptavidin (sec Material and methods). The 3 TIMP-1 antibodies react with the same cells (arrows). No irnmunoveactivity is 
seen when the sections are incubated with anti-TNP. 



invasive front like those expressing TIMP- 1 mRNA but with a 
distinctly different distribution. Foci with high expression of 
TIMP-1 mRNA were not accompanied with increased expression 
of MMP-9 mRNA and vice-versa (Fig. 5*). Thus, TIMP- 1 mRNA 
expression is not coregulated with MMP-2 or MMP-9 mRNA 
expression. 

TJMP-1 in adenomas and Dukes' stage A carcinomas 

TIMP-1 antigen can readily bc measured in blood and we have 
previously reported that levels of TIMP- 1 in blood are significantly 



elevated in colorectal cancer patients compared to healthy donors 
and that high plasma TIMP-1 levels are associated with short 
survival of colorectal cancer patients. 2 '* 36 TIMP-1 has therefore 
been suggested to be a novel marker for detection of early stage 
colorectal cancer and for prognostic stratification of colorectal 
cancer patients. 21 39 These findings, together with the characteristic 
expression pattern of TIMP-1 al the invasive front of virtually all 
the colon cancers and die absence or minuie TIMP- 1 expression in 
normal and benign colon mucosa, prompted the evaluation of 
TIMP- J expression as a marker for early invasive colon cancer. 
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^BofU* cancers (Ca) and ^™fc ' CaDCCT <«A wbS at J*, 1 ?** ! » » 




Ficube 4 -Double labelins forTIMP-1 mRNA ft . . " '— v 

SS-^ Sections were fi , SI processed for immuno . 

Therefore, tte compared TlMP-i ,„». „ 
lionaj 6 colorectal adenomV.™,. , NA "P r « s '°>> '" an addi- 
additiona. 8 Dules" s^e A rn ^ * e " p,essio » in ™ 

fibrobterliL «llsTo °" : aS . C ° n l fi " ed 10 a »>S»e focuTin 
«r* 6). Howe^e h °Z 1"? ^ >^""^>o n 
".is sample cleaMv rev If, 1 " °' 3df, " iona ' 

'sessile type), a few ^ " A P ° K Si ' ivC a * nw »» 

loomed around small™ ^ «pressm s fibroblasts were 
area (data no , show!^ n ™ 3SS ~ 1 «* ruo.br 



sbo^X r'mRNA « 9D " - K ^ A ca >cinomas analyzed 
'he invasive fro ,- of h n " p,CSS,on myofibroblasts located a, 
i» on), 3 o/ 7 ade^i^"^' 5 - ? e '" S e<preSSio " *™ de,EC,ed 
cells Stated ^ nT ^^ ^ T '" 
interface i n 2 of the c 'se . \ T» T a, .* c e P'"«l'a)-s.romal 
case. d W " h a, te " es ,he submocosa in 1 
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Figure 5 - In situ hybridization for TIMP-1, MMP T 2 and MMP-9 in 
human colon cancer. {A) Adjacent sections were incubated with probes 
specific for TIMP-1 mRNA (a,c) and MMP-2 mRNA (M), respec- 
tively, and is shown in brigbtfield \a,b) and dark field illumination 
(c.d). The TTMP-J mRNA signal increases towards the subroucosa 
(Sm) whereas the MMP-2 mRNA signal decreases and is most intense 
in the central areas (Ca). {B) Adjacent sections were incubated with 
probes specific for TIMP-1 mRNA {a,c) and MMP-9 mRNA (b f d), 
respectively, and is here shown in bright field (o.fc) and darkfield 
illumination ic,d). Both the TlMP-J mRNA signal and the MMP-9 
mRNA signal are most intense at the invasive front towards the 
subinucosa (Sm), but their expression patterns are quite different, wiih 
MMP-9 showing die most restricted expression. Bars = 100 jim. 



cDNA fragments and specific pAb and MAbs against human 
TIMP-1. The TIMP-1 mRNA signal in nl] colorectal adenocarci- 
nomas invesiigated was seen in fibroblast-like cells located in the 
tumor periphery. An identical hybridization partem was observed 
with the 2 antisense TIMP-1 probers and application of comple- 
mentary sense probes on neighboring tissue sections as negative 
controls did not result -in any hybridization signal; therefore, we 
conclude that the hybridization signal generated with the nnti sense 
probes represents the genuine TIMP- J mRNA. TlMP-J immimo 
reactivity was aJso distinctly located in fibroblast-like stromal cells 
injhe jomor periphery, and these cells were identified to be the 
same cells as the T1MP- 1 mRNA expressing cells. A piepaiation 



of sheep pAb against human TIMPr) 35 and 2 well-characterized 
MAbs stained the very same cells in the tumor stroma.. Weak 
staining of the apical surface of some normal and- malignant 
epithelial cells was observed with the pAb in some of the samples. . 
No staining was obtained when me-anti-TIMP-1 antibodies were 
substituted with nonimmune goat serum or anti-TNP MAb incur 
bated at the same concentrations. These iinmunohistcK&erri . 
findings strongly suggest that the: TIMP-1 antigen detected in the 
fibroblast-like cells represents the genuine TI)vjP-l protein. 

In our study, we found HMP-1 mRNA expression in stromal 
fibroblast-like cells located in the tumor periphery in all colorectal 
adenocarcinomas tested, whereas no expression was detected in. . 
the. cancer ceils in. any -of the. cases tested.' This finding is in 
agreement with studies by Zeng and colleagues, 12 - 20 but is partly m 
disagreement with findings by Newell and 5 colleagues. 27 In addi- 
tion to TIMP-1 mRNA signal in fibroblast-like cells in the tumor 
peripher y, Newell a nd colleagues 27 detected a weak TIMP-1 
mRNA signal in both benign and malignalt epithelial cells. 27 This 
observation "was, however, based on the use of probes: from, a 
. single TIMP-1 cDNA subclone and no additional controls to verify 
the expression pattern. The difference between our results and- 
those of Newell and colleagues may be explained by methodolog- 
ical differences, since the procedure employed, by Newell and 
colleagues was in several steps different from the one used in the 
present study, e,g. t Newell and colleagues used 3 p T labeled probes, 
whereas we used 35 s ^labe!ed probes. It is in this context notewor- 
thy that in order to look for a low expression level of TIMP-1 
mRNA, we performed in situ hybridization experiments with pro- 
longed exposure time (10 days vs. usually 5 days) with both our 
TIMP-1 antisense probes and both TIMP-1 sense probes, but with 
this challenge we did not detect any TIMP-1 mRNA in any 
epithelial cells. It cannot be excluded though that the TTMP-1 
mRNA is expressed in epithelial cells below the detection limit of 
our in situ hybridization procedure. 

An interesting observation in our study was the characteristic 
intense TIMP-1 mRNA and protein expression in the tumor pe- 
riphery of all colon adenocarcinomas, while little or no expression 
was seen in the center of the carcinomas. Only in the colorectal 
lymphoma did we find TIMP-1 mRNA and protein expression in 
fibroblast-like cells located throughout the tumor tissue. The 
TIMP-1 expression pattern in the colon adenocarcinomas is in 
contrast to the expression pattern reported by Hewitt and col- 
leagues, 19 who found that the TIMP-1 staining in most colorectal 
adenocarcinomas was equally intense in fibroblasts throughout the 
tumors and thai some of the cases even showed decreased TIMP-1 
signal intensity towards the tumor periphery. This difference may 
be explained by possible cross-reactivity of the polyclonal anti- 
bodies employed by Hewitt and colleagues or that Hewitt and 
colleagues employed cryostal sections, while we analyzed paraffin 
sections. * . 

The TIMP-1 expressing cells had a fibroblast-like morphology 
and using combined in situ hybridization for TIMP-1 mRNA and 
inimunohistochemistry for a-srh-actin, we found that many, gen- 
erally more than 50%, of the TIMP-1 mRNA positive cells coex- 
pressed a-sm-actin. According to the cellular morphology of the 
TIMP-1 expressing cells and their localization in the invasive 
front, we could conclude that the cells were myofibroblasts and not 
smooth muscle cells. 

The myofibroblast is a cell type present in the norma) colon 
mucosa, that originally was described as a pericryptal fibro- 
blast 3740 and later was identified with antibodies agamst ct-sm- 
actin. ?7 In the lamina propria, the myofibroblasts form a continu- 
ous cell layer just below the iniestinal epithelium. Hie pericryp 131 
myofibroblasts are phenotypically different from die neighboring 
quiescent interstitial fibroblasts dint do not express markers ol 
smoodi muscle cells. 4 ' During early steps of colonic tumorigenesis 
the number of myofibroblasts is significantly increased 41 The 
TIMP-1 expressing myofibroblasts may be generated after activa- 
tion of ihe pericryptal" myofibroblasts and/or the quiescent inter- 
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sbtial fibroblasts- Adegboyega and colleagues^ hypothesized thai 
the tumor-associated myofibroblasts originate from the quiescent 
interstitial fibroblasts of the lamina propria, rather than from peri- 
crypial myofibroblast or smooth muscle cells, which may help to 
explain why we found some of the TIMP-l mRNA express^ 
fibroblast-bke ceils a-sin-aain-positive and some «-sm-actin-neo! 
arjve. e 

Several MMPs including MMP-2, MMP-11 and MMP- 14 are 
expressed by fibroblast-like cells in human colon cancer W*--" 
some of which may indeed be myofibroblasts. The role of the (mvo-) 
fibroblasts in colon cancer progression is not known. Since TBvIP-1 in 
human colon cancer appears only to be expressed by fibroblast- like 
cells most of which are myofibroblasts, and since high TIMP-l levels 
measured ,n blood or tumor extracts from colon cancer patients are 
?K°T y ^ Ia . ted With 3 ^ Prognosis,^^ it could be argued 
that .he TIMP-l expressing myofibroblasts- play a mmcr-promotin- 
role^ lmmunohistochemical localization studies of proteins involved 
in the activation and regulation of the efficient serine protease plas- 
minogen, including urokinase plasminogen activator (uPA) and its 
specific inhibitor PAJ-1 show that both are mainly expressed by 
myofibroblasts in human breast cancer*^ Hi 2b levels of nPA and 
Snr0, V £lY COTTdaIed w »h Poor prognosis in breast can- 
cer, suprjomng the assumption that the myofibroblast express a 
promoting role ,n cancer invasion. We recently ,eponed that the 
predominant PAl- 1 expressing cell in human colorectal cancer also is 
r^-i 0 . f ^ °" d " Iiier studies ,nd,C3,ed *ai elevated levels 
01 ™' "J CO T C3ncer P 31 ™ 15 m nssociated with poor pioono- 
sis.- Together these findings indicate Uia. myofibroblasts are .sirorHv 
contnbuiin- to the expression of proteins involved m the re"i.lat,W 
extracellular maim degrading proteases that facilitate cancel invasion ■ 
and metastasis. 

A panicL-larly interesting finding of the present S i ud y w as the 
absence of TJMP-J mRNA in 4 of 7 adenomas, whereas m all of 

fibroblast- ,fce cells along the invasive from. In the 2 benign lesions 
in which -he TIMP-l mRNA wns seen in the adenomata X 



TIMP-l mRNA positive cells were confined to a single focus with 
locally increased inflammation related to the dysplastic epithelium. 
Evident disruption of the dysplastic epithelium was observed in the 
adenoma with most intense TJMP-1 mRNA signal. Intestinal in- 
flammation may be caused by disruption of the mucous epithelium 
that leads to focal leakage of mucinous colon material into the 
lamma propria. Increased intestinal permeability is a common 
deficiency in Crohn's disease and interestingly TIMP-l mRNA 
was found in the intestinal granulation tissue of Crohn's disease 50 
and is expressed by myofibroblasts isolated from Crohn's dis- 
eased Induction of TIMP-l in myofibroblasts in a benisn or 
preinvasive tumor may also be a response to locally increased 
MMP activity or a response to the presence of a specific MMP in 
the local microenvironrnent. MMP-2 and MMP-9 mRNA expres- 
sion, however, did not appear to be coregulated with TIMP-l 
mRNA expression in the colorectal adenocarcinomas. Specific 
MMPs may indeed be involved in the transition of noninvasive to 
invasive disease; in studies of preinvasive lesions (ductal carcino- 
mas m situ) of the human breast we recently reported that MMP-1 3 
is specifically expressed in myofibroblasts associated with micro- 
invasive events.^ Future srudies may clarify whether TJMP-1 
expression m colorectal adenomas is correlated with expression of 
specific MMPs, cytokines and/or giowih factors, such as TCF-0J 
and TOF-(525l, and whether TlMP-lcan be used as a hisiopatho- 
logical marker for malignancy in colorectal tumors. 
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Neu oncogene expression in ovarian tutors: a quantitative study. 
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Real-time quantitative RT PCR of cyclin D 1 mRNA in mantle cell 
lymphoma: comparison with FISH and immunohistochemistry. 

IMP, HoweJG , Crouch J, Nimmakavalu M Qumsiveh MB. TaUini G Flynn SD 
Smith BR . ' — • ' 

Department of Laboratory Medicine, Yale University School of Medicine 333 Cedar 
Sfreet, P.O. Box 208035, New Haven, CT 06520-8035, USA. ' 

Presence of the balanced translocation t(l 1 ; 14)(q 1 3;q32) and the consequent 
overexpression of cyclin Dl found in mantle cell lymphoma (MCL) has been shown to be 
of important diagnostic value. Although many molecular and immunohistochemical 
approaches have been applied to analyze cyclin Dl status, correlative studies to compare 
different methods for the diagnosis of MCL are lacking. In this study, we examined 39 
archived paraffin specimens from patients diagnosed with a variety of 
lymphoproliferative diseases including nine cases meeting morphologic and 
immunophenotypic criteria for MCL by (1) real-time quantitative RT-PCR to evaluate 
cyclin D 1 mRNA expression; (2) duai fluorescence in situ hybridization (FISH) to 
evaluate the t(l 1;14) translocation in interphase nuclei; and (3) tissue array 
imrnunohistochemistry to evaluate the cyclin Dl protein level. Among the nine cases of 
possible MCL, seven cases showed overexpression of cyclin Dl mRNA (cyclin Dl 
positive MCL) and two cases showed no cyclin Dl mRNA increase (cyclin Dl negative 
MCL-hke"). In six of seven cyclin Dl positive cases, the t(l 1;14) translocation was 
demonstrated by FISH analysis; in one Case FISH was unsuccessful. Six of the seven 
cyclin Dl mRNA overexpressing cases showed increased cyclin Dl protein on tissue 
array immunohistochemistry; one was technically suboptimal. Among the two cyclin Dl 
negative MCL-hke cases, FISH confirmed the absence of the t(l 1 ; 14) translocation in 
both cases. All other lymphoproliferative diseases studied were found to have low or no 
cyclin Dl mRNA expression and were easily distinguishable from the cyclin Dl 
overexpressing MCLs by all three techniques. In addition, to confirming the need to 
assess cyclin D 1 status, as well as, morphology and immunophenotyping to establish the 
diagnosis of MCL, this study demonstrates good correlation and comparability between 
measure of cyclin Dl mRNA, the 1 1;1 4 translocation and cyclin Dl protein. 

Publication Types: 

• Evaluation Studies 



PMID: 12952233 [PubMed - indexed for MEDLINE] 



448: CancerRes. 1994 Jan 1 5;54(2):539-46. . RelatedA^T^ 

S^ a l i0n ? tOp0isomerase 1 messenger RNA, protein, and catalytic 

act «y <y .» human tumors: demonstration of tumor-type specificity and N 

implications for cancer chemotherapy. P "«ciryand 

Emajnl, Mojlier^/SdgjerJJF, Bcsterman JlVf - - 

S^ST 0 ^' 013X0 ^ Search Triangle Pa*, . 
Topoisomerase I has been identified as an intecellular target of camptothecin aplant 

eels to this drug ,s dl rectly related to the topoisomerase I content In humans th S 
of topoisomerase I have been shown to be elevated in colorectal taSTSS^^ 
normal colon mucosa. The aim of our study was to determine whethSpo^rase 
bve s are elevated in other solid tumors, (b) the elevated enzyme is cSjKESST 

esulfof Tr ^ (C) inCTeaSe in to P° is ^^ I 'evels rcolorectaS is a 
result of increased transenphon or translation..Topoisomerase I levels were quantised in 
crude extracts from colorectal, prostate, and kidney tumors and their ™SSS5S 
count^parts by Western blotting and by direct determination ofS^SS^ 

rumors snowed 5-3 5 -fold increases in topoisomerase I levels, compared to their normal 
colon mucosa In the case of prostate tumors, the increase was 2-lS-fo?d com3™ m 

llZ7^X^TZ St 1 leVdS ™ k ' idney tmaon > COm ^ to the " «o™al 
coun^rparts. The catalyt.c activity of topoisomerase I was determined by a quantitative 
32P-transfer assay m crude homogenates, without isolating nuclei Colorectal mf 
prostate fcmors exhibited 1 1-40- and 4-26-fold increases, respectle^L^c ■ 

™ L ?T er ' kld " e , y tUm ° rS did n0t Sh ° w anv alterati0 « - acivS 
compared o the* normal matched samples. Thus, for all three tumor Types there was a 
good correlation between enzyme levels and catalytic activity. FmaUy^toreSltamors 

found m colorectal tumors, compared to normal colon mucosa. These results suI^tTat 
toons ,n topoisomerase I expression in humans are rumor type ^ZSSfo 
ncr se ,„ topoisomerase I levels results from eitl.er increased Ascription of tie 
topoisomerase I gene or increased mRNA stability ' 
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High-level mRNA quantification of proliferation marker pKi-67 is correlated 
with favorable prognosis in colorectal carcinoma. 
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GeZny^ KHnik ' ^ ^ ^ aS * esio, °g ie > Schmerztherapie und Intensivmedizin, Saarlouis, 

PURPOSE: The present study retrospectively examines the expression of pKi-67 mRNA and 
prote.n m colorectal carcinoma and their correlation to the outcome of patfents MCTHOD^ 
mmunoh.stochem.stry and quantitative RT-PCR were used to analyzed expre^To^ of pKi-67 
in 43 archival spec.mens of patients with curatively resected primary colorect^Tnoma who 
were not treated with neo-adjuvant the^ 

labelmg mdex of 3 1 3% (range 10.3-66.4%), and a mean mRNA level of 0 IWWKCm 
range 0.01-0.69)- indices and levels did not correlate. High pKi-67 mRNADeltaCm v22 ' w .r. 
SK" 9 Arable prognpsis, while'p^ i£S£££? ^ 

T ? T r &n ° SUC ° UtC ° me - A mu,tivar i^ ^lysis of clinical and biological factor 
indicated that tumor stage (UICC) and pKi-67 mRNA expression level we e iSrteST 
prognose ^ CONCLUSION: Quantitatively determined pKi-67 ^A^teTLd and 
new prognose ind.cator for primary resected colorectal carcinoma 8 
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Developmental regulation of acidic fibroblast growth factor (aFGF) 
expression in bovine retina. 

Jacquemin E , JonetL, OliverL, jfrigraK, t^urentM, CourtojsY, Jeannvjg. 

Unite de Recherches Gerontologiques, U. 1 18 INSERM, Paris, France. 

Acidic fibroblast growth factor (aFGF) is a signalling molecule implicated in a wide 
variety of biological processes such as cell growth, differentiation and survival It has 
been punned from bovine retinal The present study was carried out to detect which cells 
m the bovine retina expressed aFGF at the different stages of embryonic and post-natal 
development. The specific aFGF mRNA and protein were detected by in situ 
hybndization employing riboprobes and immunocytochemistry using affinity purified 
polyclonal human recombinant aFGF antibodies respectively. No signal was detected by 
either technique until 4-5 months and then there was progressive expression of aFGF 
with terminal morphogenesis of the retina. By 8-9 months of embryonic development 
nuclei of the 3 neuronal layers (ganglion cell layer, inner and outer nuclear layers) were 
all uniformly and intensely labeled. A slight labeling of the pigmented epithelium of the 
retina was also visible throughout development and maturation. These results showed a 
good correlation between message and protein expression in these cell types. In contrast 
glial cells m the nerve fiber layer and vascular endothelial cells displayed a nuclear 
lmmunostainmg for the protein in the absence of message. These data suggest that aFGF 
plays a role in the late steps of retinal differentiation by autocrine and paracrine 
mechanisms. 
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the p21(Cipl) protein, a cyclin inhibitor, regulates the levels and the 
intracellular localization of CDC25A in mice regenerating Overs. 

jaimgJl PujolMJ, Serratosaj, PgntojaC, ganela N, Casanovas O. Serrano M 
Agcll N. Bachs O. ' 

Department of Cell Biology and Pathology, faculty, of Medicine, Institut d'Investigacions 
Biomediques August Pi Sunyer (IDIBAPS), University of Barcelona, Barcelona, Spain. 

Liver cellsfrom P 2l(Cipl-/-) mice subjected to partial hepatectomy (PH) progress into 
DNA synthesis faster than those from wild-type mice. These cells also show a premature 
induction of cyclin E/cyclin-dependent kinase (CDK) 2 activity. We studied the 
mechanisms whereby cells lacking p21(Cipl) showed a premature induction of this 
activity. Whereas the levels of CDK2, cyclin E, and p27(Ki P I) were similar in both wild- 
type and p2l(Cipl-/-) mice, those of the activator CDC25A were much higher in 
p21(Cipl-/-) quiescent and regenerating livers than in wild-type animals. Moreover, 
p21(Cipl-/-) cells also showed a premature translocation of CDC25A from cytoplasm 
into the nucleus. The ectopic expression ofp21(Cipl) into mice embryo fibroblasts from 
p21 (Gipl -/-) mice decreased the levels of CDC25A and delayed its nuclear translocation 
The levels of CDC25A messenger RNA in p21(Cipi-/-) cells were higher than in wild- 
lype cells* suggesting that this increase might be responsible, at least in part, for the high 
levels of CDC25A protein in these cells. Thus, the results reported here indicate that 
p21(Cipl) regulates the levels and the intracellular localization of CDC25A We also 
found a good correlation between CDC25A nuclear translocation and cyelin E/CDK2 
activation. In conclusion, premature translocation of CDC25 A to the nucleus might be 
involved m the advanced induction of cyclin B/CDK2 activity and DNA replication in 
cells from animals lacking p21(Cipl). 
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Alteration of frizzled expression in renal cell carcinoma. 
Janssens N , Andries L. Janicot M, PereraT, Bakkcr A . 
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To evaluate the involvement of filled receptors (Fzds) in oncogenesis, we investigated 
san^ CXP .T 0n ,eVdS ° f SCVeraI hUman Fzds * m0re *« 30 diffe ^ human tumor 

Z^mTT^S^ ^ tissuesam P les > ^real-time 
quantitative PCR. We observed that themRNA level of Fzd5 was markedly increased in 
8 of 1 1 renal carcmoma samples whilst Fzd8 mRNA was increased in 7 of 1 1 renal 
carcinoma samples. Western blot analysis of crude membrane fractions revealed that 
lit P T Pxr? r , eSS T ih * e matched fc^noonal tfdney samples correlated with the 
observed "^A level: Wn^eta^atenm signalmgpamway activation was ^ 
the m Cre a Se d express™ of a set of target genes. Using a kidney tumor tissue L7 Fzd5 
potent express™ was investigated in a broader panel of kidney tumor samples FzdS 

T 8 7 s • Ct ? ted * 30% ° f dear cel1 ^onaas, and there was a strong 
exnls on" 0 W f f" U ClCar T 1 " ^ u St3ining in thC Samp,eS - 0ur data su ^ d *at altered 
SKI T ^ fS ° f thC Fzd &miIy » 311(1 their ^stream targets, could 
prov,de alternative mechanisms leading to activation of the Wnt signaling pathway in 
renal carc.nogenes.s. Fzd family members may have a role as a biomarker 
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Abstract 

To evaluate the involvement of fri22led receptors (Fzds) 
in oncogenesis, we investigated mRNA expression lev- 
els of several human F2ds in more than 30 differ- 
ent human tumor samples and their corresponding 
(matched) normal tissue samples,. using real-time quan- 
titative PCR. We observed that the mRNA level of Fzd5 
was markedly increased in 8 of 11 renal cSrcinoma sam- 
ples whilst F2d8 mRNA was increased in 7 of 11 renal 
carcinoma samples. Western blot analysis of crude 
membrane fractions revealed that FzdS protein expres- 
sion in the matched tumor/normal kidney samples corre- 
lated with ibe observed mRNA level. Wnt/p-catenin sig- 
naling pathway activation was confirmed by the in- 
creased expression of a set of target genes. Using a kid- 
ney lumor tissue array, Fzd5 protein expression was 
investigated in a broader panel of kidney tumor samples. 
F2d5 membrane staining was detected in 30% of clear 
cell carcinomas, and there was a strong correlation with 
nuclear cycliri Dl staining in the samples. Our data sug- 
gested that altered expression of certain members of the 



Fzd family, and their downstream targets, could provide 
alternative mechanisms leading to activation of the Wnt 
signaling pathway in renal carcinogenesis. Fzd family 
members may have a role as a biomarker. 

'■ Copyright © 2004 Si Kirgtt AG, Basel 

Introduction 

The Wnt signaling pathway is evolutionary conserved 
and controls many events during embryonic develop- 
ment. Members of ihe Wnt gene family of secreted glyco- 
proteins are involved in embryonic induction, generation 
of cell polarity, cell proliferation and the determination of 
cell fate [1, 2]. Recently, it has become evident that the 
Wnt pathway is also deregulated in a range of tumors [3]. 

The Wnt signaling pathway is activated v/hen Wm pro- 
teins bind to a cell surface receptor complex consisting of 
a member of the fmzled receptor (Fzd) family and either 
low-density-lipoprotein receptor- related protein (LRP)5 
or LRP6 [4, 5]. A detailed characterisation of the Fzds 
and ihe immediate downstream events after Wnt binding 
has been hampered by ihe lack of pure biologically active 
Wnls. 

Downstream of ihe receplor complex, three pathways 
may be initialed, depending on ihe composition of the 
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ligand and receptor complex. The 'Wm/p-caienin path- 
way*, the 'Wnl/Ca 2+ pathway' or the 'Wm polarity path- 
way* (6]- The Writ/0-catenin pathway has been linked to 
carcinogenesis. Genetic alterations in components of this 
pathway {adenomatous polyposis coli, APC, axin and 
catenin)can result in the accumulation of non-phosphory- 
lated 0-catenin [3, 7] and this can promote carcinogenesis. 
Conversely, neither the Wm/Ca 3 * pathway nor the Wm 
polarity pathway involves the activation of 0-catenin [for 
review, seeref. 1,6). 

Mutations in one of the three regulatory genes (APC, 
^-catenin and axih), over expression of Wnts and Fzds or 
the expression of a constitutively active Fzd have been 
linked to Wnt/0-caiejiin pathway activation in various 
tumors [8, 9], 

To evaluate the involvement of Fzds in oncogenesis, 
we investigated mRNA expression levels of several hu- 
man F2ds (Fzd2 f 3, 5, 6, 7, 8 and 9) in more than 30 differ- 
ent human tumor samples using real-time quantitative 
PCR. Each sample was compared with its corresponding 
(matched) normal tissue sample. The most striking obser- 
vation was the dramatically increased Fzd5 and Fzd8 
mRNA expression seen in the renal carcinoma samples. 
This was confirmed at the protein level using Western 
blotting. Kidney tumor tissue arrays confirmed Fzd5 
membrane staining in 30% of clear cell carcinomas, with 
nuclear cyclin Dl showing a strong correlation with the 
Fzd5 membrane labeling. Fzd8 protein expression analy- 
sis was not performed due to ihe Jack of suitable reagents. 
These data suggest" that Fzd5 may have a role in renal ceJl 
carcinogenesis due to its frequent overexpression ob- 
served in these tumor samples. Potential future applica- 
tions could include uses in tumor targeting or as a poten- 
tial biomarker. 



Materials and Methods 

Tbsiie Samples 

Fro2cn tumor tissue samples wiih cor res ponding normal tissue 
from the same patient were derived either from human biopsy, or 
autopsy material (Department of Pathology, University or Antwerp, 
kindly provided by Prof. E. Van Marck). Tissue specimens were 
snap- frozen in liquid nitrogen and kept at -S0*C until use. Frozen 
sections of kidney tumor and normal tissue samples were stained 
with hcmatoxylin-eosin to suppon the pathologist's observations and 
loconfirm the type of kidney tumor. Paraffin-embedded tissue slides 
of renal carcinoma, lung carcinoma, breast and colon carcinoma 
wcie obtained, after encryption, from the Department of Paihologv 
(Middrlheim Hospital. Antwerp. Belgium). The CLI human kidney 
cancer (Super BioChips Laboratories) tissue array used in this study 
contained 59 tissue samples consisting of 9 normal kidney tissues. 



30 clear cell renal carcinoma samples and another 20 renal celt tumor 
types (chromophil, chromophobe papillary type, collecting duel car- 
cinoma and samples with mixed types). 

RWA Isolation and Reverse Transcription 
Total RNA was extracted from tissue specimens using Ifltraspec 
Reagent (Bioiecx. USA) according Id the manufacturers instruc- 
tions. All total RNA was routinely treated with DNase (DNA-free kit, 
Ambion, USA). I ug of total RNA was used to synthesize cDNA 
using ojjgo-dT primers {Superscript; Invitrogen, Merelbeke* Bel- 
gium). Reverse transcription was performed at 42 *C for 60 min, fol- 
lowed by 70* C for 10 min. 

Real-Time PCR . 

Real-time PCR was performed on either an AB I Prism 7700 or 
7900 Sequence detection system (Perkin-Bmer Applied Biosysteros, 
Foster City. Calif., USA) using the 5' nuclease assay (Taqman™). 
Primer and probe sequences were designed using Primer Express (PE 
Applied Biosystems) and are shown in table I. Quantitative values 
were obtained from the threshold cycle number (Ct) at which the 
increase in the signal associated with exponential growth of PCR 
products is delected using PE Biosystems analysis software, accord- 
ing to Ihe manufacturer's instructions. ■ - * 

We have used the 2~ A *° method to analyze the relative changes 
in gene expression of the diiTerem gejits between tumor and corre- 
sponding normal lissxst samples. We used Ihe mitochondrial ATP . 
synthase 6 (ATPsy6) as the endogenous RNA control [ 1 0; Janssens et 
aJ. t in prep.}, and each sample was normalized to its ATPsy6 content. 
The relative expression of the target gene was also normalized to the 
corresponding normal tissue sample (calibrator). Results, expressed . 
as the amount of target sample relative to the ATPsy6 gene and the 
calibrator, were determined as follows, N = 2-* 6 r ,M ">pk- AOoifcmoo 
where the ACl values of the sample and calibrator were determined 
by subtracting the average Ct value of the sample and the calibrator . 
from the average Ct value of the ATPsy6 gene. Amplification was 
done essentially as described previously flOj. Briefly, 50 ul of reac- 
tion mixture containing 1 pi of cDNA template were amplified as 
follows; incubation at 50*C for 2 min, denaturalion at 95°C for 
10 min, and 50 cycles at 95*C for 15 s and 60*C for 1 min. 

Membrane Preparation. Get Electrophoresis and Jmmunobloiting 
Tissue samples were weighed, suspended at a 40 limes dilution 
1= 40 volumes/original wet weight of tistue (v/w)J in 50 mM Titi- 
HC1 buffer, pH 7.4, and homogenized with an Ullra-Turrax homoge- 
nizes After centrifugation for 10 min, 24,000$ at 4 *C. the pellet was 
washed three times by resuspensbn in the Tris-HCt bufler followed - 
by centrifugation. The final membrane pellets were stored at - 80* C 
in the Tris-HCI bufler at a concentration of 0.5-1 mg/ml. The Brad- 
ford protein assay (Pierce, Aalst, Belgium) was used Tor protein deter- 
mination. Proteins (50 ug) were separated. by 8% SDS-PAGE and 
transferred to nitrocellulose membranes. After primary and second- 
ary antibody incubation, the antigen-antibody- peroxidase complex 
was detected by cherriiluminescence (Pierce, Aalst, Belgium) accord- 
ing to the manufacturer's instructions. 

Immunohistochemiitry 

Jmmunohistochemislry was performed on JO-u^VMhick cryosec- 
tions of unfixed tumor tissue and on 6-u.AMhick para iTin seci ions 
from renal tumor tissue fi.xed by formalin or by an alcohol-based Fixa- 
tive. Adjacent tissue blocks from renal tumors were processed with 
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Table 1. Real-iiroe PCR primer ond probe sequences 



Target cDNA \ Primer/probe sequences* 


Fragment 
position 1 * 


Accession 
No* 


FZD2 


(a) 5'-alcccgtgcccggc-3' 

(b) 5'-gialtlgatcatgtagaccg!gaagic-3' 

(c) 5'-FAM-tacactccticeeatelr2c-TAMRA-V 


1,548-1,613 


AB017364 

* 


FZD3 


(a) 5'-lcacgcca^tgcatggg-3' 

(b) 5'-iigieaccncaatm3ticatcg-3' 

(c) 5*-FAM-caiccccggaactciaaccaicatcctttt-TAMRA-3' 


1,473-1547 


- AB039723 


FZD3 


(a) 5'-igccaaggtc3Cttccgm-3' . 

(b) 5'-tetccaagtcgccgcg-3' 

**-/-'• » ^'^"^tiicaiggi^gClgilgCCCCO- J AMKA-J 


2,143-2,204 


HSU43318 


FZD6 


(a) 5'<tagcacccccaggll aagagaa-3' 

(b) S'-cccagagagiclggagaiggal^' 

(c) 5'-FAM-tgiggi5aacclgcctc^ccag-TAMRA-3* 


2,094-2,170 


AF072873 


FZD7 


(a) 5'-cci#£gaaaggeataacigtg-3' 

(b) 5'-aaccaacgggaaaccicaga-3' 

(c) i'-FAM-aagcaacilttataggcaaagcagcgcaa-TAMRAO' 


2,687-2,762 


AB0I7365 


FZD8 


(a) 5'-igiggtcggigcic!gcU-3' 

(b) 5'-cgctecatgtcg3laaggaag-3' 

(c) 5'-FAM-ccscc»cgccsccgIc£cca-TAMRA-l' 


853-919 


AB043703 


FZD9 


(a) 5'-ccccgggagciacgg3c-3' : 

(b) 5^!aglcalgtgcaag3ecacgg-3' . 

' (c) 5vFAM-!ggcacgcactgcc3Ctal3aggct-TAMRA-3* 


1,696-1,763 


HSU82169 


ATPsy6 


(a) 5'-ggtgiaggigigcciigigg!-3' 

(b) 5'-gggcgcagtgatt3taggctl-3' 

(c) 5'-FAM-33gigggcragggcalUltaalcH3gagcg-TAMRA-3' 


. 580^503 


AF36827I 


e-myc 


(a) 5'-aec3Ccagcagrg3cictga-3' 

(b) SMceagcagaaegtgaiccagactO' 

(c) 5 ; -FAM-occutlgccagg3gcctgcctci-TAMRA-3' 


1.297-1,413 


HSMYCI 


CyelinDI 


(a) 5'-gaacctggccgcaaigac-3'* 

(b) 5'-cgcctciggc3il>»gga-3* 

(c) 5'-FAM-ccgcacgaiHca»gaacacii-TAMRA-3' 


4,148-4,211. 


AF5I 1593 


PPAR5 


(a) 5'-agca!cctC3ccggcaaa-3' 

(b) 5'-gicicgatgrcg!ggatcaca-3' 

(c) 5'-FAM<cagcc3cacggcgccc!-TAMRA-3' 


932-990 


. NM-006238 . 



(a) ~. Sense primer, (b) = an I isense primer; (c) = probe. 

Fragment positions are given according lo the EM BUCenBank accession No. of cloned sequence. 
EM BJUGenBank accession No. of cloned sequence. - 



were used as positive controls for 0-catenin and cyclin DI immuno- 
staining. 

The following primary antibodies were used: F2d5 (Upstate Bio- 
technology), p^atenin (Zymed), cycJin Dl (Zymed). E^adherin 
(Novaccsira) and eyiokeratin 8 (Biogenex). Cryoseciions were fixed 
in A% paraformaldehyde for 5 min; acetone for 5 min at -20*Cand 
70% elhanol for 5 min. Endogenous peroxidase activity was 
quenched using 3% H;Oj. Paraffin sections of formalin- and alcohof- 



formalin and with the alcohol-based fixative. Paraffin and cryosec- 
iions wctc mounted on poly-i-Iysine or 3-aminopropyltriethoxys^ 
lane-gelaiin-coated slides. The 59 tissue samples on the CLI human 
renal cancer tissue array, slides were, all fixed wiih formalin. and 
embedded in paraiTin. and (he sections were mounted on siJane- 
coatcd slides (SuperBioChips Laboratories). In addition to renal car- 
cinoma tissue., sections from 10 formal in- fixed para /fin- em bedded 
lung carcinomas were stained for Fzd5. Colon and breast tumors 
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Table 2. Fzd mRN r A expression in tumor samples 



Sample 3 Tissue 



Tumor rype 



.vfold expression increase* 



FZD2 F2D3 F2D5 FZD6 . F2D7 FZD8 . F2P9 



\5jI\j2 


kidney 


* adenocarcinoma 


0.17 


J J///0 


kidney 


renal cdj carcinoma 


1.23 


) 38844 


kidney 


renal cell carcinoma 


0.31 


137 146 


kidney 


renal cell carcinoma 


0.47 


137564 


kidney 


. renal cell carcinoma 


? 47 
■ j.*tj 


.133408 


kidney 


renal cell carcinoma 


23.97 


139188- 


kidney 


renal cell carcinoma 


3.7 


135699 - 


kidney 


renal cell carcinoma 


2.6 


139064 . 


kidney 


renal cell carcinoma 


5.16 


134585 


kidney 


renal cell carcinoma " 


1.47 


H0279 


kidney 


renal cell carcinoma 


7.33 


137252 


ovary 


carcinosarcoma • 


0.4 


138256 


ovary 


papillary carcinoma 


0.7 


146472* 


ovary 


serous papillary carcinoma ■ 


0.39 


145845 


colon 


adenocarcinoma 


3.45 


146145 


colon 


adenocarcinoma . 


5J6 


1 46630 


colon 


adenocarcinoma 


4,01 


146633 


colon 


adenocarcinoma 


1.87 


147055 


colon 


adenocarcinoma 


0.66 



0.5 
156 
0.11 
0.45 
2.95 
0.98 
0.56 
0.36 
1.82 
0.72 
18.17 



3.72 

826 

6.84 

3.16 

9.6 

6,39 

0.66 

4.83 

1.25 

1.38 

3.93 



2.13 
2.61 
1.18 

2.23 

1.57 

2.48 

1.33- 

0.9 

2.36 

0.47 

6.05 



0.06 

1.3 

0.23 

0.73 

7.64 

5,05 

0.37 

0.34 

1.8 

0.09. 
6.4! 



1.32 
8.2 1 
3J8 
4.42 
152 
16.72 
4.41 
2.54 
2.19 

o;6 

4.65 



139 
.5.17 
129 



0.49 
1.22 
2.56 



0.44 
2.19 
1.34 



0.92 

0:4 

194 



0.54- 
0.11 
.0.67 



3.36 
6J4 
4.73 
1.07 
1.41 



0.58 

4.42 

0.3 

1.62 

1.01 



U2 
6,57 
1.16 
1.47 
1.33 



J. 67 
136 
0.54 
0.51 
0.24 



124 

6.08 
0.55 
2.07 
0.79 . 



142253 


lung 


adenocarcinoma 


0.67 


143036 


lung 


adenocarcinoma 


0.67 


138938 


. lung 


adenocarcinoma 


0.93 


133563 


Jung 


adenocarcinoma 


2.76 


144387 


Jung 


-. adenocarcinoma 


12.85 


137304 


lung 


acinary adenocarcinoma 


0.54 


144546 


lung 


epithelial carcinoma 


0.1 


137621 


lung 


epithelial carcinoma 


1.52 


145552 . 


lung 


epithelial carcinoma 


1.09 


143987 


1 est is 


embryonal carcinoma 


0.24 


137332 


siomach 


leiomyoma 


19.32 


139026 


stroma 


gastrointestinal carcinoma 


66.1 


136049 


rectum 


adenocarcinoma 


1.71 


140794 


gall bladder 


adenosquamoiis carcinoma 


0.19 



5.17 

1.24 

0.88 

1.23 

0.43 

9.15 

0.67 

2.19 



0.43 
2.63 
1.07 
0.31 
0.49 
1.65 
0.21 
.0.37 
0.16 



. 1.87 
1.13 
1.3 
134 
0.25 
1.25 
8.11 
2.26 
0.91 



0J} 
1. 11 
J. 73 

0.99 
2.73 
0.63 
0.27 
0.52 
1.47 



0.59 
1.04 
0.38 
0.78 
0.54 
141 
0.97 
0.45 
0.13 



0.6 
191 
151 
0.52 



19.43 
0.45 
0.03 
0.4 
0.93 



1-33 
J. 53 
6.67 
0.54 
102 



2112 
18.98 
8.44 
0.29 
0.2 



2.43 
32.82 
2.28 
0.88 



5.44 

2.8 

1.37 



K53 
2.31 
2.85 

7,91 



0,53 
0.59 



1.99 
0.15 
0.12- 

4.87 



0.41 
9.19 
0.76 
4.97 

2.64 
1.32 
1.06 
0.5 



0.33 
16.7 
20.82 

02Z 



» Sample identification numbers were given by the pathologist. 

Results are expressed as a-lbld increase of ihe gene in the tumor tissue sample compared to its matched normal tissue sample after 
norma mng both samples on Ihe basis of their ATPsy6 content. A ciiioJT of 3-fold was used to define differential expression. Significant 
(> 3-Md) mcreDses m the expression level of the Fzd recepior are shown in italics. - = Expression of the target gene undetectable in one or both 
samples (tumor and/or normal). 



(wed tissue were processed with a trypsin^ilraic-microwave pre- 
treatment or with an EDTV microwave pre treatment to unmask epi- 
topes, respectively. Sections were then sequentially processed with 
primary antibodies, biptinylaied secondary antibodies and slreplav- 
idin-bioiin-peroAidase (F2d5. E-cadherin and cytokeratin 8). For 0- 
catenin. polyclonal rabbit antibody with the EnVision dciection svs- 
tem (DAKO) was used. The slides were further developed using 3- 
amino-9-eihylcarb320le. counterstained with hrmalaun and mount- 



ed with glycerin gelatin. Stained sections were observed with an 
Aaioplan 2 microscope equipped with an Axiocam digital camera. 
Staining intensity for 0-catenin was scored as no staining (value 0), 
weak and fragmentary staining of cell membranes (value 1), moder- 
ate membrane staining of less than 50% of the tumor cells (vaJue-2). 
moderate membrane staining of more than 50% of lumor cells (value 
3) and strong membrane staining of more than 75% of tumor cells 
(value 4). The cyclin Dl staining was quantified as a percentage of 
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Fig. 1. Fzd5 protein expression in matched 
tumor/normal kidney samples. T = Tumor 
sample; N = ftiaiched normal sample. Sam- 
ple identification numbers are given by ihfe 
pathologist. 



133702 137770 138844 137146 137SW 133408 

T N T N T NT-N T N T N. 

-:?-?fc ■ ' 



139188 135699 139064 134585 140279 

T T N T N T N T N 



- Actin , 




^ Fxd5 



- Acrin 



cyclin Dl-imrounoreaciWe nuclei in tumor cells in three fields (area: 
18,641 pm J ) of each tumor sample: The total number of tumor nuclei 
ranged from .51 to 164. The correlation between FidS and ft-caienin 
staining arid between Fzd5 and cyclin Dl staining was evaju 3 jed by 
the Mann-Whitney U IcsL . 



Results 

Fid mRNA Expression in Matched Human 

Tumor/Normal Tissue Samples 

Fzd expression in tumor tissue was compared with Fzd 
expression in matched norma) tissue samples and normal- 
ized to ihe expression of I be housekeeping gene mitochon- 
dria] ATPsy6 (table 2). A 3-fold increase was considered 
significant. . 

In the kidney rumor samples, in which 10 of 1 1 sam- 
ples were clear cell carcinomas, Fzd5 was upregulaled in 8 
of Ihe 1 1 samples. A simihr observation was made for 
Fzd8 and Fzd2, which were upregulated in 7 and 5 renal 
tumor samples, respectively. None of the other Fzds 
showed consistent upreguJa I ion. 

Both Fzd2 and Fzd3 were upregulaled in 3 of 5 colon 
adenocarcinoma samples. No other F2d expression was 
significantly different compared to the normal colon tis- 
sue sample. F2d3 showed an increased expression in all 3 
ovarian carcinoma samples. Fzd expression was not al- 
tered in any of the lung tumor samples. The Fzd expres- 



sion level was observed to be relatively Jow in these lung 
tissues compared to the other tissues investigated. 

Western Blot and Jnifriunohisiochemistry Analysis on 
Renal Carcinomas 

Western blotting was used to evaluate Fzd5 protein 
expression in the renal tissue samples used for mRNA 
expression analysis. Membrane fractions of the renal car- 
cinoma and corresponding normal tissue samples were 
prepared. As previously shown (table 2), Fzd5 .mRNA 
upregulation was detected in 8 of the II matched tumor/ 
normal samples. Increased expression of Fzd5 protein 
was seen in membrane fractions from 9 of 11 samples 
(fig. 1). In most cases, concomitant increases in Fzd5 
mRNA and protein levels were observed. 

Hemaloxylin-eosin staining of the Cryoseclioned tu- 
mors confirmed the presence of clear cell carcinoma. 
Fzd5 immunostaining in clear cell carcinoma (fig. 2a, b) 
was observed to be localized to cell membranes and to 
. nuclei. Cytokeratin 8 (fig. 2c) and E-cadherin (fig. 2d) 
were also detected. E-cadherin labeling of cell membranes 
in clear cell carcinoma was less intense and patchy com- 
pared to epithelial cells of normal renal tissue. P-Caienin 
staining was confined to Ihe cell membrane. (J-Catenin 
levels in the clear cell carcinoma membranes were highly 
variable. Nuclear p-catenin staining was not observed in 
any of the samples. Epithelial cells in nprmal renal tissue 
showed intense membrane staining and some cytoplasmic 
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Staining. In addition, weak p-catenin staining of endothe- 
lial cells was observed. A high number of cyclin Dl- 
immunoreacuve nuclei was observed in cleared! carcino- 
. ma (fig. 21). 

On the CL1 human kidney cancer tissue array, 30% 
(n - 9) of the clear cell carcinoma tumor samples (n = 30) 
showed Fzd5 immunoreactivity (fig. 3a). Membrane-as- 
sociated p-catenin staining was observed in 33% of the 
FzdS-positive tumor samples and 57% of Fzd5-negative 
clear renal cell carcinoma samples (table 3; fig. 3c, d). 
Again, nuclear 0-catenin staining was never observed. 
Statistical analysis did not reveal a difference in the 
expression of 0-catenin between Fzd^positive and Fzd5- 
negative tumor samples (fig. 4a). 

Nuclear cyclin Dl was observed in. 89% of the FzdS- 
positive clear cell carcinoma samples (table 3; fig.3e). 
Only 38% of the Fzd5-negative clear cell carcinoma sam- 
ples contained nuclear cyclin Dl. Statistical analysis 
showed a significantly higher cyclin Dl expression in 
Fzd5-positive compared to Fzd5-negalive tumor samples 
(fig. 4b). 



Expression of PPAR5 was investigated because it repre- 
sents a direct target of the P-catenin pathway with T cell 
factor binding sites in its promoter. Expression of c-mvc 
was found to be upreguJated in 7 of 1 1, whilst cyclin Dl 
was upreguJated in 10 of 1 1 kidney tumor samples (ta* 
ble 4). PPAR5 was upreguJated in 9 cases. All three select- 
ed target genes showed a marked upregulatkwi in the 
majority of renal tumors, which suggested that the Wnt/ 
p-catenin pathway was activated in these samples. 

Discussion 



c-mya Cyclin Dl and Peroxisome 
Proliferaior-Aciiyaied Receptor depression in Renal 
Carcinomas 

Wnt/p-catenin.pathway activation, in the kidney tissue 
samples was investigated looking at the expression of a 
number of target genes, which have previously been 
shown to be upreguJated when the pathway is active.' 
Gene expression of c-myc, cyclin Dl and peroxisome 
proliferator-activated receptor 5 (PPAR5) was analyzed. 
Increased expression of both c-rriyc and cyclin Dl genes 
have been implicated in cell proliferation, and carcino- 
genesis, and they represent two of the more important and 
closely studied target genes of the Wnt signaling pathway. 



F5g.2. Distribution of Fzd5 {*, b), evtokeratin 8 (c), E-cadherin (d) 
and P^atcnin (e) immunortaciivity in paraffin sections from a renal 
tumor processed by an alcohol fixative. From ihc same lumor. a for- 
matin-fixed block was used for cyclin Dl immunosiaining (f). Fzd5 
Jmmunosiaining shows distinct immimoieaetiviiy i D cell membranes 
and in nuclei of clear cell renal caj cmoma. Gear cells are immunore- 
active for cyiokeratin 8. fVCaienin and E-cadherin staining of mem- 
branes is rather weak, and not uniform, in clear cell renal carcinoma. 
Nuclear (J-calenin immunoreacliviiy was noi observed. In clear cell 
renal carcinoma, many nuclei showed cyclin Dl immunoreacliviiy 
The inset in f shows a detailed view of ihe cyclin Di labeling of nuclei 
in dear eel) renal carcinom a. 



Fzd family member overexpression has been postu- 
lated to play key roles in different tumor types such as 
esophageal carcinoma [11], gastric cancer f 12] and head 
and neck squamous, cell carcinoma [13], The current 
study evaluated the potential implication of F2ds as 
tumor-associated antigens in different tumor types. We 
screened a number of matched normal/tumor tissue sam- 
ples for the expression of a variety of Fzds using real-time 
quantitative PCR. . 

Results obtained revealed that both Fzd5 and Fzd8 
mRNA were overexpressed in the majority of renal carci- 
noma samples when compared to the matched normal 
kidney samples. Fzd2 and Fzd3 were upreguJated in 3 of 5 
colon adenocarcinoma samples. Fzd3 was also upregu- 
lated in the ovarian tumor tissue samples compared lo the 
matched normal tissue samples. None of the other Fzds 
evaluated showed a specific differential expression pat- . 
tern in any of the samples studied. Fzd5 and Fzd 8 show 
69.1 % similarity and belong to the same subgroup of Fzds 
[14]. The significantly higher expression of Fzd5 and. 
Fzd 8 in the renal tumor samples, as compared to the nor- 
mal renal samples, suggests a higher probability that this . 
subgroup may be implicated in the progression of renal 
cancen Therefore, we decided to further examine the pos- ' 
sible role of Fzd5 in renal carcinoma. 

We observed, using Western blotting, thai protein lev- 
els were mostly consistent with mRNA levels in the tumor 
samples. In order to be able to determine the Fzd5 expres- 
sion in a broader range of kidney tissues, we utilized a 
tissue array. Fzd5 membrane staining was detected in 9 of 
30 (30%) clear cell carcinomas, and importantly, mem- 
brane staining was not detected in the matched 9 normal 
kidney tissue samples. 

Since the Wnt signaling pathway appears to play an 
important role in embryonic development, in particular 
embryonic kidney induction [J 5, 16], activation of this 
pathway in the adult kidney due to mutation or overex- 
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Table 3. Correlation between Fzd5 and 
0-catenin orcyclin DJ expression 
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89 
li 


57 
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38 
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pressjon or one of the components or the pathway couJd 
be a determming factor in the development of renal can- 
cers. Therefore, several studies have looked into the possi- 
ble function the Wnt/^calenin pathway plays in renal 
carcinogenesis. AFC gene mutations have been demon- 
strated not to be involved in renal carcinon/a [17, 18]. J D 
addition, 0-catehin mutations are rare events in renal car- 
cinoma [19, 20J. Nevertheless, cytoplasmic accumulation 
of p-catenm has been reported in a number of renal cell 
carcinomas [19], and thus the Wnt signaling pathway 



Table 4. Wni/^atcnin target gene mRNA expression 



m tumor samples 



Sample 3 Tissue 



Tumor type 



133702 

137770 

138844 

J37J46 

137364 

J33408 

I39J88 

135699 

139064 

J34385 

140279 



kidney adenocarcinoma 

kidney renal ceil carcinoma 

Sidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney rC j)a| cell carcinoma 

kidney renal cell carcinoma 

kidney renal cell carcinoma 

kidney renal eel) carcinoma 

kidney renal cell carcinoma 

kidney renal ceJJ carcinoma 



x-fold expression increase* 


a 


4- 


c-myc 


cyclin D I 


PPAR5 . 


0S4 


4.52 


0,52 / 


icore 


3* 


J3S . 


28.91 


5.53 


£ 


2- 


2.39 


31.49 


7.48 


• 




7.62 


1538 


3J5 i 




1- . 


33.82 


19.65 


8.65 






7.8 


8.92 


4.86 




D- 


2.22 


9.92 


n.67 






12.18 


22.73 


5J3 . 




-1 - - 


22.11 


5.04 


633 






1.79 


1.14 


1.67 






61.68 


54.95 


14.62 
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Fig. 3. TzdS, 0-catenin ar>d cyclin DI immunostainin* of th* n i 
. ^ays. Tne leA column of images repress S section, r rCn3j Ca ' Cm ° ma lissue 
^ove.iewo^^ 

Uve clear cell renal carcinoma (a) oflhis lumm < am „i j ' ? ^ FzdS-jmmunoreac. 

The right column of images is taken f^^-f ^ " ™ } c «»«»- 

carcinoma from this rumor sample doe^^ " r ^ ^ 

distinct membrane 0-c a ,enin staining (d) ' ^ ^ ^ Cycl,n Dl < f > bul do « 



Fig. 4. Box plot charts (thick black line = 
median) illustrating tht relationship be- 
tween Fzd5 immunosiaining and 0-caienin 
(a) and cyclin DJ (b) expression in clear cell 
renal carcinoma. No significant correlation 
was observed bet ween the ^aienin scores of 
Fzd5-positive and -negative clear cell renal 
carcinoma. Nuclear cyclin. Dl fining in 
clear cell renal carcinoma showed a signifi- 
cant. di/Terence between Fzd^positive and 
Fzd5-ne S ative tumor samples. 
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might act as an inducer of tumorigenesis in the kidney, nin was not observed in any of the tumors or on the tissue 
This view is supported by the observation that aberrant array. This result is consistent with the data presented for 
activity oftheWnt signaling pathway has been reported in renal cell carcinomas by Kim et aL [19]. They did not 
renaJ-cancer-derived cei) lines. Zang et aL [2 1 ] observed a detect nuclear p-catenin staining in the 52 renal cell carci- 
higber expression level of Wnt5a and Fzd5 mRNA in the nomas examined. The lack of nuclear p-catenin staining 

. renal cancer cell line GRC-1 than in the normal renal cell has also been reported by others in tumors that might 
line HK-2. Expression of P-cateniri was also higher in have arisen from Wnt/p-catenin pathway activation [28- 
GRC-1 than in HK>2. 31]. 

To determine the status of the canonical Wnt signaling While expression of both Wnt5a and Fzd5 does induce 
pathway in our renal carcinoma samples, we have quanti- duplication of the Xenopus head, exogenous expression of 
lated the mRNA levels of three important target genes of Fzd5 in a Xenopus model does not induce duplication of 

. T cell factor/lymphoid enhancer factor activation by the head [32]. Fzd 5 does not activate the P-catenin signal- 
P<atenin- The mRNA levels of these three target genes ing pathway on its own, as the presence of its endogenous 

■ (t>myc, cyclin Dl and PPAR 5) correlated largely with the ligand is also required. Our results suggest that Fzd5 may . 

expression, of F2d5 in these samples, suggesting that the have a role in renal cell carcinogenesis due to its frequent 

canonical pathway is activated. On the kidney tissue overexpression observed in these tumor samples, and we 

array, cyclin Dl protein expression showed a highly sig- hypothesize that iTFzdS is overexpressed, it has a rather 

m'ficant correlation with the Fzd5 expression in the tumor limited effect, on p-catenin signaling, However, in the 

samples (table 3). Cyclin Dl protein is frequently. overex- presence of its endogenous still unknown ligand, it acti- 

pressed in various tumors, but in only a proportion of the vales the canonical Wnt signaling pathway. The elucida- 

cases is it due to amplification of the cyclin D 1 gene [22]. tion of this ligand and its binding characteristics is stiU 

Therefore, other mechanisms such as upregulatiori of gene under investigation. Ultimately, knowledge of the specific 

transcription may play a substantial role in the overex- expression patterns of both Wnt and Fzd members could 

pression of cyclin Dl [23-26]. Our data, showing in- lead to directed tumor targeting or could be used as a 

creased cyclin Dl expression in renal carcinoma samples, tumor marker, 
are consistent with the results of Stassar et ah [27]. They 
studied genes that are associated with human renal carci- 
noma by suppression subtractive hybridization and re- Acknowledgment 
ported 14 diflerentially expressed genes, including cyclin 

Dl. Although we would have expected an increased nil- We are.graieftil lo Prof-E. v an Marck (University of Antwerp) for 

. n - ... ; , . . providing us with the tissue samples. 

clear Jkratenin siaming, nuclear accumulation of P-cate- . 
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Irnmunohistochcmical analysis of NY-ESO-1 antigen expression in normal 
and malignant human tissues, 

Jungbluth AA , Chen Yl\ Stockert E , Busam KJ , Kolb D, Iversen K . Coplan IC 
Williamson B , Aftorki N, Old LJ. 

Ludwig Institute for Cancer Research, Memorial Sloan-Kettering Cancer Center, New 
York, NY, USA. jungblua@mskcc.org 

NY-ESO-1, a member of the CT (cancer/testis) family of antigens, is expressed in normal 
' testis and in a range of human tumor types. Knowledge of NY-ESO-I expression has 
depended on RT-PCR detection of mRNA-and there is a need for detecting NY-ESO-1 at 
the protein level. In the present study, a method for the immunochemical detection of 
NY-ESO- I 'm paraffin-embedded tissues has been developed and used to define the 

. expression pattern of NY-ESO-1 in normal tissues and in a panel of human tumors. No 
normal tissue other than testis showed NY-ESO-1 reactivity, and expression in testis was 
restricted to germ cells particularly spermatogonia. In human tumors, the frequency of 
NY-ESO-l antigen expression corresponds with past analysis of NY-ESO-1 mRNA 
expression e.g., 20-30% of lung cancers, bladder cancers and melanoma, and no 
expression in colon and renal cancer. Co-typing of NY-ESO-1 antigen and mRNA 
expression in a large panel of lung cancers showed a good correlation. There is great 
variability in NY-ESO-1 expression in individual tumors, ranging from an infrequent 
homogeneous pattern of staining to highly heterogeneous antigen expression. Copyright 

. 2001 Wiley-Liss, Inc. 
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Multidrug resistance phospkoglycoprotein (ABCB1) in the mouse 
placenta: fetal protection. 

Kalabis GM, Kostaki A, Andrews MH. Petropoulos S . Gibb W . Matthews SG 

Department of Physiology, University of Toronto, Ontario, Canada. 

The multidrug resistance phosphoglycoprotein ATP-binding cassette subfamily B 
(ABCB1) actively extrudes a range of structurally and functionally diverse xenobiotics as 
well as glucocorticoids. ABCB 1 is present in many cancer cell types as well as in normal 
tissues. Although it has been localized within the mouse placenta, virtually nothing is 
known about its regulation. In the mouse, two genes, Abcbla and Abcblb, encode 
ABCB I. We hypothesized that there are changes in placental Abcbla and Abcblb gene 
expression and ABCB1 protein levels during pregnancy. Using in situ hybridization, we 
demonstrated that Abcblb mRNA is the predominant placental isoform and that there are 
profound gestational changes in the expression of both Abcbla and Abcblb mRNA. 
Placentas from pregnant mice were analyzed between Embryonic Days (E) 9.5 and 19 
(term approximately 19.5d). Abcblb mRNA was detected in invading trophoblast cells 
by E9.5, peaked within the placental labyrinth at El 2.5, and then progressively decreased 
toward term (P < 0.0001). Abcbla mRNA, although lower than that of Abcblb at 
midgestation, paralleled changes in Abcblb mRNA. Changes in Abcbl mRNA were 
reflected by a significant decrease in ABCB1 protein (P < 0.05). A strong correlation 
existed between placental Abcblb mRNA and maternal progesterone concentrations, 
indicating a potential role of progesterone in regulation of placental Abcblb mRNA.' In 
conclusion, there are dramatic decreases in Abcbla and Abcblb mRNA and in ABCB1 at 
the maternal-fetal interface over the second half of gestation, suggestiug that the fetus 
may become increasingly susceptible to the influences of xenobiotics and natural steroids 
in the maternal circulation! 
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Expression of human telomerase reverse transcriptase gene and protein 
and of estrogen and progesterone receptors, in breast tnmors: Preliminary 
data from neo adjuvant chemotherapy. y 

gammoriM , Izumiyama N , Hashimoto M, Nakamura K Okano T Kurabavashi R 
N^okLH, HonmaN, OgawaT . KamlmsluM^i^X 

Division of Breast and Endocrine Surgery, Department of Surgery Graduate School of 

Human telomerase reverse transcriptase (hTERT), the catalytic subunit of telomerase is 
very closely associated with telomerase activity. Telcme^ has been implTcaS ^ n 
ce lular immortalization and carcinogenesis, in situ detection of hTERT S 
de te™ mng the Ionization of telomerase-pos.tive cells. The aim of tWsTtudy was to 
detect express.on of hTERT mRNA, hTERT protein, estrogen receptor (ER) and 
rSr n t " u Pt ° r (P ? " P^ffi-^edded breast tissue samples an I to ^investigate 
the relat.onsh.p between hTERT expression and various dinicopatiological pa35 
m breast tumongenesis. We used in situ hybndization (ISH) to LJS^oSTT' 

ER andPR ^r^™^ (IHC) to Session of n^pr" in 

ER and PR, m breast ussues including 64 adenocarcinomas, 2 phyllode tumors a^d het 
adjacent normal breast tissues. hTERT gene expression was dJEJL b y m n X (88%) 

^ZlHclt^^ 2 PMlQde ^ ^pwtein exp'rfssln fai 
Moreover ER Jl PK * %) CaTCm °™ S > but in neither * 2 phyllode tumors. 
Moreover ER and PR were expressed in 42 (66%) and 42 (66%) carcinomas 

rhTS P f ? T gCne and protem before treatment, neo-adiuvant 
chemotherapy led to disappearance of gene and protein expression in aliases There was 
a slrong correlate between detection of hTERT gene expression b^^oSrER? 
protem by ICH ir ! tissue specimens from breast tumors. Tnese results suggest mat 

po S T by ICH Can be Used * ^inguish breast cSs s a 

potential diagnostic and therapeutic marker. 
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Abstract. Human telomerase reverse transcriptase (hTERT), 
the catalytic subunit of telomerase, is very closely associated 
with telomerase activity: Telomerase has been implicated in 
cellular immortalizabon and carcinogenesis. In situ detection of 
hTERT will aid in determining the localization of telomerase- 
positive cells. The aim of this study was to detect expression of 
hTERT mRNA. bTERT protein, estrogen receptor (ER) and 
progesterone, receptor (PR) in paraffin- embedded breast tissue 
samples and to investigate the relationship between hTERT 
expression and various dinicopathological parameters in breast 
tumorigenesis. We used in situ hybridization (1SH) to examine 
hTERT gene expression, and immunohistocbemistry (JHC) to 
examine^xpression of hTERT protein, ER and PR, in breast 
tissues including 64 adenocarcinomas, 2 phyllode tumors and 
their adjacent normal breast tissues. hTERT sene expression 
was detected by JSH in 56 (88%) carcinomas, but in neither 
of the 2 phyllode tumors. hTERT protein expression was 
detected by IHC in 52 (81%) carcinomas, but in neither of 
the 2 phyllode tumors. Moreover. ER and PR were expressed 
in 42 (66%) and 42 (66%) carcinomas, icspectively. and in 
neither of the 7 phyllode rumors. In 4 cases of breast carcinoma 
that strongly expressed hTERT gene and piotein before 
treatment, neo- adjuvant chemotherapy led to disappearance of 
gene and protein expression in all cases. There was a stiong 
correlation between detection of hTERT gene expression by 
JSH and of hTERT protein by 1CH in tissue specimens from 
breast rumors. These results suggest that detection of hTERT 
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protein by 1CH can be used to distinguish breast cancers as a 
potential diagnostic and therapeutic marker. 

Introduction 

Breast cancer is the most frequent malignancy in women, 
affecting up to one in every eight females worldwide. The 
most important cUnicopathologica) prognostic parameter so 
far identified is the absence or presence of lymph node 
metastasis, but the identification of farther parameters for 
both lymph node-positive and -negative patients would 
facilitate an individually based risk-directed therapy (1). A 
promising emerging molecular marker is telomerase. a 
ribonucleoprotein enzyme complex, which when activated 
or upregulated allows tumor cells to escape from cellular 
senescence and to proliferate indefinitely (2). The human 
telomere is a simple repeat sequence of six bases (TTAGGG) 
that is located at the ends of each chromosome (3). Telomeres 
are believed to protect against degeneration, reconstruction, 
fusion, and loss (4) and to promote the homologous pairing 
of chromosomes (5). The end-to-end chromosome fusions 
observed in some tumors may result from the loss of telomeres 
and may be partly responsible for the genetic instability 
associated with tumorigenesis. Telomerase catalyzes the 
synthesis of telomere DNA and facilitates cell immortal iiatr on 
through the stabilization of chromosomal structure (6-8). 
Although the expression of the human RNA component of 
telomerase (hTERC) is widespread, the restricted expression 
pattern of the mRNA of hTERT, the human telomerase 
catalytic subunit gene, is correlated with telomerase activity 
(8-13). As telomerase activity seems to be the key player in 
tumor cell immortality, it has importance as a target molecule 
for anti-cancer therapy. Telomerase activity has been shown 
to correlate with poor clinical outcome in neuroblastomas 
and other tumors (14). For breast cancer, however, telomerase 
activity is a controversial prognostic marker: some studies 
.supoesi that telomerase activity . dinicopathological parameters 
anddrsease outcome are linked, whereas others do not find 
this association < J 4-23). 
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We have succeeded in very clearly and sensitively demon- 
strating hTERT mRNA in thyroid, colorectal, parathyroid 
and lung tissues by use of an olicoriucleoitde probe ( 13,24-26). 
Strong correlation has been observed Del ween hTERT mRNA 
and/or protein expression and telomerase in a variety of 
. malignant tumors (1 3. 14 ,24 ,25 ,27 ,28). In the present study, 
we used 1SH to examine expression of the hTERT gene, and 
JHC to examine expression of hTERT protein, BR and PR, in 
64 carcinomas and 2 phyllodes rumors of breast to determine 
whether hTERT protein can be used to differentiate breast 
cancers. We also analyzed hTERT mRNA and protein 
expression with special reference to clinical features and 
histological findings to investigate the potential role of 
hTERT mRNA expression analysis in prediciing the biological 
characteristics of breast cancers. Since hTERT expression in 
breast rumors has not previously been analysed by ISH or IHC, 
our investigation also examined various clinicopathological 
parameters, including age, histopathologkal type, tumor si2e, 
lymph node status, relapses, and the expression of ER and PR. 

' Materials and methods . . 

Tissue collection. Sixty-six samples were obtained during 66 
mastectomies; 64 breast carcinomas, 2 phyllodes tumors and 
66 specimens of the adjacent normal breast gland. In 4 cases, 
samples were obtained during core needle biopsies (CNB) 
before neo-adjuvant therapy, and again during mastectomies 
after neo-adjuvant therapy; for these cases, all measurements 
ai)d examinations were performed both before and after the 
neo-adjuvani therapy. The patients ranged in age from 32 to 
90 years. The patients with cajciriomas ranged in age from 37 
to 90, mean 56, years and were all women. The women wiih 
phyllode tumors were aged 32 and .38 years, respectively. 
The surgical and CNB tissue samples were frozen rapidly 
wjih liquid nitrogen and stored at -80*C until fixation. Then, 
they were fixed in 10% buffer formalin solution and 
embedded in paraffin. Surgical and CNB samples were 
collected from the patients after obtaining their informed 
consent, and the study protocol was approved by the Medical 
Depanmeni of the University of Tokyo Ethics Committee. 
The pathologic diagnoses were made by the surgical 
pathological specialists at our institute on the basis of 
examination of hematoxylin-eosin Stained slides. A 
pathological review was performed for all breast tumors 
according to the BRE score. pT and pN staging were assigned 
according to the 1997 WHO classification (7th edition). 

MCF-7 human breast cancer cells, kindly provided by the 
Cell Resource Center for Biomedical Research. Institute of 
Development. Aging and Cancer. Tohoku University, were 
used as positive controls. The cells were incubated in RPJVIl- 
1640 medium with 25 mM HEPES buffer. L-gluiamine. and 
10% fetal bovine serum (Gibco, Giand Island. NY) on a 
chamber-attached slideglass (Lab-Tck* Chamber Slide 1 *: 
Nalge Nunc international. NapervjJlc. IL) in a humidified 5% 
C0 2 atmosphere at 37*C. The cells were then fixed with 10% 
buffeied neutral formalin (Sigma Chemical Co.. St. Louis. 
MO). The cultured MCF-7 cell line that was used as a positive 
control was tested for telomerase with a PCR- based standard 
TRAP assay (6 J 3). These cells were also used to pit-pare cell 
blocks. Brie fly. the cells were fixed in 10% buffeieil neutral 



formalin, resuspended in molten agarose and then embedded in 
paraffin. Sections ftom these cell blocks were used as positive 
procedural controls in ISH and IHC. The negative control in 
ISH was obtained by replacing the oligonucleotide probe with 
RNase. The negative control in IHC was obtained by replacing 
the primary antibody with Tris-buffered saline (TBS). 

OUgonuchohde probe for ISH. The specificity of the oligo- 
nucleotide sequence was initially determined by a GcnEMBL 
database search using the Genetics Computer Group Sequence 
Analysis Program (GCG, Madison, WI) based on the fastA 
algorithm (29); the sequence exhibited 100% homology 
With the hTERT gene sequence. A d(T)^ oligonucleotide 
was used to verify the integrity and lack of degradation of 
the mRNA in each sample. Ail oligonucleotide probes 
were synthesized with a hapten-labcled nucleotide, such as 
digoxigeninrdUTP (Boehringer-Mannheim), at the 3* end via 
direct coupling by using standard phosphoramidite chemistry 
(Research Genetics, Huntsville, AL) (30). The probe used for 
detection of hTERT by ISH was generated from the original 
sequence for Homo sapiens telomerase reverse transcriptase 
(AF0 15950), 2766-2800: S'-GCCTCGTCTTCTACAGGGA 
AGTTCACCACTGTCTT-3' (13,24-26). 
> 

ISH. ISH was performed with the GenPoint nucleic acid hyper- 
detection system (Dako, Carpenteria, CA) (31). Formal in- 
fixed, paraffin-embedded tissue sections (5 pm thick) were 
deparaffinized in xylene and a graded alcohol series. Tissue 
sections and CNB samples were then pretreated with target 
retrieval solution (Dako, SI 700) at 95*C and proteinase K 
(Dako, S3004) at room temperature. Next* the tissues and 
CNB samples were fixed in 03% hydrogen peroxide followed 
by a methyl alcohol series at room temperature. Digoxigenin- 
labeled anti-sense oligonucleotide in mRNA in situ hybri- 
dization solution (Dako. S3304) was placed over the tissues 
and CNB samples. After hybridization at 37"C overnight, 
the slides were washed in stringent wash solution (Dako, 
GenPoint System Kit) at 45"C. The tissues and CNB samples 
were exposed to avidin blocking solution (Dako, X0590) 
at room temperature followed by biotin blocking solution 
(Dako, X0590) at room temperature. The tissues and samples 
were then incubated at room temperature with a sheep 
monoclonal hapten- labeled anti-digoxigenin antibody (Dako. 
p5J04), and the slides were then fixed with biotinyl ryramine 
(Dako, GenPoint System Kit) at room temperature. Finally, 
the Slides were incubated with HRP-conju gated streptavidin 
(Dako, GenPoint System Kit) at room temperature. Since 
33' diaminobenzidine leirahydrochloride (DAB) was used as 
the substrate, a positive reaction was visible as a brown color 
under a light microscope. The sections were weakjy counter- 
Stained wiih 0.1% hematoxylin. 

JHC. IHC was performed by the avidin-biotin complex/ 
horseradish peroxidase method. Tissue sections were stained 
for hTERT with a commercially available monoclonal antibody 
(NCL-L-hTERT; Novocastra, Newcastle upon Tyne. UK). 
Sections were dewaxed in xylene. Antigen retrieval was done 
by incubating sections immersed in 0.01 M citrate buffer at 
pH 6.0 in a minowave oven at 99'C. The sections were 
allowed ro cool down at room temptiatujc. The sections 
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Table L Relationships between mRNA status (negative/ 
positive) by ISH and standard clinical , pathological, and 
biological factors in the 66 tumors. 



No. of patients {%) 



Total 



population bTERT hTERT ' 

(%) negative positive P-value* 



NS 



Bgurt I . Representative results of the TRAP assay. If functional te foments* is 
pr esent, die tn?yme add* DNA lo the substrate in 6-base-paif (bp) increments, 
resulting in a ladder- like distribution of products. The 6- bp ladder signals art 
apparent fen MCF-7 and brrasl cancers (case nos. 6. 14, 23. 44 and 59) and are 
not apparent for lysis buffer as the negative control. An extract of MCF-7 was 
used as » poaibvc control for the TRAP assay and as an Interna) Tclomerase 
Assay Standard (TTAS) positive control for PCR amplifkabon, with lysis 
buffer as the negative control (Lysis). 



were ihen immersed in 1% hydrogen peroxide (H 2 0 2 ) in 
methanol lo block endogenous peroxidase activity. Following 
thai, the sections were washed in TBS (pH 7.6) before being 
incubated in normal rabbit serum for 20 min lo block non- 
specific binding. After draining off the excess serum, the 
sections were incubated with (he primary antibody at room 
temperature. The sections were washed in TBS before being 
incubated with the secondary antibody (biotinylated rabbit 
anti-mouse, Dako). The sections were washed again with 
TBS and incubated with avidin bioiin complex/horseradish 
peroxidase. After washing the sections wiih TBS, peroxidase 
activity was visualized under light microscopy by applying 
DAB ehromogen (Dako). The seciions were counterstained 
wiih hematoxylin, dehydrated in increasing grades of alcohol 
and finally mounted in dibutyl phthalate (DFX) mountanl. 

Homogeneous staining or a speckled/dotted pattern in the 
nucleus was considered positive staining, and absence of 
distinct nuclear staining was taken as negative staining. 
Grading of ihe percentage of stained cells (hTERT labeling 
index) was performed by previously published criteria (1) as 
follows: Grade I, negative staining; Grade 2, I- 109b positive 
slaining nuclei; Grade 3, 11-50% positive nuclei; and Grade 4, 
>50% positive nuclei. Jmmunostained slides for ER and 
PR were scored as previously described (32,33). Jn brief, 
each entire slide was evaluated by light microscopy. First, a 
proportion score ^ns assigned, which represents the estimated 
proponion of positive- staining tumor cells (0, none; I , < 1/100; 
2. I/I0O lo 1710: 3, 1/10 to 1/3; 4, 1/3 10 2/3: and 5, >2/3). 
Next, an intensity score was assigned, which represents the 
average intensity of positive tumor cells (0. none; J, weak, 
2, intermediate; and 3, strong). The proponion and intensity 
scores were then added to obtain a total score, which ranged 
from 0 :o 8. Slides were sco;ed by pathologists who did not 
have knowledge of ligimd- binding Jesuits or patient outcome 





66 




Age 






<50 


26 


6 (23 J) 20(76.9) 


>50 


40 


6(15.0) 34(85.0) 


Hisiopalho- 






logical type 






Scirrhous 


32 


2 (6.4) 30(93.6) 


Papillorubular 


20 


2(10.0) 18(90.0) 


Solid tubular 


6 


1(16.7) 5 (83-3) 


Mucinous 


2 . 


1 (50.0) . 1 (50.0) 


Non-invasive 


4 


2(50.0) 2(50.0) 


PhylJodes 


2 


2 (100) 0 (0) 


Tumor size 






(cm? 






Tl (<2.0) 


20 


0 (0) 20 (100) 


T2 (2.0 5.0) ' 


34 


6(17.6) 28(82.4) 


T3 (>5.0) 


10 


2(20.0) 8(80.0) 



Lymph node 
status* 

pNO 

pNl 

pN2+pN7vl 
Relapse 



ER expression 
+ (^2) 

- (<2) 

PR expression 
+ U2) 

- «2) 



34 6(17.6) 28(82.4) 
26 2 (7.7) 24 (92.3) 
4 0 (0) 4 (100) 



10 0 (0) .10 (100) 

56 10(17.6) 46(82.4) 

42 6(14.3) 36(85.7) 

24 4 (16.7) 20(833) 

42 4 (9.5) 38(90-5) 

24 6(25.0) 18(75.0) 



NS 



NS 



NS 



NS 



NS 



NS 



V test. NS, not significant. b Infonnation b variable for 64 patients. 



Statistical analysis. Differences in p- values were analyzed 
with the x 7 'est for independence, and Fisher's test was, used 
for correlations. In all comparisons. p<0.05 was considered 
significant. 

Results 

Representative results of the TRAP assay arc shown in Fig. 1. 
The cultured cells, which were tested for telomeiase activity 





F.-urt 7 Conebiion be twetn his.olojit diagnosis, human irlomrrase swrx nanscnpiDSt (hTERT) mRNA by ISH. hTERT prolan by IHC. cstrosen 
rtctpio. (EH) by IHC and pio S rsitronc ircrptor (PR) by IHC in brcwi carxm. (d. H4E): (b. hTERT mRNA); <c. hTERT piottin); (d. ER) =»nd (e. PR). 



with the TRAP assay/gave positive results with aJI procedural 
controls (MCF-7 and 5 breast cancer samples) (Fig. I). 

ISH revealed ihat hTERT mRNA was strongly expressed 
in the nuclei and cytoplasm of almost all of the MCF-7 human 
cancer cells (data riot shown). Expression of hTERT mRNA 
was detected in. 56 (88%) of the 64 breast cancer* and in none 
of the phyllodes tumors of ihe breast (Table 1) with the ami- 
. sense probe, whereas no expression was detected with the anti- 
sense probe treated with RNase (data not shown). The levels of 
expression were heterogeneous within the carcinomatous 
regions. Strong expression of hTERT mRNA was not confined 
to the carcinomatous regions but w3s also delected in 
infiltrating lymphocytes (Fie. 2a and b). Higher expression 
levels of both signals of hTERT mRNA were delected in 
some sections containing both carcinomas and lymphocytes, 
compared with the adjacent noncancerous mammary gland, 
but no clear differences in signal intensity were observed 
between carcinomas and lymphocytes. The signals in both 
the normal and cancer tissues were mainly present in the 
lymphocytes. :jn d ,nc signal intensity was similar in both, 
although a precise quantitative comparison of the in jim signals 
was impossible. 



IHC revealed that hTERT protein was strongly expressed 
in the nuclei, nuclear membrane and cytoplasm of almost all 
of the MCF-7 human cancer cells (data noi shown). Expression 
of hTERT protein was delected in 52 (81%) of the 64 breast 
cancers 3nd in none of the phyllode tumors of the breast 
(Table It)- The levels of expression were heterogeneous within 
the carcinomatous regions. As shown in Fig. 2c r strong 
expression of hTERT protein was observed, in nuclei, nuclear 
membrane and cytoplasm, similar to the pattern in MCF-7 
human cancer cells. Normal mammary gland and stromal 
cells generally showed negative immunorcactivity against 
hTERT protein antibody. 

A nuclear signal for the ER (Ftg. 2d), as assessed by IHC. 
was observed in 36 (56%) of the 64 breast cancers and in 
none of the phyllode tumors of the breast, with positive 
scores ranging from 2 to 8 (Tables I and 11). A nuclear signal 
for the PR (Fig. 2e). as assessed by JHC, was observed in 38 
(59%) of the 64 breast canccis and in none of the phyllode 
rumors of the breast, wirh positive scores ranging from 2 io 8 
(Tables 1 and 1!). 

We used ISM and IHC to examine hTERT expression 
in 4 cases of breast cancer before and afler neo adjuvant 



Table II. Relationships between rhRNA status (negative/ 
positive) by IHC and standard clinical, pathological, and 
biological factors in the 66 tumors. 

No. of patients (%) 



Table HI. Relationship of hTERT mRNA and protein 
expression before and after neoadjuvant chemotherapy. 





Total 


hlfcRT 
negative 


kTCDT 

hJhK 1 
positive 


P-vabe 


Total 


66 


14(212) 


52 (78 .8) 




Age 








NS 




26 


10 (38 J) 


16(61.5) 




>50 


40 


4(10.0) 


36 (90,0) 




Histopathological 










type 








NS 


Scirrhous 


M 


I (0.4) 






Papillotubular 


20 


4 (20.0) 


16(80.0) 




^nlifl tttbiilrtr 


6 


4 (66.7) 


2 (333) 




Mucinous . 


2 


0 (0) 


2 (100) 




Non-invasive 


4 . 


2(50.0) 


2(50.0) 




Phyl lodes 


*> 








Tumor si2c(cm)> 








NS 


Tl (<2.0> 


20 


4 (20.0) 


16(80.0) 




T2 (2.0-5.0) 


34 


Attn £\ ■ 
O (I /.o) 






T3 (>5.0) 


10 


2 (20.0) 


o / on a\ 
8 (80.0) 




Lymph node status* 








NS 


pNO 


34 


6(17.6) 


28 (82i4) 




pNI 


26 


6(23.1) 


20 (76.9) 




pN2+pNM 


4 


0 (0) 


4 (100) 




Relapse 








NS 


-t- 


10 


I (10.0) 


9 (90.0) 






56 


13(23.2) 


43 (79.6) 




ER expression 








NS 


+ (2:2) 


42 


6(14J) 


36(85.7) 




- «2) 


24 


8 (333) 


16(66.7) 




PR expression 








NS 




42 


6(14J) 


36 (85.7) 




- (<2) 


24 


8 (33 J) 


16(66.7) 





*X 7 »ci»- NS. not significant. Informal ion available for 64 patients. 



chemotherapy. Before chemotherapy, all 4 of the breast 
carcinomas suongly expressed hTERT by both ISH and 
IHC- After chemotherapy. hTERT expression completely 
disappeared in all 4 cases (Table 111). hTERT expression by 
lymphocytes was detectable by ISM and IHC both before and 
after chemotherapy in all 4 cases, ond the level of expression 
did not appear to be alieied by treatment. 

No correlation was observed between hTERT mRNA 
expression and any of the clinicopaihoioe icat parameters 
age. hisiopatholosicai type, tumor <)7c v lymph node status. 



Case 


Age 


Neoadjuvant , 


hTERT 


mRNA hTERT protein 


Before 


After Before After 


1 


80 


AnastrozoJe 


+ 




2 


78 


AnastrozoJe 


+ 


+ 


3 


35 


FED 




+ 


4 


37 


AO 







"EEC. 5FU (500 rogmr). Epirubicin (70 mg/nr). Cyclophosphamide 
(500 rog/m 2 ). b AC, Doxorubicin (60 mg/m 2 ), Cyclophosphamide 
(500 mg/W). Before, before neoadjuvant chemotherapy. After, 
after neoadjuvant chemotherapy. + . positive;. -> negative. 



relapses, and the expression of ER and PR. Similarly, there 
was no correlation between hTERT protein expression and 
any of these clinicopathological parameters. There was a 
correlation between hTERT mRNA expression and hTERT 
protein expression in breast cancers (p<0.005). 

. Discussion 

This study reports a comparison of hTERT mRNA expression 
by ISH and hTERT protein expression by IHC in tissue 
sections from breast tumors. hTERT mRNA was detected 
by ISH in 56 of the 64 breast cancers and in MCF-7 human 
breast cancer cells. Breast cancer cell nuclei stained strongly 
positive with the specific anti-sense probe but not with the 
anti-sense probe treated with RNase (data not shown). Tissue 
lymphocytes also stained positively with the ami- sense probe, 
but the stromal cells did not. Expression of hTERT protein 
was observed by IHC in 52 of the 64 breast cancers. hTERT 
mRNA and protein expressions were highly correlated in 
bieast cancers (p<0.005). Detection of the hTERT protein by 
IHC has permitted further analysis of carcinogenesis and 
cancer diagnosis (34). 

In recent years, there has been disagreement over the 
suitability of lelomerase activity as a prognostic biologic 
marker in breast cancer that may help to differentiate patients 
for individually based risk- related therapy. Hiyama et at (15). 
in a study of 140 breast cancer specimens with the TRAP 
assay, found a strong association between lelomerase activity 
and stage classification and observed lelomerase activity 
in 68% of stage 1 tumors and 95% of stage IV tumors. 
Poremba et at ( 1), using tissue micTOanays. found a statistically 
significant correlation between tumor -specific survival (overall 
survival) and hTERT expression in breast cancer. However, 
some problems in interpretation may affect this apparent 
consensus. First, some samples of breast cancer tissue may 
be extensively contaminated by infiltrating lymphocytes 
during operative manipulations, especially in advanced disease, 
causing o^ercstimstion cf lelomerase activity and/or hTERT 
expression. In our picviou.% repons. higher expression levels 



of signals for both hTERT mRNA and protein were detected 
in some sections containing borh carcinoma and lymphocytes 
in thyroid and colorectal cancers (13,24), Secondly, Poremba 
et al (!) used polyclonal antibodies against hTERT protein as 
a signal for expression. In our bands, polyclonal antibodies 
against hTERT protein give rise to strong background signals 
and are not clearly specific for measuring expression in cancer 
tissues. We have carefully compared the reactivity against 
hTERT protein of the monoclonal antibody used in the 
present study with that of some polyclonal antibodies. Use of 
the monoclonal antibody in IHC allowed clear demonstration 
of hTERT protein expression, with results similar to those 
of 1SH for hTERT mRNA expression. Furthermore, IHC is 
technically much easier to perform than 1SH, since con- 
tamination of samples by RNase is not an issue in )HC. 

To the best of pur knowledge, this report is the first on the 
study of hTERT expression in breast cancer as a function of 
neoadjuvant treatment. We examined hTERT expression 
in 4 cases of breast cancer before and after chemotherapy. 
Before chemotherapy, hTERT was sirongly expressed in 
all 4 carcinomas, but after chemotherapy hTERT expression 
had completely disappeared in all 4 cases, hTERT expression 
by. lymphocyte's was delectable by 1SH and THC both before 
and after chemotherapy in all 4 cases, and the level of 
expression did not appear to be altered by treatment. 

In conclusion, determination of hTERT mRNA expression 
by 1SH and hTERT protein expression by IHC can be used to 
obtain information contributing to a histopathological diagnosis 
during screening of breast cancers. By use of a monoclonal 
antibody, we could very clearly and sensitively demonstrate 
hTERT protein expression In breast cancer tissues but not in 
noncancerous tissues. We also demonstrated that 4 carcinomas 
wiih originally positive immunoreactiviiy against hTERT 
protein became negative afier neo adjuvant chemotherapy. 
These results suggest Inal determination of hTERT pjoiem 
by 1CH can be used as a poteniial diagnostic and therapeutic 
marker lo distinguish breast cancers. 
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Expression of the ubiquitui-proteasorae pathway and muscle loss in 
experimental caucer cachexia. 

KkajJ, WrfceSM, RusseU ST, ffiaeAV, Tisdalc MJ . 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham, UK. 

Muscle protein degradation is thought to play a major role in muscle atrophy in cancer 

cachexia. To "vcshgate the importance of the ubiquitin-proteasome pathway, which has 
been sugg es ted to be the ajn degradative -^.^ P £ £J? 

cachexia, the express.on of mRNA for proteasome subunits C2 and C5 as well as the 
ub, q u,t,n-conjug a ting enzyme, E2(I4k) ) has been determined in gastrocnemius and 
pectoral muscles of m.ce bearing the MAC 16 adenocarcinoma, using competitive 
.quantitative reverse transcriptase polymerase chain reaction. Protein levels of nroteasome 
subumts and E2(14k) were determined by immunoblotting, to ensure £S£T 

wSh?f f m m Pr ° tein eXprCSSi ° n - MuSC,e correlated iinearHdr 

wciglit loss dunng the course of the study. There was a good correlation between 

expressxon of C2 and E2( 14k) mRNA and protein levels^ gastrocnemius ™5e with 

foSr H .° f 6 t f0W f ° r 02 tW °- f0ld f ° r E2 ( 14k > betwee « 12 20% weStToS 
followed by a decrease m expression at weight losses of 25-27%, although loss ff muscle 
protem contmued. In contrast, expression of C5 mRNA only increased twoS and wa 

t™vZ™i at We,gh ' l0SSeS betWCen 7 - 5 «« 27o/o: Both proteasome funeuoTa 
act, vity and proteasome-spec.fic tyrosine release as a measure of total protein 
£egradat,on was also maximal at 1 8-20% weight loss and decreased at higher weight loss 

S^aSSr EST • ,T c,e foUowed 3 different pattcm 

musde ?nl m ^ mRNA 0nJy b6mg SeC " 3t WCi ^ l0SSCS above 1 7%, although 

Z lr<- mC n, pragreSS1Ve,y With inCrCasin 8 ^ loss - These ™* W 
that act.vat.on of the ub.quifn-proteasome pathway p.ays a major role in protein L in 

P rlTZZT '? 10 200/0 WCight [0SS ' bUt ** ° UlCr f3Ct0rS SU <* * in 
protein synthesis may play a more important role at higher weight loss. 

PMID: 16160695 [PubMed - in process] 
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Increased expressiou of proteasome subunits in skeletal muscle of cancer 
patients with weight loss. 

KhalJ, fling A V, Fearon KC . Dcjong CH . Ttsdale MJ . 

Pharmaceutical Sciences Research Institute, Aston University, Birmingham B4 7ET, UK.. 

Atrophy of skeletal muscle is common in patients with cancer and results in increased 
morbidity and mortality. In order to design effective therapy the mechanism by which 
this occurs needs to be elucidated. Most studies suggest that the ubiquitin-proteasome 
proteolytic pathway is most important in intracellular proteolysis, although there have 
been no reports on the activity of this pathway in patients with different extents of weight 
loss. In this report the expression of the ubiquitin-proteasome pathway in rectus 
abdominis muscle has been determined in cancer patients with weight loss of 0-34% 
using a competitive reverse transcriptase polymerase chain reaction to measure 
expression of mRNA for proteasome subunits C2 and C5, while protein expression has 
been determined by western blotting. Overall, both C2 and C5 gene expression was 
.ncreased by about three-fold in skeletal muscle of cachectic cancer patients (average 
weightloss 14.5+/-2.5%), compared with that in patients without weight loss with or 
without cancer. The level of gene expression was dependent on the amount of weight 
°o SS . o"/ Cr ^ SU ? g max,mall y for both Proteasome subunits in patients with weight loss of 
i/-iy /«. t-urther increases in weight loss reduced expression of mRNA for both 
proteasome subunits, although it was still elevated in comparison with patients with no 

r ig m S^ 6 * WaS n ° CVidenCe f ° r 3,1 inCreas * "» oppression at weight losses less 
than 10/.. There was a good correlation between expression of proteasome 20Salpha 
suburuts, detected by western blotting, and C2 and C5 mRNA, showing that increased 
gene expression resulted in increased protein synthesis. Expression of the ubiquitin 
conjugating enzyme, E2(14k), with weight loss followed a similar pattern to that of 
proteasome subunits. These results suggest variations in the expression of key 
components of the ubiquitin-proteasome pathway with weight loss of cancer patients and 
suggest that another mechanism of protein degradation must be operative for patients 
with weight loss less than 10%. 
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Cell type-specific occurrence of cavcolin-lalpha aad -lbeta in the lung 
caused by expression of distinct iuRNAs. 

KogoH, Alba T, Fujimoto T . 

Department of Anatomy and Molecular Cell Biology, Nagoya University Graduate 
School of Medicine, Showa-ku, Nagoya 466-8550, Japan, likogo@fujita-hu.ac.jp 

Two isoforms of caveolin-l, alpha and beta, had been thought to be generated by 
alternate translation initiation of an mRNA (FL mRNA), but we showed previously that 
a variant mRNA (5 V mRNA) encodes the beta isoform specifically. In the present study 
we demonstrated strong correlation between the expression of the caveolin- 1 protein ' 
isoforms and mRNA variants in culture cells and the developing mouse lung The alpha 
isoform prote,n and FL mRNA were expressed constantly during the lung development 
whereas^expression of the beta isoform protein and 5'V mRNA was negligible in the fetal 
lung before 17.5 days post coitum, and markedly increased simultaneously at 18 5 days 
post coitum, when.the alveolar type I cells started to differentiate. Immunohistochemical 
analysis revealed the cell type-specific expression of the two isoforms; the alveolar type I 
cell expresses the beta isoform predominantly, while the endothelium harbors die alpha 
isoform chiefly. The mutually exclusive expression of caveolin-l isoforms was verified 
by Western blotting of the selective plasma membrane preparation obtained from the 
endothelial and alveolar epithelial cells. The present result indicates that the two 
caveolin-l isoforms are generated from distinct mRNAs in vivo and that their production 
is regulated independently at the transcriptional level. The result also suggests that the 
alpha and beta isoforms of caveolin-l may have unique physiological functions. 

PMID: 15067006 (PubMed - indexed for MEDLINE] 
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Oncogene and growth factor expression in ovarian cancer. 

Kommoss F , BaukiiecbtT, Birmelin G . KohlerM , tesch H , POcidcrer A . 

Department of Gynaecology, Albert-Lud wig University, Freiburg, Germany. 

The varying tumor-biological behavior of ovarian carcinomas probably influences both 
Uieir operabdity and response to chemotherapy, which are the most relevant prognostic 
factors. The phenotype of different ovarian carcinomas is obviously associated with an 
actuation of Ihe EGF/TGF-alpha s.gnai pathway, including c-myc and c-jun expression. 
Analysis ot EGF-R, TGF-alpha, c-myc and c-jun expression in 33 stage IiI/IV and 2 
stage I/II ovarian carcinomas with biochemical, molecular-chemical and 
immunohjstochemical methods showed a correlation between the mRNA and protein 
levels of EGF-R and TGF-alpha for tumors with low or high expressing rates. However 
the concentration of measurable free EGF-Rs seems to depend on the amount of TGF- ' 
alpha expression by the tumors. The EGF-R binding ligand TGF-alpha is produced by 
epithel.al rumor cells; stromal cells are usually TGF-alpha-negative, as shown by 
immunohistochemistry. High expression rates of EGF-R. TGF-alpha and c-myc were 
detected m 6, 7, and 10 out of 35 ovarian carcinomas, respectively. C-jun mRNA was 
detected .n 18/19 cases studied. Non-maiignant tissues originating from myometrium or 
ovary expressed no (or only small amounts of) EGF-R or TGF-alpha mRNA, whereas a 
high c-myc expression was found in 1/7 normal myometria, and in 2/5 normal ovaries 
1 here was no strong correlation between EGF-R/TGF-alpha and c-myc/c-iun 
exprcssion.(ABSTRACT TRUNCATED AT 250 WORDS) 

PMiD: 1 502888 [PubMed - indexed for MEDLINE] 
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Somatostatin receptors in primary human breast cancer: quantitative 
analysis of mRNA for subtypes 1-5 and correlation with receptor protein 
expression and tumor pathology. 

KuinarU, Grigorakis SI, Watt_HL, SasjR, SneHL, Watson P, Chaudhari S . 

Fraser Laboratories For Diabetes Research, Department of Medicine, McGill University 
Royal Victoria Hospital, 687 Pine Avenue West, H3A 1A1 Montreal, Quebec, Canada ' 
ujendra.lotrnar@muhc.mcgill.ca 

Somatostatin receptors (SSTRs) have been identified in most hormone-producing tumors 
as well as in breast cancer. In the present study, we determined SSTR1-5 expression in 
primary ductal NOS breast tumors through semi-quantitative RT-PCR. and 
immunocytochemistry. The results from the analysis of 98 samples were correlated with 
several key histological markers and receptor expression. All five SSTR subtypes are 
variably expressed at the mRNA level in breast tumors with 91% of samples showing 
SSTR1, 98% SSTR2, 96% SSTR3, 76% SSTR4, and 54% SSTR5. SSTR1-5 are 
localized to both tumor cells and the surrounding peritumoral regions as detected by 
irnmunocytochemistry. Levels of SSTR mRNA when corrected for bera-actin levels 
were highest for SSTR3 followed by SSTR1, SSTR2, SSTR5, and SSTR4. Furthermore, 
there was good correlation between mRNA and protein expression with 84% for SSTRl' 
79% for SSTR2, 89% for SSTR3, 68% for SSTR4, .68% for SSTR5, and 78% for all five 
receptors. SSTRl, 2 and 4 were correlated with ER levels whereas SSTR2 showed an 
additional correlation with PR levels. These correlations were independent of patient age 
and histological grade. Moreover, using immunocytochemistry, blood vessels exhibited 
receptor-specific localization for SSTR2 and SSTR5. Our results indicate significant 
correlations between mRNA and protein expression along with receptor-specific 
correlations with histological markers as well as ER and PR levels: Differential 
distribution of SSTR subtypes in tumors and receptor-specific expression in vascular 
structures may be considered as a novel diagnosis for breast tumors with receptor subtype 
agonists. 
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A transcriptomic and protcomic analysis of the effect of CpG-ODN on human 
THP-1 monocytic leukemia cells. 

KhoCC, KuoCW, Ljang:CM, LiangSM. 

Institute of BioAgricultural Sciences, Academia Sinica, Taipei, Taiwan. 

The CpG motif of bacterial DNA (CpG-DNA) is a potent immunostimulating agent whose 
mechanism of action is not yet clear. Here, we used both DNA microarray and proteomic 
approaches to investigate the effects of oligodeoxynucleotides containing the CpG motif (CpG- 
ODN) on gene transcription and protein expression profiles of CpG-ODN responsive THP-1 cells 
:Microarray analysis revealed that 2 h stimulation with CpG-ODN up-regulated 50 genes and 
down-regulated five genes. These genes were identified as being associated with inflammation 
antimicrobial defense, transcriptional regulation, signal transduction, tumor progression cell ' 
differentiation, proteolysis and metabolism. Longer stimulation (8 h) with CpG-ODN enhanced 
transcriptional expression of 58 genes. Among these 58 genes, none except one, namely WNTI 
inducible signaling pathway protein 2, was the same as those induced after 2 h stimulation 
Proteomic analysis by two-dimensional gel electrophoresis, followed by mass spectrometry 
identified several proteins up-regulated by CpG-ODN. These proteins included heat shock 
proteins, modulators of inflammation, metabolic proteins and energy pathway proteins 
Comparison of microarray and proteomic expression profiles showed poor correlation Use of 
more reliable and sensitive analyses, such as reverse transcriptase polymerase chain reaction 
Western blotting and functional assays, on several genes and proteins, nonetheless, confirmed that 
mere is indeed good correlation between mRNA and protein expression after CpG-ODN 
treatment. This study also revealed diat several anti-apoptotic and neuroprotective related 
proteins not prev.ously reported, are activated by CpG-DNA. These findings have extended our 
knowledge on the activation of cells by CpG-DNA and may contribute to further understanding of 
mechanisms that link innate immunity with acquired immune response®. 

PMID: 1 5693060 [PubMed - indexed for MEDLfNE] 
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1 Introduction 

Mammals protect themselves against pathogen infection pri- 
marily wii innate and adaptive immunity [}). The innate 
imm lin e system relies on a set of pattern recognition receptors 
{e.g.. Toll-like receptors) to recognize foreign molecular struc- 
tures such as lipopolysaccharide (LPS) and bacterial DNA \2, 3 J. 
I nnate immune cells recognize these molecular structures rind 
initiate not only irmate but also adaptive immunity by produc- 
ing immunomodulatory cytokines and activating T and B 
immune cells [1]. Bacterial DNA can directly activate B cells to 
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proliferate and secrete immunoglobulins m a T cell-independ- 
ent manner [4— 6J_ It also induces B cells and monocytes to 
activate transcription factor NF-kB and secrete cytokines, 
including interleukin (IL) 12, rumor necrosis factor ce (TNF-a), 
and interferon a/p p-10]. The immuriostimutatory activity of 
bacterial DNA has been assigned tounmethylated CpG motifs 
(GACGTT for murine, GTCGTT for human) [1 1 J. Recent evi- 
dence shows that synthetic oligodeoxyirucleotides containing a 
CpG motif (CpG-ODN), like bacterial DNA with the CpG moi- 
ety (CpG- DNA), induce potent Thl-like immune responses 
that are protective against several infectious agents and 
immune disorders in animal models [1 2, 1 3). Biologically a cove 
CpG-ODN, like bacteria DNA, activates 'macrophages and 
immature dendritic cells to increase expression of MHC class 
11 and costimulatory molecules, thereby transcribing cytokine 
mRNAs, and producing pro-inflammatory cytokines including 
TNFcUL-1, 1X-6 and 1L-12 (9,-14-16). CpG-ODN can therefore 
serve as an adjuvant and irnmunomodulatox in vaccines 
against a wide variety of targets, including infectious agents, 
cancer antigens and allergens (17). 

It has been suggested that unmethylated CpG-DNA-medi- 
ated immune activation functions through a toll-like receptor 9 
(TLR9) signaling pathway |18J. Endocytosis and sequentially 
endosomal maturation as well as bin ding of heat shock protein 
(HSP) 90 to CpG-DNA are essential for induction of TLR9 sig- 
nal transduction |19, 20). It has also been shown that recogni- 
tion of CpG-DNA causes TLR to form a dirrier, which recruits 
the adaptor molecule, myeloid differentiation factor 88 
(MyD88), through interaction between their C-temiina) Toll /I t- 
1R domains. This recruitment of MyD88 to the ToH/lL-lR do- 
main of TLR9 initiates a signaling pathway that sequentially 
'involves IL-lR-associated kinase 1 and TNF-a receptor-asso- 
ciated factor 6 11 8, 21, 22]. Studies using gene-deficient mice 
and RAW2647 cells transiently transfected with dominant- 
negative forms of these molecules have indicated that the 
MyD88-mediated signaling pathway is essential for CpG-DNA- 
induced activation of NF-kB and c-Jun NHj-terminal kinase 
(JNK), as well as subsequent production of cytokines in mono- 
cytic cells 1 18, 21 f 22). The precise mechanism of action of CpG- 
DNA and CpG-ODN, nonetheless, is still not thoroughly 
understood- To further elucidate the molecular events after 
binding of CpG-ODN to TLR9, in this study, we treated CpG- 
ODN responsive THP-1 cells with CpG-ODN and evaluated 
changes by using DNA miaoarray and proteomic approaches. 
We have discovered up-regulation of more than 50 disting- 
uished genes/proteins and identified induction of several anti- 
apopfotic and neuroprotecting genes by CpG-ODN treatment. 

2 Materials and methods 
2.1 Reagents 

Phosphorothioate- modified CpG-ODN and GpC-ODN were 
synthesized by MDBio (Taipei. Taiwan). Human specific 
ODN sequences are: CpG-ODN, 5-TCG TCG TTTTGT CGT 
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TTT GTC GTT-3'; GpC-ODN, 5*-TGC TGC TTT TGT GCT 
TFT GTG CTT-3\ The mouse specific CpG -ODN sequence is 
5*-TCC ATG ACG TTC CTG ATG CT -3*. CHCA was from 
. Sigma (St Louis, MO, USA). 

2.2 Cell culture 

Cell lines were obtained from the American Type Culture 
Collection (Rockville, MD). Mouse RAW264.7 macrophage 
and human embryonic kidney 293 cells (HEK293) were cul- 
tured in DM EM supplemented with 10% heat inactivated 
fetal bovine serum, 100 U/mL penicillin, 100 jig/xnL strep- 
tomycin sulfate, 200mmo]/L I-glutamine, and 50 um (3-mer- 
captoethanol in a humidified atmosphere of 5% C0 2 at 37°C. 
The medium was changed every 2 days for all experiments. 
Human THP-1 monocytic leukemia cells, which have been 
shown to express TLR9 and respond to CpG-DNA stimula- 
tion [23, 24); were cultured in RPM 1 1640 with the same sup- 
plements as for RAW264.7 cell cultures. 

2.3 Human cDNA microarray 

Total RNAs extracted from cultured THP-1 ceDs were isolated 
with TRIzol (lnvitrogen, Leek, The Netherlands) and submitted 
to Genasia Biotechnology (Taipei. Taiwan) for further proces- 
sing. In brie£ 4 pg of total RNA from CpG-ODN stimulated, or 
normal THP-1 ceDs was labeled with a fluorescence marker 
(U-vision, (Taipei, Taiwan)). Different colored fluorescence dyes 
(Cy5 and Cy3) were used to distinguish total RNA from normal 
and ODN stimulated cells. The labeled RNA was used for 
hybridization with the Human 1 cDNA microchip from Agilent 
Technologies (Palo Alto, CA, U SA). The chips were scanned and 
the expression pattern was analyzed using genechip software. 
Genes showing up-regulation or down-regulation of RNA levels 
were analyzed and identified on a genomic database as sug- 
gested by the manufacturer of the microchip. 

2.4 Protein preparation 

THP-1 cells were seeded in a 175 cm 2 tissue culture flask at a 
density of 10 6 cells per milliliter in culture medium. The cells 
were stimulated with or without 1.5 um CpG-ODN at defined 
times and harvested by centrifugation at 4°C, 1000 x g for 
15 mm. CeE pellets were washed twice with ice-cold PBS, 
resuspended and sonicated in extraction buffer containing 
25 mM Tris-HQ {pH 7.5), 2mM p-mercaptoethanol and pro- 
lease inhibitor cocktail. After centrifugation at 30000 x g for 
20min, ammonium sulfate was added to the supernatant unul 
the final concentration reached 50% saturation w/v. The solu- 
tion was stirred at 4°C for 30 min and centrifuged at 30000 x g 
for 30 min at 4°C The supernatant fraction was then trans- 
fen ed into a fresh rube, and the precipitated protein pellet 
solubilized in extraction buffer. To remove salts and oilier con- 
taminants, the extracts were treated with a pre-cooled (— 20°C) 
solution of 10% TCA in acetone with 0.07% p mercaptoetha- 
nol. Proteins were allowed to precipitate overnight at -20*C. 

www. proteomics- journal, tie 



896 



C.-C. Kuo etol 



After centjifngation, the pellet was washed with ice-cold ace- 
tone, containing 0.07% f*-mercaptoethanol. The supernatant 
was discarded and the peDet dried in a SpeedVac system (Model 
AE SI 010; Savant, Holbrook, NY, USA). 

2.5 2-DE 

2-DE was performed using an lPGphore IEF and a Hofer 
DA1T vertical unit (Amersham Biosciences, Piscataway* NJ, 
USA). One milligram of dried protein sample was dissolved 
in 350 uL of rehydration buffer solution, containing 7 M urea, 
2 m thiourea, 4% w/v CHAPS, 5 mM tributyl pbosphine, and 
2% IPG and loaded onto an irnrnobilized pH 3-10 linear 
gradient strip (18 cm), followed by rehydration for 16 h. IEF 
was then performed in the following manner: 100 V for 
iOmin, 250 V for 30rnin, 500 V for 30mm, 1000 V for 
30 min, 4000 V for 30 min, 6000 V for 55 000 Vh. At the end 
of IEF, the IPG strips were equilibrated for 1 5 min in butter 
containing 6 m urea, 2% w/v SPS, 30%v/v glycerol, and 
50 mw Tris, pH 6.8, then reduced with 65 mM dithiorery- 
thritol (DTE) and subsequently alkylated with 135 mM 
iodoacetamide for another 15 min. After equilibration, the 
IPG strips were immediately placed on top of a 12% SDS- 
PAGE (1.5mm, 20 x 24cm). The second dimension gels 
were then overlaid with molten 0.8% agarose solution in 
SDS electrophoresis buffer. Electrophoresis was performed 
at 16'C, starting at 10 mA per gel for 1 h, followed by 45 mA 
per gel until the dye front reached the bottom of the gels. 

2.6 Staining and image acquisition 

Immediately after electrophoresis, gels were stained with 
SYPRO Ruby (Molecular Probes, Eugene, OR, USA). In 
brief, gels were fixed for 30 min in 10% methanol, 7% acetic 
acid, and then stained overnight in SYPRO Ruby stain. The 
staining solution was removed and gels were washed in 30% 
methanol and 7% acetic add for 3 h. After staining, image 
acquisition was carried out on a Typhoon 9200 (Amersham 
Biosciences). To identify a protein, spot detection, quantifi- 
cation and matching of 2-D Jesuits were analyzed using 
ImageMaster software (Amersham Biosciences). The M, of 
the proteins weje calibrated according to the LMW-SDS 
Maiker Kit (Amersham Biosciences), and their pi values 
were estimated from the position of the protein spots on the 
2-D ge? and confirmed with the information supplied by the 
manufacturer. Since most of the pi values for the truncated 
proteins had not been reported previously, the pi values of 
the truncated proteins weie estimated from the position of 
the observed spots. To omit the variation due to tbe use of 
separate gels, after background subtraction, die intensity 
levds of protein spots on each gel were normalized as a pro- 
portion of one reference spot, and protein quantities were 
calculated by integrating the density over the spot area. Pro- 
tein spots that showed reproducible modulation exceeding 
-80% after CpG-ODN treatment in three experiments were 
further analyzed by MS. 
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2.7 In-gei digestion with trypsin arid extraction of 
peptides 

Protein spots were excised from stained gels and cut into 
pieces. In brief, gel spots were dehydrate-d with ACN for 
10 min and dried in a vacuum cenrjifuge. Gel pieces were 
reswelled with 55 mM DTE in 25 mM ammonium bicarbon- 
ate (pH 8.5) at 37°C for 1 h. The solution was then exchanged 
with alkylatjon solution, which contained lOOmM iodoaceta- 
mide in 25 mM ammonium bicarbonate (pH 8.5), at room 
temperature for 1 h. After- alleviation, the gel pieces were 
washed twice with 50% ACN in 25 m M ammonium bicarbo- 
nate (pH 8.5) for 15 min. The wash solution was discarded 
and the pieces of gel were dehydrated with ACN for lOmin 
and dried in a vacuum centrifuge. Tryptic digestion was 
initiated by reswelh'ng the gel in 25 mM ammonium bicar- 
bonate solution with 25 ng of trypsin (Promega, Madison, 
\VI, USA). After incubation at 37°C for 16h, tryptic peptides' 
were extracted twice.with 50% ACN containing 5% formic 
acid for 15 min with moderate sonication. The extracted 
solutions were pooled and evaporated to dryness in a vacuum 
centrifuge. The dried peptide mixture was dissolved in 0.1% 
formic acid and used for MS. 

2.8 MALDI-Q-TOF MS and protein identification 

Tryptic peptides analyses were performed using a Micromass 
Q-TOF Ultima MALD1 {Micromass, Wythenshawe, U.K.) 
equipped with a 337 nm nitrogen laser and operated in 
reflectron positive ion mode. Peptide mixtures (1 uL) were 
premixed with 1 uL of the matrix (5mg CHCAin 50% ACN 
with 0.1% TFA) then spotted onto the MALD1 target plate. 
Mass spectra were acquired for the mass range of 900- 
3500 Da and the individual spectra from MAID) MS or MS/ 
MS were processed using the Micromass Massiynx 4.0 soft- 
ware. The generated peak list files were used to query the 
Swiss-Prot database using the MASCOT program (http:// 
www.matrixscience.com) with the following parameters: 
peptide mass tolerance, 50ppm; MS/MS ion mass tolerance, 
0.25 Da; allowance of missed cleavage, }; and consideration 
for variable modifications such as oxidation of methionine 
and carboxyamidomethylation of cysteines. Only significant 
hits as defined by MASCOT probability analysis were con- 
sidered initially. In addition, when the PMF matches were 
between 5 and 9, at least one peptide sequence was manually 
checked by MA1DI MS/MS analysis. 

2.9 RT-PCR analysis 

cDNA from THP-1 cells was produced with Superscript IJ 
reverse transcriptase (lnvitiogen) using a o!igo(dT) )5 primer 
for 1 h at 42°C PCR of cDNA was performed using specific 
primers for the gene of intejest and control p-actin. AH PCR 
products were electrophor esed on a J. 5% agarose gel, and 
DNA bands were visualized by staining the gel with ethi- 
dium biomide. 
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2-10 Immunoblotting 



3 Results 



Human THP-1 or mouse macrophage RAW264.7 cells 
(5 x 3 x lO^/well) were cultured in a six-well culture 
plate and treated with or without 1.5 um CpG-ODN for 
the . designated times. After stimulation, cells were harvest- 
ed by centrifugation at 1000 x'g fot lSrnin in. a refri-. 
gerated centrifuge and washed twice with cold PBS buffer. 
The . cells were jysed on ice for 15 min with 300 uL lysis 
bufTex (Pierce, Rockford, USA), supplemented with pro- 
tease inhibitor cocktail (Sigma). The fysates were cen- 
trifuged at 12000 x g for 15rnin at 4 D C, and protein 
concentrations of supernatant were determined using the 
Bio-Rad Protein Assay (Hercules, CA> USA). The lysates 
(50 ug of protein/lane) were subjected to 12% SDS-PAGE 
and transferred to NC membranes (Amersham Bio- 
sciences). The membranes were blocked in PBS-O.1% 
Tween 20 (PBST) containing 5% non-fat skim milk at 
room temperature for 1 b, followed by staining with anti- 
ADP-ribosylation factor 3 (ARF-3) monoclonal antibody. 
(0.1 ug/mL; Sigma). The membranes were then incubated 
with horseradish peroxidase-conju gated secondary anti- 
body (dilution, 1:3000) for 1 h. After, washing three times 
with PBST, specific bands were detected by chemilumi- 
nescence according to the manufacturer's protocol 
(Amersham Biosciences). - 

2.1 1 Cell transfection and lucif erase assay 

HEK293 cells (5 x 3 x 10 6 /weU) were transfected using 
FuGENE 6 (Roche Molecular Biochemicals, Indianapolis, 
IN, USA) plus 0.1 ug p5xNF-x:B-luc (Stratagene, La Jolla. CA, 
USA), 0.05 ug pCDNA3.1 p-galactosidase, and pCDNA3:l- 
hTLR9 overnight The cells were incubated with or without 
1.5 um CpG-ODN for 8h and then lysed. NF-kB luciferase 
activity assays were performed as recommended by the 
manufacturer (Promega). 0-gaIactosidase activity was used to 
normalize the data. 

2.12 Enzyme activity assay 

Pyruvate kinase activity was assayed in a solution (1 mL) 
containing lOOmM Tris-HCl (pH 8.0), lOOmw KCI, 10mM 
Mga 2 , 0.2mM NADH, lOmw PEP, 1.5 mM ADP, lunit of 
lactate dehydrogenase, and an appropriate amount of cell 
lysate from CpG-ODN untreated or treated THP-1 cells. The 
reaction was monitored at 30°C for a period of time by 
measuring the decrease in absorbance at 340 nm. PGK ac- 
tivity was assayed in a coupled reaction with glyceraJdehyde 
3 phosphate dehydrogenase (GAPDH) as described by Lee 
|25]. In brief, the assay was performed at 30°C in a total vol- 
ume of 1 mL containing 100mM Tris-HCl (pH 7.9) , IOidm 
MgCI 2 . 0.15 mM NADH, 2 rnM ATP, 6mM 3-phosphoglyce- 
rate, OA mg/mL BSA, 50 mg of GAPDH, and an appropriate 
amount of eel) lysate. NADH consumption was monitored 3t 
340 nm. 



3.1 Effect of CpG-ODN on gene expression profiles of 
human THP-1 cells 

To elucidate the effect of CpG-ODN on gene expression, 
THP-1, a cell line known to express TLR 9 and respond to 
CpG-DNA (23, 24) was cultured with or without CpG- 
ODN. Since preliminary experiments showed that 1.5 um 
CpG-ODN caused more contrasting results between nor- 
mal and CpG-ODN treated cells, 1.5 um CpG-ODN was 
used throughout the experiments. To distinguish CpG- 
ODN treated samples from the control, total cellular RNA 
of normal and CpG-ODN treated cells was isolated and 
labeled with the fluorescence dyes, Cy5 and Cy3, respec- 
tively. The labeled RNA was then used for hybridization 
with a Human 1 cDNA microchip from Agilent Technol- 
ogies. Of the 13 000 human , genes- represented on the 
gene array, a total of .55 genes changed expression sign- 
ificantly after 2h CpG-ODN treatment. Among these, 
50 genes were up-regulated while five genes were down- 
regulated by a factor > 2. These genes were sorted by 
functions and are listed in Tables 1 and 2. They included 
notably, 1L-18 receptor accessory protein, MSG A beta 
gene, thioredoxin, pro-pol-dUTPase polyprotein, Spl40, 
connexin 59 gene, Grb2-like 2, enoyl-coenzyme A hydra- 
tase, propionyl coenzyme A carboxylase, cytochrome P450, 
and WNTl inducible signaling pathway protein 2 (WISP- 
2) etc. The function of these genes are known to be relat- 
ed to inflammatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduc- 
tion, tumor progression, cell differentiation, proteolysis 
etc. . 

Table 1. Genes up-regulated (> 2 fold) in human THP-1 cells after 
2 h of CpG-ODN treatment 



Gene name and description 
(changed fold > 2) 



Genebonk 
number 



Inflammation and receptor 

IL-18 receptor accessory protein AF077346 

T cell receptor V beta gene X58806 

MGSA, beta gene U03019 

Antigen gene (PA) M21896 

TIED NM_004791 

Platelet activating receptor AF002986 

Antimicrobial defence 

Thioredoxin NM_003329 

Pro-Pol-dUTPase polyprotein AC004748 

Nuclear body protein Spl40 U63420 

Transcriptional regulation 

Putative transcription factor LUZP AI98627 1 
General transcription factor II, 1, pseudogene 1 AI700706 

Connexin 59 gene L29277 

Basic helix-loop-belix protein class B 1 AF221520 
(BHLHB1) 
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Table 1. Continued 



Gene name and description 
(changed fold> 2) 



Genebank 
number 



Table 2. Genes dowmregulated (> 2 fold) in human THP : 1 cells 
after 2 h of CpG-ODN treatment . 



Regulatory protein 

Advillin AF041449 

i 

Channel and transport . 

Small GTP binding protein Rab9 U44103 
FXYD domain-containing ion transport AH 25364 

regulator 1 

Signal transduction 

SH3-d6main Grb2-like 2 AF036268 

■ ™ n X90568 

Titin associated protein (165 kD protein) X69089 

KIAA1451 protein AB040884 

Vasoactive intestinal peptide receptor Ul 1087 

Enzyme and protease 

Enoyl-Coen^yme A, hydratase AI800553 

Nephrin, B-type metallocarboxypeptidose U65090 

Propiony) Coenzyme A carboxylase AB01 1 145 

Acyloxyacyl hydrolase (neutrophil) M62840 

Cytochrome P450 U79716 

Intestinal alkaline phosphatase M31008 

Tumor progression and cell differentiation 
. Retinoblastoma! L1 1910 

Human genomic DNA of 9q32 anti-oncogene AB036268 

of flat Epithermm cancer, 

segment 6/10 

WNT1 inducible signaling pathway protein 2 AF 100780 
(WISP-2) 

Structure protein 

Collagen, type IV, alpha 6 D21337 
Beta Myosin heavy chain M58018 

Other 

Homo sapiens Cri-du-chot region mRNA, 

clone N1BB11 
Human mRNA for larninin alpha 5 chain, 

partial cds. 
NIK like and Thyroxin-binding 

globulin precursor 
Hypothetical protein DKF2p434M0331 
Hypothetical protein FLJ11021 similar 

to splicing factor 
Hypothetical protein 
Chromosome 18 open reading frame 1 
Arfaptin 1 

Zinc finger protein 8 (2FP8) 
Zinc finger protein 137 (clone pHZ-30) 
Olfactomedin related ER localized protein 
Cyclin-dependent kinase 8 
Integrin, alpha 1 
KIAA0421 protein 
KIAA1233 protein 
Unnamed protein product 
NEJL4 

BC33119t_l 



U52827 

AB010099 

Z83850 

AL 1377 20 
AK023985 

AL049851 
NM_ 004338 
AW408785 
M29581 
U09414 
AI738468 
BE 467537 
D87462 
AB007881 
AB033059 
AK026362 
X75535 
AAD39268 



Gene name and description 
(changed fold > 2) 



Zinc-finger homeodomain protein 4 . 
Human protein kinase MEKK2b mRNA, 

complete cds. . 
Glyplcari 5 

Human genomic DNA, chromosome 22q11.2, 

clone N75A 12. ., 
Collagen, type I, alpha 1 



Genebank 
number 

BAB03600. 
AF239798 

U66033 
AP000362 

Z74615 



Longer stimulation ofTHP 1 ceils with <CpG-ODN (8h) 
resulted in the up-regnlation of 58 genes. These genes in- 
cluded notably 1L-10 receptor beta, formyl peptide receptor- 
like 1 (FPRLl), vitamin D receptor, nuclear receptor sub- 
family 1 (UR), early B-cell factor, protein kinase C gamma 
(PKC), Nek, Ash. phospholipase C binding protein (NAP4), 
phosphoriboxyl pyrophosphate amidotransferase, dishev- 
eled 3. 'WISP-2 etc. Analysis of the functions of the 58 up- 
regulated genes showed that they are associated with anti- 
inflammation, transcriptional regulation, intracellular signal 
transduction, tumor progression, cell differentiation, pro- 
teolysis, neurodegeneration, neuroprotection etc. {Table 3). 
We also found that the stimulation of THP-1 cells with CpG- 
ODN for different periods of times resulted in different pro- 
files. Several defense related genes such as 1L-18 receptor 
accessory protein, Pio-PoJ-dUTPase polyprotein, SpHO and 
coimexin 59 were transiently up- regulated at 2h short stim- 

Table 3. Genes up-regulated (> 2 fold) in human THP-1 cells after 
8 h of CpG-ODN treatment 



Gene name and description 
(changed fold >?) 



Genebank 
number 



Inflammation and receptor 

IL-10 receptor (beta) - U08988 

Formyl peptide receptor-like 1 (FPRLl) AF081535 

Vitamin D receptor J03258 

NMDAR1 Z32774 

CD44 antigen AW028346 

Nuclear receptor subfamily 1 (LxR) NM_0O5693 

Neuromedin B receptor M73482 

Transcriptional regulation 

Early B-cell factor lebf) AF208502 

Neurogenic differentiation 1 (Neuro D) AB018693 

MAX dimerization protein (IMESH protein) AB037886 

Ribosomal protein S6 kinase AF090421 

ASII2L ABD22785 

Regulatory protein 

Hypothetical protein DKFZp434H0fi2O AL137555 

Peroxisomal farnesytateri protein X75535 

L7Bp 4 AF051344 

Neuronal pentraxin II U29195 
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Table 3. Continued 



Gene name and description 
(changed fotd > 2) 



Structure protein 

Trichohyalin 

Kertain 

Other 

Human transferrin pseudogene 
TIMP-2 

Collagen like protein 

Human genomic DNA, chromosome 21q, 

section 60/105 
Human genomic DNA, chromosome 21q, 

section 64/105 
KIAA0136 
KIAA0379 
K1AA04B9 
K1AA1114 
KIAA1451 
K1AA0756 

Zinc linger protein 267 

Hypothetical protein FU10633 

Hypothetical protein EUROIMAGE 1955967 

Mybl homolog like 1 

Anti^yme inhibitor 

Disintegrin-like and metalloprotease 

(reprolysin type) with Thrombospondin 

type 1 motif, 3 
ADP-rtbosylation lactor 3 (ARF-3J 
Testis specific protein, Y- linked 
Unnamed protein 



Genebank 
number 



Channel and transport 

Gamma-arninobutyric (61 J A receptor 
ATP synthase sub unit F6 
transient receptor potential channel 1 
Cboroideremia (Rab escort protein 1) 

Signal transduction 

Protein kinase C, gamma 
Regulator of G-protein signaling 5 
Nek, Ash and phospholipase C binding . 

protein INAP4) 
Highly similar to adeylate kinase gene 

E nzy m e and protease 
Phenylalanine hydroxylase 
Carboxy peptidase A1 
Xylulokinase 
Pancreatic lipase 
Ubiquitin specific protease 12 
Transmembrane protease, serine2 
Aspartate bata-hydroxylase 
Phosphoribosyl pyrophosphate 
amidotransferase 

Tumor progression and cell differentiation 

CDC 23 

WISP-2 

Microseminoprotein, beta 
- Dishevelled 3 



NM_004961 
M37104 
Z73903 
X57637 

215114 

AI674877 

AB005216 

AB0 16886 

AA203389 

X67318 

AK001205 

J05125 

AF022789 

U75329 

U03109 

D13757 



AF053977 
AF100780 
M34376 
NM_004423 

L09190 
AF061809 

M22376 
U44383 
U67921 
AP001716 

AP001720 

D50926 

AB002377 

AB007958 

AL049732 

AB040884 

AB0 18299 

AF220492 

AK001495 

AK026108 

AK001893 

D88674 

AB002364 



M74491 

M98525 
AK026O4? 



iB^MlfficMfings 899 

Table 4. List of antimicrobial and anti-inflammatory genes 
modulated by CpG-ODN treatment of THP-1 cells 



Gene name 


Genebank 


Expression fold 




number 










2h 


8h 


Connexin 59 gene 


129271 


2.12 ±0,05 


1.74 ± 0.08 


IL-18 receptor 


X58806 


2.32 ± 0.21 


1.19 ±0.13 


accessory protein 








Integrin, alpha 1 


X68742 


2.01 ± 0.03 


1.05 ±0.10 


Nuclear body protein 


U63420 


2.22 ±0.11 


1.39 ±0.04 


Sp140 








Pro-Pol-dUTPase 


AC004748 


2.33 ± 0.18 


0.95 ±0.03 


polyprotein 








Thioredoxin 


NM_003329 


2.20 ± 0.08 


1.07 ±0.01 


FPRLi 


AF081535 


0.95 ± 0.06 


2.13 ±0.11 


IL- 10 receptor 


U08988 


1.22 ± 0.18 


2.21 ± 0.07 


UR 


NM_005693 


0.90 ± 0.03 


2.37 ±0.31 


Vitamin D receptor 


J03258 


1.37 ±0.23 


2.39 ±0.11 



Expression fold is designated as the ratio of CpG-ODN treated 
over control 



ulation but were down-regulated thereafter, while anti- 
inflammatory associated genes such as FPRLI, IL10 recep- 
tor, vitamin D receptor and LxR were up-regulated after 8h 
stimulation (Tables 1, 3 and 4). 

3.2 Verification of the micro array results with RT-PCR 
or Western blotting 

To verify the results from the microarray analysis,, we also 
performed RT-PCR on the up-regulated genes (Fig. 1). Con- 
sistent with results obtained in the microarray gene expres- 
sion analysis, RT-PCR studies showed thai the mRNA levels 
of some selected genes, including ubiquitin specific pro- 
tease 12, regulator of G-protein signaling 5, NAP4 and 
ASH 21*, were increased in response to CpG-ODN (Fig. 1, 
Table 3). In addition, the protein expression level of ARF-3 



- GpC-ODN 



CpG-ODN 



Untreated 



Ubiquitin specific 
protease 12 
Regutatot of G- 
prottt'm signaling 5 j 

NAM 



~ i ■ ■ ■ i 

Figure 1. Induction of various genes by CpG-ODN. THP1 cells 
were stimulated with medium alone, 1.5 pM GpC-ODN (as the 
negative control) or CpG-ODN for the indicated times.- RT-PCR 
was then performed to analyze gene expression levels, p-actin 
was used as an internal contiol. The experiment was repeated 
three times with similar results. 
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Untreated 



Actin 

Figure 2. CpG-ODN induced ARF-3 protein expression in human 
THP-? or mouse RAW264.7 cells. THP-1 (A) or mouse RAW264.7 
(B) cells were incubated with 1.5 mm CpG-ODN for the indicated 
time points. The protein expression level of ARF-3 was deter- 
mined by Western blotting of cell extracts using anti-ARF-3 anti- 
body- The experiment was repeated three times with similar 
results. 

was shown to increase in Western blotting analysis in cell 
lysates from THP-1 cells treated with CpG-ODN for 8-24 h 
(Fig. 2A). Similar studies showed that the ARF-3 protein was 
also induced by mouse specific CpG-ODN in other TLR9 
expression cell lines such as the mouse macrophage 
RAW264.7 cell line (Fig 2B). 

3.3 Proteins regulated in CpG-ODN stimulated THP-1 
cells 

To further assess whether there was any correlation between 
regulation of gene expression and expression of cellular pro- 
teins, a proteomic approach was adopted to identify protein 
expression profiles. THP-1 cells were treated with CpG-ODN 
for defined times (from 8 to 40h), and their cytoplasmic 
proteins were extracted for 2-DE analysis. Although the use 
of high concentrations of urea might give us a broader view 
of all the proteins affected by CpG-ODN, preliminary results 
from 2D gels showed that the resolution of the protein mix- 
tures were not satisfactory. To improve and get the best reso- 
lution from 2-DE, total proteins were roughly separated into 
supernatant and precipitated fractions using 50% saturated 
ammonium sulfate solution. To remove salts and other con- 
taminants, both protein fractions were precipitated with TCA 
solution and then subjected to 2-DE. By protein spot deter- 
mination analysis, about 500 and 450 well-resolved spots 
were observed on each pH 3.0-10.0 gel for precipitated or 
supernatant fractions, respectively. Comparative analysis of 
2-DE between treatments and control showed that the inten- 
sities of the protein spots from the ammonium sulfate pre- 
cipitated fraction did not change, while several protein spots 
weie up-regulated by at least -80% in the supernatant frac- 
tion of 8h CpG-ODN stimulated THP-] tells (Fig. 3). 

The protein spots were individually excised from gels for 
further identification. After trypsin digestion, several protein 
spots were identified without ambiguity by MS MALD1- 




Figure 3. The effect of CpG-ODN on the 2-DE profile of THP-1 
cells. THP-1 cells were treated with or without 1.5 pM.CpG-ODN 
for 8h. Total proteins were extracted and roughly separated into 
two fractions by 50% saturation with ammonium sulfate. The 
supernatant fraction was then separated by 2-DE. Protein spots 
were visualized by SYPRO Ruby staining. Comparison of CpG- 
ODN treated THP-1 celts to untreated cells showed that these 
proteins changed in intensity by over 80%. Protein spots were 
identified by trypsin digestion and MS. Localization of protein 
spots 1 (enoly-coehzyme A hydratase), 2 Iproteasome a) and 3 
(cyclophilin A; two isotorms) are shown. The experiment was 
repeated three times with similar results. 



TOF. These proteins included HSP60, HSP90, cycJophi- 
lin A, enoyj-coenzyme A hydratase, eukaryotic translation 
elongation factor, proteasome ct and p drain and ATP syn- 
thase beta chain (Table 5). Similar experiments on cells 
treated for a longer period of time with CpG-ODN stimu- 
lation (25 h) revealed that 27 protein spots were changed in 
intensity by at least -80%. These protein spots contained 
members of HSPs {HSP27, hsc70, grp78 and grp94), 
metabolic enzymes (phosphoglycerate kinase (PGK) and 
pyruvate kinase (PYK)), macrophage capping protein and 
cyclophilin A (Table 6). Among these proteins, macrophage 
capping protein, PGK, PYK, cyclophilin A and HSP27 
(Figs. <l and 5 A) were found to be up-regulated. Interest- 
ingly, we found that a truncated form of grp78 with an 
expected mass of 25kDa and pJ of 5.3 was up-regulated 
while grp78 itself was down- regulated. A similar situation 
was also found for grp9't and hsc70 and their truncated 
derivatives (Table 7 and Fig. 5). In addition, we also ob- 
served six down -regulated protein spots on 2-D gels in 
samples after 25 h CpG-ODN treatment. Among these six 
proteins, we have successfully identified three as 40s ribo- 
sornal protein SA, grp78 and hsc70," respectively (Table 6). 
while the other three, due to their relative low abundance," 
have not been identified yet. 
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Table 5. List of proteins modulated by 8h CpG-ODN treatment 



Protein name 


Accession 


H" . 


P' 


Matched 


Cover- 


Score 


Expression 




no. 


(theor.) 


(theor.) 


no. 


3ge% 




fold 


ATP synthase beta-chain 


gi1 14549 


56 525 


5.26 


16 


58 


171 


2.33 ± 0.06 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 , 


62 


2.85 ± 0.13 


Enoyl-Coenzyrhe A 


g»4503447 a> 1 35 971 


6.61 


11 


44 


62 


2.52±Q.05 


hydratase 
















Eukaryotic translation 


gi4503481 


50 087 


6.25 


6 


25 


68 


3.41 ±0.21. \ 


elongation factor 
















HSP60 


P10809 b) 


57 963 


5.24 


13 


27 


76 


2.78 ± 0.03 


HSP90-beta 


P08238 


83 133 


4.97 


10 


18 


65 - 


2.36 ±0.10 


Proteasome a chain 


g>450618l 


25882 


6,92 


11 


59 


80 


2.52 ±0.11, 


Proteasome p chain 


gi4506193 


26472 


8,27 


9 


46 


84 


3.85 + 0.17 



Expression fold is designated as the ratio of CpG-ODN treated over control 

a) . NCBI accession number 

b) Swiss-Prot accession number 



Table 6. List of proteins modulated by 16 and 25 h CpG-ODN treatment 



Protein name Swiss-Prot M r pi Match- Cover- Score Expression fold 





no. 


itheor.) 


. Itheor.) 


ed no. 


age% 




16h 


25h 


Cyclophilin A 


P05092 


17 870 


7.82 


5 


35 


62 


2.53 ± 0.02 


2.48 ±0.15 


78kDa glucose regulat- 


P11021 


72 288 


5.07 


13 


30 


148 


0.61 ± 0.04 


0.29 ± 0.06 


ed protein tgrp78) 


















HSP27 


P04792 


22 768 


5.98 


12 


59 


124 


1.00 ±0.01 


2.61 ±0.12 


Heat shock cognate 


P11142 


70 854. 


5.37 


16 


34 : 


114 


0.64 ± 0.01 


0.31 ± 0.03 


70kDa protein 


















thsc70) 


















Macrophage capping 


P40121 


38 494 


5.88 


9 


30 


58 


1.31 ±0.01 


2.58 ± 0.03 


protein 


















Phosphoglycerate 


P00558 


44 284 


7052 


11 


33 


71 


2.32 ± 0.11 


4.23 ± 0.19 


kinase 


















Pyruvate kinase 


P14618 


57 710 


7.95 


17 


32 


114 


1.65 ± 0.07 


2.70 ±0.12 


40s ribosornal protein 


P08865 


32 833 


4.79 


5 


23 


61 


0.33 ± 0.02 


0.35 ± 0.05 



SA (RSP40) . 



Table 7. List of truncated proteins detected in THP-1 cells after 25 h CpG-ODN treatment 



Protein name 


Swiss-Prot 




p/ 


Matched 


Cover- 


Score 


Expression 




no. 


[obs.> 


fobs.) 


no. 


age% 




fold 


94 kDa glucose- regulated 


PI 4625 


j -59700 


-5.00 


12 


14 


104 


New* 


protein (grp94) 
















Truncated form of grp78 


P11021 


-25 000 


-5.30 


12 


22 


75 


New 


Truncated form of hsc70 


P11142 


-22 000 


-5.80 


12 


20 


96 


New 


Truncated form of nsc70 


P11142 


-19 000 


-6.10 


11 


18 


113 


New 



d) New designated proteins detected in the CpG-ODN treated gel but not in the corresponding control gel 



3.4 Comparison of microarray and proteomic results 

Table 8 shows the expression of six genes and their corre- 
sponding proteins that were modulated by 8h treatment of 
THP-1 cells with CpG-ODN. Besides enoyl-coenzyroe A 
hydratase, there was poor correlation between the expression 



of genes and their corresponding proteins (Table 8), sug- 
gesting that more in-depth studies were needed. To further 
evaluate whether changes observed in protein expression 
correlated with changes in mKNA levels, we randomly chose 
two proteins (PGK and PYK) that were induced after 16h 
CpG-ODN treatment and determined their mRNA levels by 
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Figure 4. 2-D gel clectrophoretic analysis of CpG-ODN-treoted 
THP-1 cells. (A) Total cell protein from unstimulated THP-1 cells 
was subjecled to 2-DE,{B) THP-1 cells were treated whh or with- 
out 1.5 pm CpG-ODN for defined times. Cellular proteins were 
extracted and separated by 2-DE. Several up-regulated proteins 
are shown in the SYPRO Ruby staining gel. Comparison of CpG- 
ODN treated THP-1 cells to untreated cells showed that these 
prote.ns changed in intensity by over 80%. The experiment was 
repeated three times with similar results. 




Figure 5. 2-D gel electrophoretic analysis of CpG ODN treated 
THP-1 cells. THP-1 cells were treated with or without 1.5 M m CpG- 
ODN to, deHned times. Cellular proteins were extracted and 
separated by 2-DE. Protein spots were detected by SYPRO Ruby 
staining. |A) Expression of HSP27 was induced by increasing the 
penod of CpG-ODN stimulation. A truncated form of hsc70 was 
detected on the gel. IB) The native form of g, p 78 was detected in 
untreated cells, while the truncated form of gr P 78 was observed 
after 25 h stimulation. The experiment was repeated three times 
with similar results. 
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Table & Comparison of gene, and protein expression levels in 
. THP-1 cells after 8 h CpG-ODN treatment 



Protein name 


Gene 


Protein 




expression 


expression 




fold from 


fold from 




microarray 


2-D gel 


Enoy (-Coenzyme A hydratase 


1.72 ± 0.31 


2.52 ± 0.05 


Eukaryotic translation 


0.95 ± 0.01 


3:41 ± 0.21 


elongation factor 






HSP60 


0.94 ± 0.07 


2.78 ± 0.03 


HSP90-beta 


1.58 i 0.11 


2.3B + 0.10 


Proteasome a chain 


0.98 ± 0.03 


2.52 :t0.H 


Proteasome p* chain 


1.09 ± 0.11 


3.85 ±0.17 



RT-PCR. Char results showed that mRNA levels of PYK 
increased after 16 h CpG-ODN treatment, while mRNA 
levels of PGK were dramatically increased after 24 h stimu- 
lation (Fig. 6). In addition, we also performed enzyme activ- 
ity analysis and found that the activity of PGK and PYK were 



3.5 
3 



(A) 

DGpC ODN 
S CpG-ODN 




— 2 



t> 1.5 



(B) 



□ GpC-ODN 
0 CpG-ODN 



16 



24 



"248 
Time (h) 

Figure 6. Activities o* PYK and PGK induced by CpG-ODN. THP1 
cells were stimulated with medium alone, 1.5 m m GpC-ODN (as 
the negative control! or CpG-ODN for the indicated times. Cell 
lysotes were extracted and assayed lor (A) PYK and IB) PGK ac- 
tivities. Data represent mean -x SEM. |n = 3). 
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Figure 7. Induction of PYK and PGK. transcripts by CpG-ODN. 
THP1 cells were stimulated with medium alone, 1.5 >im GpC-ODN 
or CpG-ODN for the indicated. times. RT-PCR was then performed 
to anaryze gene expression, fcactin was used as an internal con- 
trol. The experiment was repeated three times with similar 
results. 



indeed increased by. a factor —2.5 after CpG-ODN stimula- 
tion (Fig. 7). To confirm that mRNA induced by CpG-ODN 
would also be accompanied by an increase in protein 
expression even though it was not detected in 2-D gel anary ; 
sis, we used more sensitive and specific Western blotting 
analysis. As shown in Table 3 and Kg. 2, ARF-3 was identi- 
fied in the miaoarray gene piofile but not in the proteomic 
expression profile. Nevertheless, we observed enhanced pro- 
tein expression of ARF-3 after CpG-ODN stimulation by 
Western blotting. Moreover; to investigate whether the up- 
regulation of ARF-3 by CpG-ODN is mediated through the 
TLR9 pathway, TLR9-deficient HEK293 cells were transiently 
cotiansfected with hTLR9 and luciferase-reporter gene driv- 
en by a NF-xB-dependent promoter. Our data showed that 
NF-kB activity of untransfected HEK293 cells were not 
responsive to CpG-ODN stimulation, while in transfected 
HEK293 cells expressing hTLR9, NF-kB luciferase activity 
was up-regulated 12-fold after 8 h CpG-ODN stimulation. 
The activation of NF-kB induced by CpG-ODN was blocked 
by pretreatroent of the transfected cells with an ARF-3 in- 
hibitor, such as brefeldinA (Fig. 8), suggesting CpG-ODN 
induces ARF-3 and activates NF-kB after the interaction of 
CpG-ODN with TLR9. 



4 Discussion 

In this study both microarray and proteomic approaches 
were used to evaluate the effect of CpG-ODN on gene/pro- 
tein expression profiles of THP-] cells at several rime points. 
Comparison of the gene expression profiles showed that 
stimulation of the cells with CpG-ODN for different periods 
of time resulted in different profiles (Tables 1-4). The differ- 
ences in mRNA expression between the cells with short and 
long stimulation could be attributable to the low reproduci- 
bility. However, to avoid experimental variations, we not only 
used the same batch of miooanays from the same manu- 
facturer but also applied the samples of short and long term 
stimulation at the same time. In this way, we (o\md that the 



changes in expression fold of mRNA after CpG-ODN treat- 
ment were quite reproducible as shown by their mean 
± SEM (Table 4). A more likely explanation for the difference 
in the expression level of mRNA after different periods of 
stimulation with CpG-ODN is that the transient increase or 
decrease in these mRNA by CpG-ODN plays a significant 
role in modulating biological functioiis. For example, we 
found that the 11.-18 receptor accessory protein from THP-1 
cells was up-reguhted after 2h of CpG-ODN stimulation. 
The 1L-18/IL18R system is known to activate Thl-mediated 
immune responses, that play a critical role in host defense . 
against infection [26]. Together with JL-18/IL18 R, several 
genes for antimicrobial defense were also increased, includ- 
ing thioredoxin, Pro-Pol-dUTPase polyproteiri and Spl40. 
After 8h of CpG-ODN stimulation, however, none of these 
genes was activated any more (Table 4); Since sustained or 
excessive production of these antimicrobial molecules might 
lead to inflammation and cellular damage [27], a plausible 
explanation is that THP-1 cells fight against the invasion of 
pathogens by up-regulating antimicrobial defense-associated 
genes at an early stage of stimulation and then shut them 
down to avoid over-activation. Whether this explanation is . 
true remains to be verified. 

It is noteworthy that our data also identified the up-reg- 
ulation of several anti-inflammatory associated genes after 
8h of CpG-ODN stimulation. These genes included FPRU, 
1L-10 receptor, vitamin D receptor, and IxR (Table 3). FPR 
and FPRL1 have been defined as chemotactic factors 
involved in host defense against bacterial infection and in the 
clearance of damaged cells. Additional studies have indicated 
that FRPL1 interacts with a menagerie of structurally diverse 
pro- and anti-inflammatory ligands associated with diseases, 
including amyloidosis, Alzheimer's diseases, prion disease 
and HTV |28, 29). Therefore, FPRJL1 may play an important 
role in regulating and/or balancing the production of pro- 
and anti-inflammatory molecules in CpG stimulated THP-1 
cells. Additionally, a recent study has demonstrated that LxRs 
and their ligands act as negative regulators of macrophage 
inflammatory gene expression and inhibit the expression of 
inflammatory mediators such as inducible nitric oxide syn- 
thase, cydooxygenase and 1L-6 in response to bacterial 
infection or LPS stimulation [30). Of interest, we found that a 
transcription factor gene connexin 59, a regulator of JL-6 
expression, was up-regulated after 2 h of CpG-ODN stimula- 
tion. It is 0ms likely that CpG-ODN stimulation of THP-1 
cells for 2h may induce the expression of the pro-inflamma- 
tory cytokine 1L-6 through the up-regulation of the connexin 
59 gene, while 8h of CpG-ODN treatment may counter- 
balance the initial inflammatory response by inducing LxR to 
inhibit 1L-6 production. 

Signal transduction molecules play an important role in 
cellular activation. Intracellular signal transduction systems 
employing various intermolecular interactions through 
docking elements, including SH2 and SH3 domains, have 
been reported |3l-33]. Here we found that THP-1 cells 
treated with CpG-ODN for 2h up-regulated gene expression 
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of Grb2-liJce protein (which contains an SH3 domain), while 
8h of stimulation induced Nek, Ash and phospholipase C 
binding protein (NAP4 which contains an SH2 domain). It is 
thus possible that Grb2-h"ke protein and NAP4 may play 
important roles in CpG-ODN mediated signaling- pathways. 
Furthermore, recent studies have also revealed that binding 
of CpG-DNA to TLR? results in activation of ]NK [34). Since 
JNK is activated oy Nek adaptor protein and Nek interacting 
kinase [35, 36], it is possible that CpG-ODN may activate JNK 
via up-regulation of NAP4. Although a recent publication 
described the gene expression profiles of a cultured mouse 
macrophage cell line afler CpG-DNA stimulation [34J, their 
miaoarray results were only conducted at one time-point 
(6h stimulation). Moreover, they did not report the meas- 
urement of protein expression profiles in response to CpG- 
ODN stimulation. 

Comparison of the gene and protein expression profiles 
showed that there was discordance between mRNA and pro- 
tein levels (Table 8). Similar discordance between the 
expression partem of genes and proteins was also reported in 
other system using different stimuli [37-41]. The dis- 
cordance between mRNA and protein levels could be due to 
screening capability such as detection sensitivity, choice of 
cut-off point, quano'Utivity of microarray and 2-D gels, as 
well as time discrepancy between gene and protein expres- 
sion [39, 40, 42, 43]. Alternatively, it could also be explained 
by post-trancriptional events, such as alternative splicing or 
PTM [39, 40, 42, 43]. Another possible explanation is that 
most of the spots observed in the 2-D gels are isoforms of 
some proteins. The intensity of each spots does not neces- 
sary represent total amount of a certain protein and thus 
does not correlate with its mRNA level. Our finding that 
microarray results correlated better with Western blotting 
results (e.g., ARF-3 in Fig. 2), an approach more suitable than 
2-D gels for determining the total amount rather than iso- 
forms of a given protein, seems to suggest that formation of 
isoforms should be carefully taken into consideration when 
one tries to correlate mRNA and protein expression data. 

Using a proteomics approach, we found that CpG-ODN 
treatment up-regulated the expression of many proteins 
including HSPs, metabolic enzymes, structural proteins, as 
well as macrophage capping protein, cydophilin and protea- 
some a and [J chain etc. HSPs are the most abundant and 
ubiquitous soluble intracellular proteins. They are up-regu- 
lated by various stressors including temperature, glucose 
deprivation, microbial infection and cancer [44]. They func- 
tion as molecular chapeiones to prevent protein aggregation 
and contribute to the folding of nascent and altered proteins. 
In addition, they are able to regulate immune responses, 
including production of inflammatory cytokines and che- 
mokines and activation or maturation of immune cells [45 
46]. Beside HSPs, cydophilin as well as proteasome a and p 
chain have also been reported to be involved in the immune 
response |39, 47]; proteasome p chain is consistently up- 
regulated in human neutrophils following LPS exposure [39]. 
Our finding that the protein levels of HSPs, cydophilin, and 
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proteasome a and P chain were increased after CpG-ODN 
treatment suggests that these molecules might play a role hi 
the immunostimularing effect of CpG-ODN. To what extent 
these proteins contribute to the immune responses of the 
cells to CpG-ODN is currently under study Proteomic anal- 
ysis also showed that truncated forms of grp78, grp94 and 
hsc70 were induced, a phenomenon similar to calreticuhn. 
observed by Richards and his coworkers [4SJ. The expression 
of full length hsc70 and grp78 were decreased while the 
levels of their truncated derivatives was increased after CpG- 
QDN treatment. These-results suggest that the degradation 
of these proteins has been enhanced We also found that 
proteosome a and p chains as well as ubiouitm specific pro- 
tease 12 were increased by CpG-ODN. Whether these 
enzymes or other enzymes were responsible for the genera- 
tion of truncated hsc70 and grp78 remains to be eluddateel 
Cells rely on multiple signaling pathways to determine 
their fates of survival, proliferation or apoptosis (49J. In fact, 
apoptosis plays an important role in regulating pathogen' 
infection. To be able to grow and replicate in the target cells, 
pathogens may have to block apoptosis. Results from several 
laboratories have made it dear that HSP70 and HSP27 pro- 
tect cells not only from heat, but also from most apoptoric 
stimuli |4S, 50] by binding to Akt and subsequently mediat- 
ing anti-apoptotic activity through activation of Akt (51-53J. 
Since our data revealed that CpG-ODN induced the expres- 
sion of HSP90 and HSP27, it is possible that CpG-ODN 
might prevent apoptosis by up-regulation of HSPs. 

Interestingly, our microarray data also showed that CpG- 
ODN mediated the induction of a set of genes associated 
with tumor progression and cell proliferation. Among these, 
one gene, W1SP-2, was up-regulated by CpG-ODN after both 
2 and 8h stimulation. WISP genes were first identified as 
downstream targets of the WnMp-catemn signaling path- 
way They belong to the CCN family of growth factors that 
have been receiving increasing attention lately due to some 
of the family members having been reported to be involved 
in angiogenesis and turn ori genesis [54J. It would be inter- 
esting to evaluate whether CpG-ODN plays a role in angio- 
genesis and mmorigenesis by regulating WISP-2. In addi- 
tion, we found that some genes associated with neurode- 
generation or neuroprotection, such as FPRU, NMDAR 
(NMDA) receptor, PKC and dishevelled 3 were upregulated. 
To our knowledge, this is the first report to suggest an asso- 
dation between these genes and CpG-ODN stimulation. As 
mentioned above, FPRU plays a crudal role in proin- 
flammatory aspects of systemic amyloidsis and neurodegen- 
erative disease such as Alzheimer's disease and prion disease 
[28J. NMDAR, PKC and disheveled are involved in modulat- 
ing amyloid precmsor protein metabolism, which is central 
to the pathogenesis of Alzheimer's disease (55-57J. Most 
notably recent studies have shown that the TLR4- dependent 
pathway is involved in neu regeneration of the central 
nervous system [58J. Whether CpG-ODN moieties of patho- 
gens play any role in neurodegenerative diseases such as 
Alzheimer's remains to be elucidated. 
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Figure 8. ARF-3 participates in the CpG-ODN-TLR9-NF-icB path- 
way. HEK293 cells were cotransfected with p5xNF-KB and human 
TLR9: After overnight transfection, the cells were incubated, with 
or without 1.5 pM CpG-ODN for 8h in the presence or absence of 
Increasing concentrations of the ARF-3 inhibitor brefeldiri A. 
After incubation, cells were K/sed and NF-xB luciferase activity 
was measured. Data represent mean ± SEM. in = 3). 



Exposure of cells to LPS or microbial infection has been 
known to induce several genes encoding metabolic enzyme 
|34, 39). Our microarray data also revealed that a large number 
of genes encoding proteins involved in energy synthesis and 
fatty acid oxidation, such as enoyJ-coenzyrne A hydra tase, 
propionyl coenzyme A carboxylase and cytochrome p450 
were activated by CpG-ODN treatment. In addition, we found 
that other proteins such as ARF-3 were up- regulated (Table 3, 
Fig. 8). ARFs are 20kDa GTPases of the ras superfamily that 
are critical to vesicular trafficking, including exocytic protein 
transport and endocytosis (59, 60]. This study demonstrates 
for the fust time that ARF-3 is involved in the activation of N F- 
kB induced by CpG-ODN (as shown in Fig. 8} 

CpG-DNA/ODN has been shown to elicit primarily 
responses via the TLR9/MyD88 dependent pathway [18, 21, 
22]. Chromosome location analysis showed that instead of 
localizing on one or two chromosome, the genes/proteins 
modulated by CpG-ODN stimulation are scattered on all 
chromosomes except chromosomes 23 and 24. These results 
seem to suggest that CpG-ODN either affects multiple chro- 
mosomes simultaneously or subsequently via cascades of 
cellular messengers. More studies are needed lo elucidate its 
mechanism of actions. 



5 Concluding remarks 

In summary, by using microanay and proteomic approaches 
to evaluate the effect of CpG-ODN at different ti me points, 
we have found that genes/proteins regulated by CpG-ODN 



are related to inflammatory responses, antimicrobial defense, 
transcriptional regulation, intracellular signal transduction, 
tumor progression, cell differentiation, proteolysis, anti-apop- 
tosis as well as neurodegeneration and neuroprotection- Our 
results may herp delineate the CpG-ODN mediated pathway 
and contribute to further understanding of mechanisms that 
link innate immunity with acquired immune response(s). 
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Expression of the multidrug resistance-associated protein (MRP) mRNA 
and protein in normal peripheral blood and bone marrow haemopoietic 
cells. 
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We studied the expression of multidrug resistance-associated protein (MRP) in normal 
haemopoietic cells from peripheral blood and bone marrow. The MRP mRNA levels 
were estimated by RT/PCR and in situ hybridization (ISH) assay, and the protein levels 
by flow cytometry. 21 samples of peripheral blood and 21 samples of bone marrow (1 1 
normal bone marrow donors, 10 patients in complete remission after chemotherapy for 
large cell lymphoma or acute myeloid leukaemia) were analysed. In peripheral blood the 
mean MRP mRNA level in CD3+ cells was statistically higher than in the other ceils (3- 
fold by the methods used). The levels of MRP in CD3+ varied from one individual to 
another (4.5-34,8 units by RT/PCR and 5-23 grains/cell by TSH); however, this was 
proportional to the variation in all the cell lineages of same individual (r = 0.84). In bone 
marrow the mean MRP levels of the various cell lineages (including CD34+) were 
similar to the basal level in HL60 cells. Individual expression levels were again variable; 
however, diere was no difference between untreated normal bone marrow and post 
chemotherapy normal bone marrow. MRP protein expression was determined by flow 
cytometry with the monoclonal antibody MRPm6. The CD4+ lymphocytes exhibited a 
higher MRP protein expression than die other cell lineages, including CD8+ cells. There 
was a good correlation between the .three methods used (RT/PCR and ISH, P = 0.000 1 , r 
= 0.87; RT/PCR and flow cytometry, P = 0.0001, r = 0.85; ISH and flow cytometry P = 
0.002, r = 0.67): . y, 
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Vascular endothelial growth factor enhances cardiac allograft 
arteriosclerosis. 

Lemstrom KB, Krebs R, Nykanen AI . TUdcanen JM , Sthvola RK Aaltola EM . 
}imzMiWoo<W, Ahtak) K Yla-Herttuala S. Koskiacn PK . 

Cardiopulmonary Research Group, Transplantation Laboratory, University of Helsinki . 
and Helsinki University Central Hospital, Helsinki, Finland. KarlXemstrom@helsirJd.fi 

BACKGROUND: Cardiac allograft arteriosclerosis is a complex process of alloimmune 
response, chronic inflammation, and smooth muscle cell proliferation that includes cross 
talk between cytokines and growth factors. METHODS AND RESULTS: Our results in 
rat cardiac allografts established alloimmune response as an alternative stimulus capable 
of inducing vascular endothelial growth factor (VEGF) mRNA and protein expression in 
cardiomyocytes and graft-infiltrating mononuclear inflammatory cells, which suggests 
that these cells may function as a source of VEGF to the cells of coronary arteries. Linear 
regression analysis of these allografts with different stages of arteriosclerotic lesions 
revealed a strong correlation between intragraft VEGF protein expression and the 
development of intimal thickening, whereas blockade of signaling downstream of VEGF 
receptor significantly reduced arteriosclerotic lesions. In addition, in cholesterol-fed 
rabbits, intracoronary perfusion of cardiac allografts with a clinical -grade adenoviral 
vector that encoded mouse VEGF(164) enhanced the formation of arteriosclerotic 
lesions, possibly secondary to increased intragraft influx of macrophages and 
neovascularization in the intimal lesions. CONCLUSIONS: Our findings suggest a 
positive regulatory role between VEGF and coronary arteriosclerotic lesion formation in 
the allograft cytokine microenvironment. 
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CHAPTER 29 
Regulation of transcription 



The phehotypic differences that distinguish the 
various kinds of cells in a higher eukaryote are 
largely due to differences in the expression of 
genes that code for proteins, that is, those tran- 
scribed by RNA polymerase 1J. In principle, the 
expression of these genes might be regulated at 
any one of several stages. The concept of ihe 
"level of control" implies that gene expression 
is not necessarily an automatic process once it 
has begun. It could be regulated in a gene- 
specific way at any. one of several sequential 
steps. We can distinguish (at least) five poten- 
tial control points, forming the series: 

Activation of gene structure 
i 

Initiation of transcription 
I 

Processing the transcript 
I 

Transport to cytoplasm 

■ I 
Translation of mRNA 

The existence of the first step is implied by 
the discovery that genes may exist in either of 
two structural conditions. Relative to Ihe slate 
of most of the genome, genes are round m 
an "active" stale in Ihe cells in. which they 
arc expressed (see Cbapler 27). The change of 
incline is distinct from the act of Ironscrip- 
ll or>, and indicates that the gene is "transcrib- 
a»le. v This suggests that acquisition of the 
"active* structure mwsl be the first step in gene 
e *pression. 

irnn seripti en of a gr..nr, in i|, r nr. rive >tntc i.s 



controlled at the stage of initiation, tnat is, by 
the interaction of RNA polymerase with its pro- 
moter. This is now becoming susceptible to 
analysis in the in vitro systems (see Chapter 
28). For most genes, this is a major control 
point; probably it is the most common level of 
regulation. 

There is at present no evidence for control 
at subsequent stages of transcription in eixkary- 
otic cells, for example, via anfiterrnmation 
mechanisms. 

The primary transcript is modified by capping 
at the 5' end, and usually also by poiyadenyla- 
tion at the 3' end: Introns must be spliced out 
from the transcripts of interrupted genes. The 
mature RNA must be exported from the nucleus 
to the cytoplasm. Regulation of gene expression 
by selection of sequences at the level of nuclear 
RNA might involve any or all of these stages, 
but the one for which we have most evidence 
concerns changes in splicing; some genes are 
expressed by means of alternative splicing pat- 
terns whose regulation controls the type of pro- 
tein product (see Chapter 30). 

Finally, the translation of an m\\NA m Die cyto- 
plasm can be specifically controlled. There is little 
evidence for ihe employment of this mechanism in 
advdt somatic cells, but it does occur in some 
embryonic situations, as described in Chapter 7. 
The mechanism is presumed to involve die blodi- 
ing of imitation of translation of some mRNAs by 
specific protein factors. 

But having acknowledged that control of gene 
expression can occur at multiple stages, 
that production of RNA cannot inevitably be 
equated u'hti production of protein, il is clear 
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that the overwhelming majority of regulatory 
events occur at the initiation of transcription. 
Regulation of tissue-specific gene transcription 
lies at the heart of euiaryotic differentiation; 
indeed, we see examples in Chapter 38 in 
which proteins that regulate embryonic devel- 
opment prove to be transcription factors. A reg- 
ulatory transcription factor serves to provide 



common control of a large number of target 
genes, and we seek to answer two questions 
about this mode of regulation: what identifies 
the common target genes to the transcription 
factor; and how is the activity of the transcrip- 
tion factor itself regulated in response to intrin- 
sic or extrinsic signals? 



Response elements identify genes under common 



regulation 



The principle that emerges from characterizing 
groups of genes under common control is that 
they share a promoter element that is recognized 
by a regulatory transcription factor. An element 
that causes a gene to respond to such a factor 
is called a response element; examples are the 
. HSE (Ileal shock response element), GRE 
(glucocorticoid response element), SiVE (serum 
response element). 

The properties of some inducible transcription 
factors and the elements that- they recognize are 
summarized in Table 29. J. Response elements 
have the same general characteristics as 
upstream elements of promoters or enhancers. 
They contain short consensus sequences, and 
copies of the response elements found in dif- 
ferent genes are closely related, but not neces- 
sarily identical. The region bound by the factor 
extends for a short distance on either side of 

if 



. Table 29.1 inducible transcription factors bind to"::, 
response elements that identify groups of promoters . 
or enhancers subject to coordinate control, }. 



Regulatory Agent Module Consensus F actor 

Heat shock USE CNNGAANN7CCNWG HS7F 

GlucocorticoKi GRE 1 GG7 ACAAA7 GTT CH Receptor 

Phorbol ester IRE TGAC1CA APi 

Serum ERE CCATA11AGG SRF 



the consensus sequence, tn promoters, the ele- 
ments are not present at feed distances from 
the startpoinl, but are usually <200 bp upstream 
of it The presence of a single element usually 
is sufficient to confer, the regulatory response, 
but sometimes there are multiple copies. 

Response elements may be located in pro- 
moters or in enhancers. Some types of elements 
are typically found in one rather than the other: 
usually an USE is found in a promoter, while a 
GRE is found in an enhancer. We assume that 
all response elements function by the same 
general principle. A gene is regulated by a 
sequence at the promoter or enhancer that is 
recognized by a specific protein. Ihe protein 
functions as a transcription factor needed for 
RNA polymerase to initiate. Active protein is 
available only under conditions when the gene is 
to be expressed; its absence means that the pro- 
moter is not activated by this particular circuit 
An example of a situation in which many 
genes are controlled by a single factor is pro- 
vided by the heat shock response. This is com- 
mon to a wide range of prokaryotes and 
euk3ryotes and involves multiple controls of 
gene expression: an increase tn temperature 
turns oJT transcription of some genes, I urns on 
transcription of the heat shock genes, and 
causes changes in the translation of mRNAs. 
The control of the heal shock genes .illustrates 
the differences between prokaryolic and 
euk aryotic modes of control. In bacteria, a net* 
sigma factor is synthesized ihai directs JVfNA 
polymerase holoenxyrne to recognize an alter 
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Retinal preconditioning and the induction of heat-shock protein 27. 

LTY, RothS, LascrM, MaJX, Crosson CE . 

PURPOSE: Brief periods of ischemia have been shown to nroterf th, a- 

of heat shock prattTffL^^Sl On^ pmT ^ ^ CeHu,ar Chan 8 es in leve,s 
associated wiS r ^l^ T mRNA and expression of Hsp in the rat retina 
ischemia ^c^^^^^^^f Unilateral retinal 
then left for 1 hour to day" to Towlt Sp 3gU f - Dawle / ™* for 5 minutes. Rats were 

changes in expression of Hs P 27 70 Jf™"! blot f a| y ses conducted to detect 
identify region" S 

a second ischemic event 60 minutes mT^in^ T tcd u aiuma,s wer e subjected to 
^-thfunc.^ 

and sham-treated animals retinal H<;n?7 7rt » n A on ^* ^^^AS. In control 

expression remained elevated for up to 72 hours and th™ ul !. P 
levels Hs P 27 protein levels were increased by *E* oZ S ^Zl h 
retinal preconditioning, remained at this level for nZZ nd hen l^ „7 ^ , 
levels. In conlrast, no consistent change in HsriTO or S/ ? d l ° C ° nlro1 

^^^^ 

PMID: 12601062 [PubMed - indexed for MEDLINE] 
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Enhanced expressions of arachidonic acid-sensitive tandem-pore domain 
potassium channels in rat experimental acute cerebral ischemia. 

LLZB, ZhangHX, LiLL, WangXL. 

lastitute of Materia Medica, Clunese Academy of Medical Sciences and Peking Union 
Medical College, Beijing 100050, China. g 

To finther explore the pathophysiological significance of arachidonic acid-sensitive 
potass.u m channels, RT-PCR and Western blot analysis were used to invite the 
express.on changes of TREK channels in cortex and hippocampus in rat Spenmentel 

TfSntil!!^ J m W PP ocam P us sh0 ^d significant increases 2 h 

fte Il^tee r ^ u7 ° CdUS,0n (MCAO) - ™ e levels of 

flie all three channel subtypes increased significantly 24 h after MCAO in cortex and 
kppocamp^At the same time> the ^ ^ « hal e Tp"L 

TST mCreaS / 24 h aftCr MCA ° ^ COrtCX ^ hi PP««-Pu S , but oSy 
Si?T? ^ eXpreSSi ° n 2 h 8fter MCAO in «*« 311(1 Wppoeampus 
Immunolus tochem.cal experiments verified that all the three channel protei3d h iHher 
expression levels in cortical and hippocampal neurons 24 h after MCAC Tese 2 

ChamdS m,ght P r0Vlde a neuroprotective mechanism in the pathological process. 
PMDD: 15652517 [PubMed - indexed for MEDLINE] 
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Increasing expression of tissue plasminogen activator and plasminogen 
activator inhibitor type 2 in dog gingival tissues with progressive 
inflammation. 

Liadbcrg P, Kjimby B, Lecander I , LangjNP, Matssoa L . 

Center for Oral Health Sciences, Malmo University, S-214 21 Malmo, Sweden. 
pia.lindberg@od.rhaIi.se 

Urokinase and tissue-type plasminogen activators (u-PA and t-PA) are serine proteases 
that convert plasminogen into plasmin, which degrades matrix proteins and activates 
metalloproteinases. The PAs are balanced by specific inhibitors (PAI-- 1 and PAI-2) 
Local production of t-PA and PAI-2 was recently demonstrated in human gingival • 
ttssues. The a.m now was to investigate the production and localization of t-PA and PAI- 
-2 in gingival tissues.from dogs in three well-defined periodontal conditions; clinically 
healthy gingiva, chronic gingivitis and an initial stage of ligature-induced loss of 
attachment. At the start of the experiment the gingiva showed clear signs of 
inflammation. Clinically healthy gingiva were obtained after 2 1 days period of intense 
oral hygiene. Attachment loss was induced by placing rubber ligatures around the neck of 
some teeth. Biopsies were taken from areas representing the different conditions and 
prepared form situ hybridization and immunohistochemistry. In clinically healthy 
gingiva both t-PA mRNA and antigen were expressed in a thin outer layer of the sulcular 
and junctional epithelia. No t-PA signals or staining were seen in connective tissue. Both 
mRNA signaling and immunostaining for t-PA were stronger in chronic gingivitis In 
areas with loss of attachment, t-PA mRNA as well as antigen werefdund in the sulcular 
and junctional epitheha to a similar degree as in gingivitis. Occasionally the connective 
tissue was involved, especially in connection with vessels. PAI-2 mRNA was seen in a ■ 
thin outer layer of the sulcular and junctional epithelia in clinically healthy gingiva but 
no signals were seen in connective tissue. PAI-2 antigen was found primarily in the 
outer layer of the sulcular and junctional epithelia. -Some cells in the connective tissue 
were stained. In gingivitis, PAI-2 signals were mainly found in the same locations, but 
more intense and extending towards the connective tissue. Immunostaining was seen in 
the outer half of the sulcular and junctional epithelia as well as in the upper part of the 
connect.ve tissue, close to the sulcular epithelium. In sites with loss of attachment PAI- 
2 mRNA was found throughout the sulcular and junctional epithelia, as was the antigen 
which stained intensely. No PAI-2 mRNA was seen in connective tissue; the antigen was 
found scattered, especially near vessels. This study shows that the expression of both t- 
PA and PAI-2 increases with experimental gingival inflammation in the dog, and 
furthermore, the two techniques demonstrate a strong correlation between the' 
topographical distribution of the site of protein synthesis and the tissue location of the 
antigens for both t-PA and PAI-2. The distribution correlates well with previous 
findings in humaris. 
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Effect of duration of fixation on quantitative reverse transcription 
polymerase chain reaction analyses. < 

Macabeo-Onff ivr, GinzingerDG, DekkerN. McMillan A . Regezi JA Wong DT 
Jordan RC . ' ' 

Oral Pathology, Department of Stomatology, University of California San Francisco 
California 94143-0424, USA. 

Increasingly, there is the need to analyze gene expression in tumor tissues and correlate 
these findings with clinical outcome. Because there are few tissue banks containing 
enough frozen material suitable for large-scale genetic analyses, methods to isolate and 
quantify messenger RNA (mRNA) from formalin-fixed, paraffin-embedded tissue 
sections are needed. Recovery of RNA from routinely processed biopsies and 
quantification by the polymerase chain reaction (PCR) has been reported; however the 
effects of formalin fixation have not been well studied. We used a proteinase K-sal't 
precipitation RNA isolation protocol followed by TaqMan quantitative PCR to compare 
the effect of formalin fixation for 24, 48, and 72 hours and for 1 week in normal (2) oral 
epithelml dysplasia (3), and oral squamous cell carcinoma (4) specimens yielding 9 fresh 
and 36 formalin-fixed samples. We also compared mRNA and protein expression levels 
using immunohistochemistry for epidermal growth factor receptor (EGFR) matrix 
metalloproteinase (MMP)-l, p21, and vascular endothelial growth factor (VEGF) in 15 
randomly selected and routinely processed oral carcinomas. We were able to extract 
RNA suitable for quantitative reverse transcription (RT) from all fresh (9/9) and 
formalin-fixed (36736) specimens fixed for differing lengths of time and from all (15/15) 
randomly selected oral squamous cell carcinoma. We found that prolonged formalin 
fixation (>48 h) had a detrimental effect on quantitative RT polymerase chain reaction 
results that was most marked forMMP-1 and VEGF but less evident for p21 and EGFR 
Comparisons of quantitative RT polymerase chain reaction and immunohistochemistry ' 
showed that for all markers, except p21, there was good correlation between mRNA and 
protein levels. p21 mRNA was overexpressed in only one case, but protein levels were 
elevated in all but one tumor, consistent with the established translational regulation of 
p21 These results show that RNA can be reliably isolated from formalin-fixed paraffin- 
embedded tissue sections and can produce reliable quantitative RT-PCR data However 
results for some markers are adversely affected by prolonged formalin fixation times. ' 

PMID: 1221 8216 [PubMed - indexed for MEDLINE] 
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Id-1 and Id-2 arc overcxpresscd iu pancreatic cancer and in dysplastic 
lesions in chronic pancreatitis. 

Maruvama H , KjcefTJ, WtjdjS, Frjcss H. Bucl.kr IVtW . Israel MA l^M 
Division of Endocrinology, Department of Medicine, University of California, Irvine, 

Uuri. 

Id proteins antagonize basic helix-loop-helix proteins, inhibit differentiation, and enhance 
cell proliferation. In this study we compared the expression of Id- 1 , Id-2 and Id-3 in the 
normal pancreas, in pancreatic cancer, and in chronic pancreatitis (CP). Northern blot 
analysis demonstrated that all three Id rnRNA species were expressed at high levels in 
pancreaHc cancer samples by comparison with normal or CP samples. Pancreatic cancer 
cell lmes frequently coexpressed all three Ids, exhibiting a good correlation between Id 
mRNA and protein levels, as determined by iinmunoblotting with highly specific anti-Id 
anybodies. Immunohistochemistry using these antibodies demonstrated the presence of 
taint Id-1 and Id-2 immunostaining in pancreatic ductal cells in the normal pancreas 
whereas Id-3 immunoreactiviry ranged from weak to strong.In the cancer tissues, many 
of the cancer cells exhibited abundant Id-1, Id-2, and Id-3 immunoreactivity. Scoring on 
he basis of percentage of positive cells and intensity of immunostaining indicated that 
ld-1 and Id-2 were increased significantly in the cancer cells by comparison with the 
respective controls. Mild to moderate Id immunoreactiviry was also seen in the ductal 
cells in the CP-like areas adjacent to these cells and in the ductal cells of small and 
interlobular ducts in CP. In contrast, in dysplastic and atypical papillary ducts in CP, Id-1 
and Id-2 immunoreactmty was as significantly elevated as in the cancer cells These 
findings suggest that increased Id expression may be associated with enhanced 
proliferative potential of pancreatic cancer cells and of proliferating or dysplastic ductal 
cells in CP. 
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ld-1 and ld-2 Are Overexpressed in Pancreatic 
Cancer and in Dysplastic Lesions in 
Chronic Pancreatitis 



Haruhisa Maruyama,* Jorg Kleeff,* Stefan WiloV . 
Helmut Friess, 1 Markus W. Buchler* 
Mark A. Israel * and Murray Korc* 

pfx>m the Division of Endocrinology, Diabetes, and Metabolism*. 
Departments of Medicine, Biological Chemistry and 
Pharmacology, University of California, Irvine, California; the ' 
Department of Visceral and Transplantation Surgery? University 
of Bern; Bern, Switzerland; and the Preuss Laboratory? 
Department of Neurological Surgety. University of California, 
San Francisco, California . 



Jd proteins antagonize basic helix-Joop-belix pro- 
teins, inhibit differentiation, and enhance cell prolif- 
eration. In this study we compared the expression of 
Jcl-l, Jd-2, and Id-3 in the normal pancreas, in pan- 
creatic cancer, and in chronic pancreatitis (CP). 
Northern bJot analysis demonstrated that all three Id 
mKNA species were expressed at high levels in pan- 
creatic cancer samples by comparison with normal or 
CP samples. Pancreatic cancer cell lines frequently 
coexpressed all three Ids, exhibiting a good correla- 
tion between Id niRNA and protein levels, as deter- 
mined by inununoblotting with highly specific anti-Id 
antibodies. Ixoinvinohistochemistry using these anti- 
bodies demonstrated the presence of faint Id ] and 
ld-2 immvMiostaining in pancreatic ductal cells in the 
normal pancreas, whereas ld-3 imnmnoreactivity 
ranged from weak to strong. In the cancer tissues, 
many of the cancer cells exhibited abundant Id-3, 
ld-2, and Jd-3 immunoreactivity. Scoring on the basis 
of percentage of positive cells and intensity of immu- 
nostaming indicated that Id-land ld-2 M ere increased 
significantly in the cancer cells by comparison with 
the respective controls. Mild to moderate Id immuno- 
reactivity was also seen in the ductal cells in the 
CP- like areas adjacent to these cells and in the ductal 
cells of small and interlobular duels in CP. In con- 
trast, in dysplastic and atypical papillary ducts in CP, 
Id- J and ld-2 immunoreactivity was as significantly 
elevated as in the cancer cells. These findings suggest 
that increased Id expression may be associated with 
enhanced proliferative potential of pancreatic cancer 
cells and of proliferating or dysplastic ductal cells in 
CP. (A »i J Fa tb ol J<J*J<J, 155:815-822) 



Basic helix-loop-helix (bHLH) proteins play an important 
role as transcription factors in cellular development, pro- 
liferation, and differentiation: 12 The basic domain of the 
bHLHs is required for binding to an E-box DNA se- 
quence, thus promoting transcription of specific target 
genes. The HLH domain promotes dimer formation with 
various members of the bHLH protein family. 1 * 2 Ho- 
modimers of the class B family of bHLH proteins, includ- 
ing MyoD, NeuroU, and numerous other proteins, are 
known to activate tissue-specific genes. 3 ; 5 These tissue- 
specific bHLHs typically form heterodimers with widely 
expressed class A bHLHs, which include proteins en- 
coded by E2A,.E2-2, HEB, and other genes (also termed 
E-proteins). 6 " 9 These heterodimers activate transcription 
of genes that are associated with differentiation. 

fd genes encode a family of tour HLH proteins that lack 
the basic DNA binding domain. 1 - 10 They act as dominant- 
negative HLH proteins by forming high affinity het- 
erodimers with other bHLH proteins, thereby preventing 
them from binding to DNA and inhibiting transcription of 
differentiation-associated genes. 10 " 12 Id gene expres- 
sion is down-regulated on differentiation in many cell 
types in vitro and in vivo. 1 3-10 In addition, Id proteins seem 
to be required for cell cycle progression through G^S 
phase in certain cell types, and interaction between ld-2 
and pRB is associated with enhanced proliferation in 
some cell lines in vitro? 9 ' 23 

Pancreatic cancer is the fifth leading cause of cancer 
death in. the United States, with a mortality rate that vir- 
tually equals its incidence rate. 24 This malignancy is often 
associated with the overexpression of a variety of myo- 
genic growth factors and their receptors, and by onco- 
genic mutations of K-ras and inactivation of the p53 tumor 
suppressor gene 25 We have recently reported that pan- 
creatic cancers overexpress the HLH protein ld-2, and 
that enhanced expression of this protein is evident in the 
cytoplasm of the cancer cells within the pancreatic tumor 
mass. 26 It is not known, however, whether the expression 
of other Id proteins is altered in this malignancy, or 
whether their expression is altered in chronic pancreatitis 
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(CP), an inflammatory, disease that is characterized by 
dyspiastic ducts, foci of proliferating ductal celts, acinar 
cell degeneration, and fibrosis. 27 We now report that 
there is a five- to sixfold increase in ld-1 and ld-2 mRNA 
levels and a twofold increase in ld-3 mRNA levels in 
pancreatic cancer by comparison with the normal pan- 
creas. In contrast, overall Id mRNA levels are not in- 
creased in CP. 



Patients and Methods 

Normal human pancreatic tissue samples from 7 male 
and 5 female donors (median age 41.8 years, range 
14-68 years), CP tissues from 13 males and 1 female 
(median age 42. 1 years; range 30-56 years), and pan- 
creatic cancer tissues from 10 male and 6 female donors 
(median age 62.6 years; range 53-83 years) were ob- 
tained through an organ donor program and from surgi- 
cal specimens from patients with severe symptomatic 
chronic pancreatitis or pancreatic cancer. A partial 
. duodenopancreatectomy (Whipple/pytorus-preserving 
Whipple; n = 13), a left resection of the pancreas (n = 2), 
or a total pancreatectomy (n = 1 ) were carried out in the 
pancreatic cancer patients. According to the TNM clas- 
sification of the Onion Internationale Centre le Cancer 
■ (UICC) 6 tumors were stage 1, 1 was stage 2, and 9 were 
stage 3 ductal cell' adenocarcinoma. Freshly removed 
tissue samples were fixed in 10% formaldehyde solution 
for 12 to 24 hours and paraffin-embedded for histological 
analysis. In addition, tissue samples were frozen in .liquid 
nitrogen immediately on surgical removal and maintained in 
-80°C until use for RNA extraction. Alf studies were ap- 
proved by the Ethics Committee of the University of Bern, 
Bern, Switzerland, and by the Human Subjects Committee 
at the University of California, Irvine, California. 

Northern Blot Analysis 

Northern blot analysis was carried out as described pre- 
: viously. 26 28 ' Briefly, total RNA was extracted by the single 
step acid guanidinium thiocyanate phenol chloroform 
method. RNA was size-fractionated on 1.2% agarose/1.8 
mol/L formaldehyde gels, electrotransferred onto nylon 
membranes, and cross-linked by UV irradiation. Blots 
were prehybridized and. hybridized with cDNA probes 
and washed under high stringency conditions. The fol- 
lowing cDNA probes were used: a 979-bp human ld-1 
cDNA probe, a 440-bp human ld-2 cDNA probe, and a 
450-bp human ld-3 cDNA probe, covering the entire 
coding regions of ld-1, ld-2, and ld-3, respectively. A 
BamHI 190-bp fragment of mouse 7S cDNA thai hybrid- 
izes with human cytoplasmic RNA was used to confirm 
equal RNA loading and transfer. Blots were then exposed 
at -80°C to Kodak BioMax-MS films and the resulting 
auioradiographs were scanned to quantify the intensity ol 
the radiographic bands. 26 28 For each sample the ratio of . 
Id mRNA expression to 7S expression was calculated. To 
compare the relative increase in expression of the re- 
spective Id mRNA species in the cancer and CP sam- 
ples, the same normal samples were used for normal/ 
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Figiwe 1. mRNA expression of Jd-J, Jd-2, and ld-3 in pancreatic cancer and 
chronic pancreatitis. Total RNA (20 up/lane) from six normal, eighl cancer- 
ous, and seven chronic pancreatitis tissue samples were, subjected to .North- 
em bio* analysis using 32 P-bbe)erf cDNA probes (500,000 cpm/ml) specific 
for Id-], )d-2, and Jd-3. respectively." A 7S cDNA probe (50,000 cpm/ml) was 
used as a loading and transfer control.' Exposure limes of the normal/cancer 
blots were 1 day for all Id probes, ;md 2 days for the normal/CP blots. 
Exposure time was 4 hours for mouse 7S cDNA. By comparison with the 
normal samples, ld-1 and ld-3 mRNA levels were elevated in 8 and 9 cancer 
Samples, respectively, whereas Jd-2 was elevated in 6 cancer samples. 

cancer and normal/CP membranes. The median score for 
ld-2, and ld-3 mRNA levels in these normal samples 
was set to 100. Statistical analysis was performed wilh 
SigmaStat software (Jandel Scientific, San Raphael, CA). 
The rank sum test was used, and P < 0.05 was taken as 
the level ol significance. 

Cell Culture and Western Blot Analysis 

PANC-1. MIA-PaCa-2, ASPC-1, and CAPAN-1 human 
pancreatic cell lines were obtained from ATCC (Manas- 
sas, VA). COLO- 357 human pancreatic cells were a gift 
from Dr. R. S. Metzger (Durham, NC). Cells were routinely 
igrown in DM EM (COLO-357, MIA-PaCa-2. PANC-1) or 
RPIVH (ASPC-1. CAPAN-1) supplemented with 10% fetal 
bovine serum. 100 U/rnl penicillin, and 100 /ig/rnl strep- 
tomycin. For imrnunoblot analysis, exponentially growing 
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Figure 2. Densitometry analysis of Northern Wow. Autoradiographs of 
Northern blots from I? normal, 14 CP, and J6 pancreatic cancers were 
analyzed by densitometry. mllNA levels were determined by calculating the 
ratio of die optical density for. I he respective )d niRNA species in rebl ion lo 
the optical density of mouie 7S cDNA. To compare the relative increase in 
expression of the respective Id mRNA species in ibe cancer and CP samples 
the Mmc normal samples were used to, normal/cancer and normal/CP 
membranes. Normal pa»oe:iiic tissues are indicated by circles, CP tissues bv 
triangles, and cancer (issues by squares. I>ata are expressed as median 
scores ± SIX By comparison with the normal samples, only the oncer 
samples exhibited significant increases: 6.5-*"old (P < n.O)) for Id-], fivefold 
(P < 0.01) for ld-2. and twofold (P = 0.027) for Id- 3. % 



cells (60-70% confluent) were solubifized in lysis buffer 
containing 50 mmol/L Tris-HCi, pH 7.4, 150 mmOl/L NaCI; 
1 mmol/L EDTA, 1 f ig/m\ pepstatin A, i mrnol/L phenyl- 
methylsulfonyl fluoride (PMSF). and 1% friton X-100. Pro- 
teins were subjected to sodium dodecyl sulfate polyacryl- 
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F'tgure 3- Id niRNA and protein expression in pancreatic cancer cell lines. 
Upper panels; Tow) RNA (20 fig/lane) from 5 pancreatic cancer eel! lines 
were subjected to. Northern blot analysis using Vp-labeled cDNA probe? 
(300,000 rpm/ml) specific for ld-K )d-2, and ld-3 : respectively. Exposure 
times were 1 day for all Id probes. Lower panels.- Immunobloiiing. Cell 
lysates (30 pg/lane) were subjected toSDS-l'AGE. Membranes were probed 
wiih specific )d-l, M-2, :tnd Id- 3 antibodies. Visualization was performed by 
enhanced ohemilmninescence. 

amide gel electrophoresis (SDS-PAGE), transferred to 
Immobilon P membranes, and incubated for 90 minutes 
with the indicated antibodies and for 60 minutes with 
secondary antibodies against rabbit IgG. Visualization 
was performed by enhanced chemiluminescence. 



immunohistochemisiry 

Specific rabbit anti-human Jd-1 (C-20), Id-2 (C-?0). and 
Id- 3 (C-20; all from Santa Cruz Biotechnology, Santa 
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Figure 4. Normal and cancerous pancreatic tissues were subjected to immu- 
noslaining using highly specific anti-ld-1 (A-C), ami- id- 2 (D-F). and ami-)d-3 
(G-l) antibodies as described in the Methods section. Moderate to strong td-l 
immunoreactivity was present in the cytoplasm of duel -like oncer cells (A 
and C r left panel). In the normal pancreas there was weak Id- 1 immunore- 
aciiviiy in the ductal cells (B). Preabsorption with the Jd-j-specific blocking 
peptide abolished the Id-] immunoreacliviry (C, right paneQ. Strong ]d-2 
immunoreacliviry was observed in the cyioplasro of the cancer cells thai 
exhibited duct-like structures (D and F. left panel), whereas in l lie normal 
pancreas, there was only weak ld-2 immunoreacliviry in the ductal cells (E). 
Preabsorption wiih the ld-2-speeifie blocking peptide abolished the ld-2 
imninnoreactivity (F. right panel). Moderate to strong Jd-3 immunoreaciiviiy 
way present in the due i- like cancer cells (G and I, left panel). Moderate to 
strong Id- 3 immunoreaciiviiy w:is a (so present in ihe ductal cells of normal 
pancreatic tissue samples (H). td-3 immunoreactiviiy wa.s compteielv abol- 
ished by preubsurpiion with ihe )d-3 spec! He blocking peptide <\ right 
panel). A. D, and G constitute serial sections of a pancreatic cancer sample, 
revealing coexpression of the three Jd proteins. Scale bars, 25 pm. 

Cruz, CA) polyclonal antibodies were used for immunhis- 
tochemistry. These affinity- purified rabbit polyclonal anti- 
bodies specifically react with ld-1, ld-2, and ld-3, respec- 
tively, of human origin, as determined by Western 
blotting. ParaHin-embedded sections (4 M m) were sub- 
jecled to immunostaining using Ihe streptavtcftn-peroxi- 
dase technique. Where indicated, immunostaining tor all 
three Id proteins was performed on serial sections. En- 



dogenous peroxidase activity was blocked by incubation 
for 30 minutes with 0.3% hydrogen peroxide in methanol. 
Tissue sections were incubated for 15 minutes (23°C) 
with 10% normal goat serum and then incubated for 16 
hours at 4°C with the indicated antibodies in PBS con- 
taining 1% bovine serum albumin. Bound antibodies 
were detected with biotinylated goat anti-rabbit IgG sec- 
ondary antibodies and streptavidin-peroxidase complex, 
using diarninobenzidine tetrahydrochloride as the sub- 
strate. Sections were counter stained with Mayer's hema- 
toxylin. Preabsorption with fd-1-, ld-2-, or ld-3-specific 
blocking peptides completely abolished immunoreactiv- 
ity of the respective primary antibody. The immunohisto- 
chemical results were serniquantitatively analyzed as de- 
scribed previously. 29 ; 30 The percentage of positive 
cancer cells was stratified into four groups: 0, no cancer 
cells exhibiting immunoreactivity; 1 , <33% of the cancer 
cells exhibiting immunoreactiviiy; 2, 33 to 67% of the 
cancer cells exhibiting immunoreactrvity; 3 >67% of the 
cancer cells exhibiting immunoreactivity. The intensity of 
the immunohistochemical signal was also stratified into . 
four groups: 0, no immunoreactivity; 1 , weak immunbre- 
aclivity; 2, moderate immunoreactivity; 3. strong immu- 
noreactivity. Finally, the sum of the results of the cell, 
score and the intensity score was calculated. Statistical 
analysis was performed with SigmaStat software. The 
rank sum test was used, and P < 0.05 was taken as the 
level of significance. 



Results 

Northern blot analysis of total RNA isolated from 12 nor- 
mal pancreatic tissues and 16 pancreatic cancers re- 
vealed the presence of the 1.2-kb ld-1 transcript and the 
1.6-kb Id2 mRNA transcript in 11 of the 12. normal pan- 
creatic samples, and the 1.3-kb ld-3 mRNA transcript in 
all normal pancreatic samples (Figure 1A, 2). In the can- 
cer tissues, ld-1 mRNA levels were elevated in 8 of 16 
samples, ld-2 mRNA levels were elevated in 9 of these 
samples, and ld-3 mRNA levels were elevated in 6 of 
these samples (Figure 1A, 2). Concomitant overexpres- 
sion of all three Id species was observed in 6 of the 
cancer samples (38%). In contrast, none of the Id mRNA 
species were overexpressed in CP by comparison with 
normal controls (Figure 1B, 2). Densitometry analysis of 
all of the autoradiograms indicated that there was a 6.5- 
told increase (P < 0.01) in ld-1 mRNA levels, a fivefold 
increase (P < 0.01) in ld-2 mRNA levels, and a twofold 
increase (P = 0.027) in ld-3 mRNA levels in the pancre- 
atic cancer samples in comparison to normal controls 
(Figure 2). In contrast, there was no statistically signifi- 
cant difference in the expression levels of ld-1, ld-2. and 
ld-3, in CP tissues in comparison to the corresponding 
levels in the normal pancreas (Figure 2). 

Next, we assessed the expression of the three Id 
genes in 5 human pancreatic cancer cell lines by North- 
ern and Western blot analyses. !d- 1 mRNA was present 
at varying levels in all 5 cell lines (Figure 3). ASPC-1. 
CAPAN-t. MIA-PaCa-2, and PANC-1 expressed moder- 
ate to high levels of Id- 1 mRNA, whereas COLO- 357 cells 
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expressed relatively low levels of this mRNA moiety. 
Western blotting with a highly specific anti-ld-1 antibody 
confirmed the presence of the approximately 14-kd ld-1 
protein in the 4 cell lines that expressed high levels ol 
ld-1 mRNA (Figure 3). Furthermore, the three celt lines 
with the highest ld-1 mRNA expression (CAPAN-1, MIA- 
PaCa-2, and PANC-1) also exhibited the highest ld-1 
protein expression. Variable levels ot the 1.6-kb fd-2 
mRNA transcript were present in all 5 cell lines. In addi- 
tion, a minor band of approximately 1.2 kb was visible in 
COLO-357 and MIA-PaCa-2 cells, tmmunoblot analysis 
with a highly specific anti-Id-2 antibody revealed two 
bands of approximately 16 and 18 kd at relatively high 
levels in all of the cell lines with exceplion of PANC-1 
cells, in which the 16-kd band was relalively faint (Figure 
.3). With the exception of MIA-PaCa-2 cells, (here was a 
good correlation between fd-2 mRNA and protein levels 
(Figure 3). ld-3 mRNA was present at high levels in 
MIA-PaCa-2 cells, at moderate levels in COLO-357 cells 
and at tow levels in PANC-1 cells, ld-3 mRNA was not 
detectable in ASPC-1 and CAPAN-1 cells (Figure 3) 
Immunoblot analysis with a highly specific anti-ld-3 anti- 
body revealed an approximately 14-kd band that was most 
abundant in MIA-PaCa-2 cells, and was also readily appar- 
ent in COLO-357 and PANC-1 cells. In contrast, only a faint 
ld-3 band was seen in ASPC-1 and CAPAN-1 cells. Thus, 
with the exception of PANC-1 cells, theie was a good cor- 
relation between ld-3 mRNA and pfotein levels; 



To determine the localization of ld-1, ld-2, and ld-3, 
immunostaining was carried out using the same highly 
specific anti-Id antibodies. In the pancreatic cancers, 
moderate to strong ld-1 immunoreactivity was present in 
the cancer cells in 9 of 10 Randomly selected cancer 
samples. An example of moderate ld-1 immunoreactivity 
is shown in Figure 4A, and of strong immunoreactivity in 
Figure 4C (left panel). In contrast, in the normal pancreas, 
faint ld : 1 immunoreactivity was present only in the ductal 
cells of pancreatic ducts (Figure 4B, arrowheads). Pre- 
absorption with the ld-1-specific blocking peptide com- 
pletely abolished the ld-1 immunoreactivity (Figure 4C, 
right panel). The cancer cells also exhibited strong ld-2 
(Figure 4, D and F, left panel) and moderate lo strong ld-3 
immunoreactivity. An example ot moderate ld-3 immuno- 
reactivity is shown in Figure 4G, and of strong immuno- 
reactivity in Figure 41 (left panel). In contrast, only faint 
ld-2 immunoreactivity was present in the ductal cells in 
the norma! pancreas (Figure 4E), whereas ld-3 immuno- 
reactivity in these cells was more variable and ranged 
from moderate to occasionally strong (Figure 4H). Islet 
celts and acinar cells were always devoid of Id immunore- 
activity. Preabsorption of the respective antibody with the 
blocking peptides specific for fd-2 (Figure 4F, right panel) 
and ld-3 (Figure 41. right panel) completely abolished irrv 
munoreaclivity. Analysis of serial pancreatic cancer sec- 
lions revealed that there was often colocafization of the 
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three Id proteins. An example of serial sections from a 
pancreatic cancer tissue is shown in Figure 4, A, D, and 6. 

ld-1, ld-2, and ld-3 immunoreaciivity was also present 
at moderate levels in the cytoplasm of ductal cells wilhin 
CP-like areas adjaceni to Ihe cancer cells (Figure 5, A-C). 
As in the normal pancreas, islet cells (outlined by arrow- 
heads) did not exhibit Id immunoreaciivity. In 4 of 9 CP 
samples, there were foci of ductal cell dysplasia of rela- 
tively large interlobular ducts, all ol which exhibited mod- 
erate to strong ld-1. ld-2, and ld-3 immunoreactivity (Fig- 
ure 5, D-F). Five of 9 CP samples also contained foci of 
■ large duels exhibiting atypical papillary epithelium. Serial 
section analysis ol one of those CP samples revealed 
mild to moderate Id- 1 and ld-2 immunoreactivity and 
weak ld-3 immunoreaclivily in the cells of these atypical 
papillary ducts (Figure 6, A-C). In contrast, in some of 
these CP samples, moderate to strong ld-3 immunoreac- 
tivity was also observed (Figure 6D). However, most of 
Ihe ductal cells forming the typical ductular structures ol 
CP, such as targe interlobular ducts and small proliferat- 
ing ducts, exhibited generally only weak to occasionally 
moderate Id imrnunoreactivify (data not shown). 



The immunohistochemical data for ld-1, ld-2, and ld-3 
are summarized in Table 1. In the case of ld-1 and ld-2. 
the cancer ceils as well as the dysplastic and atypical 
papillary ducts in CP exhibited a significantly higher 
score than the ductal cells in the normal pancreas. In 
contrast, due to the marked variability in ld-3 imrnuno- 
staining in the normal pancreas, the differences between 
normal and cancer cells and normal and dysplastic cells 
did not achieve statistical significance. 



Discussion 

Id proteins constitute a family ol HLFI transcription factors 
that are important regulators of cellular differentiation and 
pioliferation. 1 - 2 To date, four members of Ihe human Id 
family have been identified. 1 - 10 -" 12 Their expression is 
enhanced during cellular proliferation and in response to 
mitogenic stimuli. 19,31 and overexpression of Id genes 
inhibits differentiation and/or enhances proliferation in 
several different cell types. 1 *- 3 *- 34 The forced expression 
ol Id- 1 in mouse small intestinal epithelium results in 
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Table 1. Histological Scoring 




Id-t 


ld-2 


ld-3 


Normal {n - 6) Ductal cells 

Cancer (n = 10) Cancer cells 

CP {n ~ 9) TypicaJ CP lesions (n = 9) , 
Dysplastic ducts (n = 4) 
Atypical papillary ducts (n = 5). 


2.0 ± 0.4 
4.5* ± 0.5 
. 2.7 ± 0.5 
5.3^0.2 
4.4* ± 0.2 


2.3 ± 0.2 
5.2 s ±0.3 

3.1 ±0.6 
5.8* ± 0.2 
5.2* ± 0.2 


2.5 ± 0.9 
4.5 ± 0.6 
3.4 ± 0.7 
5.3 ± 0.4 
5.0 ± 0.4 



Scoring of. the histological specimens was performed as described in the Patienls and Methods section. Values are the means ± SD of the number 
of samples indicated in parenthesis. P values ate based on comparisons with the respective controls in the normal samples. 
\ P < 0.02; *P.< 0.01; *P = 0.004; § P = 0.001. 



adenoma formation in these animals. 35 The growth-pro- 
moting effects of Id genes are thought to occur through 
several mechanisms. For example, ld-2 can bind to mem- 
bers of the pRB tumor suppressor family, thus blocking 
their growth-suppressing activity, 20 ' 21 and ld-1 and ld-2 
can antagonize the bHLH-mediated activation of known 
inhibitors of ceil cycle progression such as the cyclirv 
. dependent kinase inhibitor p21. 23 

In the present study; we determined by Northern blot 
analysts that a significant percentage of human pancre- 
atic cancers expressed increased ld-1, ld-2, and ld-3 
mRNA levels. Increased expression was most evident for 
ld-1 (6.5-fold) and ld-2 (fivefold). In contrast, ld-3 mRNA 
levels were only twofold increased in the cancer samples, 
partly because this mRNA was present at relatively high 
levels in the normal pancreas. Immunhistochemical anal- 
ysis confirmed the presence of ld-1, ld-2, and ld-3 in the 
cancer cells within the tumor mass, whereas in the normal 
pancreas faint ld-1 and ld-2 immunoreaclivity and mod- 
erate to occasionally strong ld-3 immunoreactivity was 
present in some ductal cells. Pancreatic acinar and islet 
cells in the normal pancreas were devoid of ld-1 f fd-2, 
and Jd^3 immunoreactivity. In the cancer samples, all 
three Id proteins olten colocalized in the cancer cells. 
Coexpression of all three Id genes was also observed in 
cultured pancreatic cancer cell lines, which often exhib- 
ited a close correlation between Id mRNA and protein 
expression. However, in MIA-PaCa-2 there was a diver- 
gence of ld-2 mRNA and protein levels, and in PANC-1 
cells, ld-3 mRNA levels did not correlate well with ld-3 
protein expression. These observations suggest that in 
these cells, the hall-life of either Id mRNA or Id protein 
may be altered by comparison with the other cell fines. 
Interestingly, ld-2 immunoblorting revealed two closely 
spaced bands of approximately 16 and 18 kd in 4 of 5 
cell lines. In view of the fact that two possible initiation 
codons have been reported for the ld-2 gene, 36 our 
observation raises the possibility that the two ld-2-immu- 
noreactive bands may represent separate translation 
products of the ld-2 gene. 

Pancreatic cancers often harbor p53 tumor suppressor 
gene mutations 37 and exhibit alterations in apoptosts 
pathways. Thus, these cancers often exhibit increased 
expression of anti-apoptotic proteins such as Bcl-2 38 and 
abnormal resistance to Fas-ligand- mediated apoplo- 
sis 39 It has been shown recently that forced, constitutive 
expression ol Id genes together with the expression of 
anti-apoptotic genes such as Bcl-2 or BclX, can result in 



malignant transformation ol human fibroblasts, 11 raising 
the possibility that the enhanced Id expression in pan- 
creatic cancers together with increased expression of 
anti-apoptotic genes may contribute to the malignant 
potential of pancreatic cancer cells in v/Vo. 

In the CP tissues there was no significant increase in 
ld-1, ld-2, and ld-3 mRNA levels in comparison to the 
norma) pancreas. Immunohtstochemical analysis of pan- 
creatic cancer samples revealed colocaltzation of weak 
to moderate ld-1, ld-2, and ld-3 immunoreactivity in pro- 
liferating ductal cells in the CP-like regions adjacent to 
the cancer cells, indicating that Id expression was not 
restricted to the cancer cells. Similarly, analysis of CP 
samples indicated weak ld-1, ld-2, and ld-3 immunore- 
activity in the cells of small proliferating ducts and large 
ducts without dysplastic changes. In general, (here was a 
correlation between weak immunoreactivity and low Id 
mRNA levels. However, in samples that harbored large 
ducts with papillary structures there was moderate Id 
immunoreactivity, and in the cells forming dysplastic 
ducts there was moderate to .strong Id immunoreactivity. 
In these CP samples, Id mRNA levels were relatively 
higher than in the CP samples thai were devoid of these 
histological changes. Overall, however, increased Id ex- 
pression, most notably of ld-1 and ld-2, distinguished a 
subgroup of pancreatic cancers from CP (Table 1). 

Epidemiological studies have shown that the risk of 
developing pancreatic cancer is increased up to 16-fold 
in patients with pre-existing CP in comparison to the 
general population." 0 The mechanisms that contribute to 
neoplastic transformation in CP are not known. Although 
there is no established tumor progression model for pan- 
creatic cancer, such as the adenoma-carcinoma se- 
quence of colorectal carcinoma, 41 it is generally ac- 
cepted that K ras and p16 mutations occur relatively 
early in pancreatic carcinogenesis, whereas p53 muta- 
tions occur late in this process. 37 - 41 " 43 Increased Id ex- 
pression may contribute to malignant transformation ol 
cultured cell lines in vitro 11 and has been linked lo ceil 
invasion in a murine mammary epithelial cell line. 44 In 
view of the current findings that ld-1, ld-2, and ld-3 are 
overexpressed in pancreatic cancer and in dysplastic/ 
metaplastic ducts in CP, these observations raise the 
possibility that elevated levels of ld-1, ld-2. and, to a 
lesser extent, ld-3 may represent relatively early markers 
ot pancreatic malignant transformation and may contrib- 
ute to the pathobiology of pancrealic cancer. 
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c-fos and estrogen receptor gene expression pattern in the rat uterine 
epithelium during the estrous cycle. 

Mendoza-Rodn> ,,e7,rA McrchMH^anosH, Segura-VaM., MI. Moreno- 

Different studies in oyariectomized estrogen treated animals support the idea that c-fos 
play, a role in the proliferation of uterine epithelial cell, However, these srudTes invYte us 
to reassess the role played by c-fos in epithelial cell types of the endometrium 

™2;<Em J?" 1 study was Undertaken 10 determine c " fos - d 

receptor (ER) gene expression pattern in the rat uterine epithelium during the estrous 
cycle „ whuchnatural and cyclic changes of steroid hormones occur, an* corretatettese 

during (fae estrous cycle us.ng bromodeoxyuridine incorporation to DNA. ERalpha and 
beta proteins were assessed by immunohistochemistry. The regulation of c-fos gene 
both T£ h uT™ ° f intact « ,s d -ng the estrous cyde was evaluatelusL 
Sr« pS" hy ? nd : Zat,0n ^ immunohistochemistry. Estradiol (E(2)) and progesterone 
Sam m , 7'? Tf by radioi «™^assay. The rJlti indicated that 

^Ir^ 8 " 131 " Cp,the,ia (GE) preSented maximal Proliferation during the 

rTr^nf j ( ^ (D) ^ H ° Wever > durin 6 Proestrus (P) day only LE 

presented prohferat,on, and during the estrus (E) day only the stroma] cells proliferate! 
A marked immunostaining for ERalpha was detected in both LE and GE ceHsduringlhe 

un ete^le in " ^ P ** *» E h TOn ^t ERbeta Avas 

undetectable ,n both ep.theha dunng all stages of the cycle. The highest c-fos mRNA 
level was detected m both epithelia on the M day, followed by a significant reSn 
dunngtheomer aysofthecycle.Thehighes^ 

and D days, and the m.mmal value was detected on the E day. The c-Fos protein levd i„ 
prohferat.o„ pattern of th.s cell type. In conclusion, the overall results indicate that c-Fos 
^ToVgTT J T tk UtCrine e P ithe,ia « P^'feration of LE. In Z 

^SotktlT^cP nC l Wa ™TA d ' a,th ° Ugh n ° Sign, ' ficanl was 
cZ ~ I u ' C " f0S mRNA d,d n0t Strictl y corre,ate wi *h its protein levels 

c-fos seems to have a postranscriptional regulation in uterine epithelial cells duringlhe 
rats estrous cycle.. Copyright 2003 Wiley-Liss, Inc. . curing ine 
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(3HJMK-801 binding and the mRNA for the NMDAR1 subunit of the 
NMDA receptor arc differcutiaUy distributed in human and rat forebrain. 

Mcoi » p . Mugnaini M. Bunnemanu BIT Trist DG . Bowery NG . 

Department of Pharmacology, Medical School, University of Birmingham UK 
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The distributions of [3H]MK-801 binding and the NMDA NR1 subunit mRNA were 
studied using receptor autoradiography and in-situ hybridization in rat and human brain 
whole-hemisphere coronal sections. Receptor protein detected by radioligand 
autoradiography and the mRNA for the key subunit of the receptor presented similar 
distortions m the forebrain, with a few areas showing an imbalance between the levels 

reCept ° r Pr0tein - Human frontal cortex-showed a relative abundance of 
NMDAR1 mRNA as compared to [3HJMK-801 binding. The same area in rat brain did 
not show any difference in the two distributions. In comparison, the rat daustrum 
presented a relative excess of NMDAR1 mRNA which was not detected in human 
sections. Human caudate nucleus exhibited relatively high levels of [3H1MK-801 binding 
Aat were unmatched in rat caudate. The hippocampi ofeither species presented similar 
levels of [3HJMK-801 binding and NMDAR1 mRNA, but when the two signals were 
measured in specific subfields of the hippocampal formation, the differential distribution 
or the two signals reflected the anatomy of hippocampal connections assuming a 
preferential dendritic distribution for MK-801 binding: Interestingly, rat and human 
hippocampi also showed some important species-dependent difference in the relative 
distribution of the receptor protein and mRNA. The data presented show an overall good 
correlation between the mRNA for the key subunit of the NMDA receptor and the 
tonct.onal receptor detected with radioligand binding and highlight the presence of local 
differences in their ratio. This may reflect different splicing of the mRNA for (he 
NMDAR1 subumt in specific brain areas of rat and human. The species-dependent 
differences in the relative distribution of the mRNA for the key subunit of (he NMDA 
rC ^ Pt K^ d ° f 3 marker offu nctional receptors also highlights important differences 
in the NMDA function in rat and human brain. 
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Abstract 

Translation initiation is regulated in response to 
nutrient availability and mitogenic stimulation and is 
coupled with cell cycle progression and cell growth. 
Several alterations in translations! control occur in 
cancer. Variant mRNA sequences can alter the 
transnational efficiency of individual mRNA molecules, 
Which in turn play a role in cancer biology. Changes in 
the expression or availability of components of the 
translational machinery and in the activation of 
translation through signal transduction pathways can 
lead to more global changes, such as an increase in 
the overall rate of protein synthesis and translational 
activation of the mRNA molecules involved in cell 
growth and proliferation. We review the basic 
principles ol translational control, the alterations 
encountered in cancer, and selected therapies 
targeting translation initiation to help elucidate new 
therapeutic avenues. - 

Introduction 

The fundamental principle of molecular therapeutics in can- 
cer is to exploit the differences in gene expression between 
cancer cells and normal cells. With the advent of cDNA array 
technology, most efforts have concentrated on identifying 
differences in gene expression at the level of mRNA, which 
can be attributable either to DMA amplification or to differ- 
ences in transcription. Gene expression is quite complicated, 
however, and is also regulated at the level of mRNA stability,' 
mRNA translation, and protein stability. 

The power of translational regulation has been best recog- 
nized among devetoprnentaJ biologists, because transcription 
does not occur in earfy embryogenesis in eukaryotes. For ex- 
ample, in Xenopus, the period of transcriptional quiescence 
continues until the embryo reaches midblastula transition, the 
4000-cell stage. Therefore, ail necessary mRNA molecules are 
transcribed during oogenesis and stockpiled in a translationally 
inactive, masked form. The mRNA are Uanslationalfy activated 
at appropriate times during oocyte maturation, fertilization, and 
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early embryogenesis and thus, are under strict translational 
control. 

Translation has an established role in cell growth. Basi- 
cally, an increase in protein synthesis occurs as a. conse- 
quence of mitogenesis. Until recently, however, little was 
known about the afteralioris In mRNA translation in cancer, 
and much is yet to be discovered about their role in the 
development and progression of cancer. Here we review the 
basic principles of translational control, the alterations en- 
countered in cancer, and selected therapies targeting transte- . 
tion initiation to elucidate potential new therapeutic avenues. 

Basic Principles ol Translational Control 
Mechanism of Translation initiation 
Translation initiation is the main step in translational regulation. 
Translation initiation is a complex process in which the initiator 
tRNA and the 40S and 60S ribosomnJ subunits are recruited to 
the 5' end of a mRNA molecule and assembled by eukaryotic 
translation initiation factors into an 80S ribosome at the start 
codon of the mRNA (Fig. 1). The 5' end of eukaryotic mRNA is 
capped, le. t contains the cap structure m 7 GpppN (7-methyI- 
guarwsine-triphospho-S'-nT^ucleoside). Most translation in 
eukaryotes occurs in a cap-dependent fashion, /.e., the cap is 
specifically recognized by the eINE, 3 which binds the 5' cap. 
The elF4F translation initiation complex is then formed by the 
assembly of etF4E,.the RNA helicase elF4A, and eIF4G, a 
scaffolding protein that mediates the binding of the 40S ribo- 
somal subunit to the mRNA molecule through interaction with 
the elF3 protein present on the 40S ribosome. elF4A and e(F4B 
participate in melting the secondary structure of the 5' UTR of 
the mRNA The 43S initiation complex {40S/elF2/Met-tRNA/ 
GTP complex} scans the mRNA in a 5'-*3' direction until it 
encounters an AUG start codon. This start codon is then base- 
paired to the anticodort of initiator tRNA, forming the 48S initi- 
ation complex. The initiation factors are then displaced from the 
48S complex, and the 60S ribosome joins to form the 80S 
ribosome. 

Unlike most eukaryotic translation, translation initiation of 
certain mRNAs, such as the picomavirus RNA, is cap inde- 
pendent and occurs by internal ribosome entry. This mecha- 
nism does not requ»e elF4E. Either the 43S complex can bind 
the initiation codon directly through interaction with the IRES in 
the 5' UTR such as in the erxxc^alomyocarditis virus, or it can 



The abbrevbfcns used are: etME. eukaryotic initiation (actor 4£; UTR 
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Binding of cIF4F complex to 
mRHA 5* cap 




Unwinding of secondary structure 
and formation of AZS 




Fig. 1. Translation initiation in eukaryoles. The <>E-BPs are hyperphos- 
phorytated to release etF4E so that it can interact with the 5' cap, and the 
e!F4F initiation complex is assembled. The interaction ol poty(A) binding 
protein with the initiation complex and circularization oi the mRNA is not 
depicted in the diagram. The secondary structure ol the 5' UTH is melted, 
the 4 OS ribosomat subunil is bound to CIF3, and the ternary complex 
consisting ol elF2, GTP, and the Met-tRNA are recruited to the mRNA. The 
ribosome scans the mRNA in a 5'-*3' direction until an AUG start codon 
is found in the appropriate sequence context The iniliaikm factors are 
released, and the targe ribosomal subunil b recruited. 



initially attach to the IRES and then reach the initiation codon by 
scanning or transfer, as is the case with the pofiovirus (1). 

Regulation of Translation Initiation 
Translation initiation can be regulated by alterations in the 
expression or phosphorylation status of the various factors 
involved. Key components in translation^ regulation that 
may provide potential therapeutic targets follow. 

elRE. elF4E plays a central rote in translation regulation. 
It is the least abundant of the initiation factors and is con- 
sidered the rate-limiting component for initiation of cap- 
dependent translation. etF4E may also be involved in mRNA 
splicing, mRNA 3'*processing, and mRNA nucleocytoplas- 
mic transport (2). elF4E expression can be increased at the 
transcriptional level in response to serum, or growth factors 
(3). elF4E overexpression may cause preferential translation 
Of mRNAs containing excessive secondary structure in their 
5' UTR that are normally discriminated against by the trans- 



lations) machinery and thus are inefficiently translated (4-7). 
As examples of this, overexpression of elF4E promotes in- 
creased translation of vascular endothelial growth factor, 
fibroblast growth factor-2, and cyclin D1 £?, 8, 9). 

Another mechanism of control is the regulation of elF4E 
phosphorylation. elF4E phosphorylation is mediated by the 
mitogen-activated protein kinase-interacting kinase t, which 
is activated by the mitogen-activated pathway activating 
extraceRufer signal-related kinases and the stress-activated 
pathway acting through p38 mitogen-activated protein ki- 
nase (10-13). Several mitogens, such as serum, platelet- 
derived growth factor, epidermal growth factor, insulin, 
angiotensin II, sre kinase overexpression, and ras over- 
expression, lead to e!F4E phosphorylation (14). The phos- 
phorylation status of elF4E is usually correlated with the 
translations rate and growth status of the cell; however; 
elF4E phosphorylation has also been observed In response 
to some cellular stresses when transtalional rates actually 
decrease (15). Thus, further study is needed to understand 
the effects of e)F4E phosphorylation on elF4E activity. 

Another mechanism of regulation is the alteration of elF4E 
availability by the binding of elF4E to the e!F4E-binding pro- 
teins (4E-BP, also known as PHAS-t). 4E-BPs compete with 
e!F4G for a binding site in elF4E. The binding of elF4E to the 
best characterized elF4E-blnding protein, 4E-BP1, is regur 
lated by 4E-BP1 phosphorylation. Hypophosphorylated 4E- 
BP1 binds to elF4E, whereas 4E-BP1 rryperphosphorylation 
decreases this binding. Insulin, angiotensin, epidermal 
growth factor, platelet-derived growth factor, hepatocyte 
growth factor, nerve growth factor, insulin-like growth factors 
I and II, interleukin 3, granulocyte- macrophage colony-stim- 
ulating factor + steel factor, gastrin, and the adenovirus have 
all been reported to induce phosphorylation of 4E-BP1 and 
to. decrease the ability of 4E-BP1 to bind elF4E (15, 16). 
Conversely, deprivation of .nutrients or growth factors result*- 
in 4E-BP1 dephosphorylation, an increase in elF4E binding, 
and a decrease in cap-dependent translation. 

p70 S6 Kinase. Phosphorylation of ribosomal 40S protein 
S6 by S6K is thought to play an important role in translational 
regulation. S6K -/- mouse embryonic cells proliferate more 
slowly than do parental cells, demonstrating that S6K has a 
positive influence on ceH proliferation (17). S6K regulates the 
translation of a group of mRNAs possessing a 5' terminal 
oltgopyr imidine tract (5' TOP) found at the 5' UTR of ribosomal 
protein mRNAs and other mRNAs coding for components of 
the translational machinery. Phosphorylation of S6K is regu- 
lated in part based on the availability of nutrients (1 8, 1 9) and is 
stimulated by several growth factors, such as platelet-derived 
growth factor and insulin-like growth factor I (20). 

eiF2cr Phosphorylation. The binding of the initiator tRNA 
to the small ribosomal unit is mediated by translation initia- 
tion factor elF2. Phosphorylation ot the a-subunit ol elF2 
prevents formation of the elF2/GTP/MeMRNA complex and 
inhibits global protein synthesis (21 , 22). elF2a is phospho- 
rylated under a variety of conditions, such as viral infection, 
nutrient deprivation, heme deprivation, and apoptosis (22). 
elF2a is phosphorylated by heme-regulated inhibitor, nutrient- 
regulated protein kinase, and the IFNnnduced. double- 
stranded RNA-activated protein kinase (PKR; Ref. 23). 
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The mTOR Signaling Pathway. The macroRde antibiotic 
rapamycin (Siralimus; Wyeth-Ayerst Research, Cofiegeviile, 
PA) has been the subject of intensive study because it irv 
hibrts signal trar«cHictfon r^thways involved in T-ceB activa- 
tion. The rapamydn-sensitive component of these pathways 
is mTOR (also called FRAP or RAFT1). mTOR is the mam- 
malian homologue of the yeast TOR proteins that regulate G, 
progression and translation in response .to nutrient available 
ity (24). mTOR is a serine-threonine kinase that modulates 
translation initiation by altering the phosphorylation status of 
4E : BP1 and S6K (Fig. 2; Ret 25). 

4&43P1 is phosphorytated on multiple residues. mTOR pbos- 
phorylates theThr-37 and Thr-46 residues of 4E-BP1 in vitro 
(26); however, phosphorylation at these sites is not associated 
with a loss of efF4E binding. Phosphorylation of Thr-37 and 
Thr-46 is required for subsequent phosphoryfation at several 
COOH-tenronal, serum-sensitive sites; a combination of these 
prwsfjhorylation events appears to be needed to inhibit the 
binding of 4E-BP1 to eIF4E (25). The product of the ATM gene, 
p3a/MSK1 pathway, and protein kinase C^afeo play a role in 
4E-BP1 phosphorylation (27-29). 

S6K and 4E-BP1 are also regulated, in part, by PI3K and its 
downstream protein kinase Akt. PTEN is a phosphatase that 
negatively regulates PBK signaling. PTEN nun cells have 
constitutive^ active of Akt. with increased S6K activity and 
S6 phosphorylation (30). S6K activity is inhibited both by 
PI3K inhibitors wortmannin and LY294002 and by mTOR 
inhibitor rapamycin (24). Akt phosphorylates Ser-2448 in 
mTOR in vitro, and this site is phosphorylated upon Akt 
activation in vivo (31-33). Thus, mTOR is regulated by the 
PI3K/AW pathway; however, this does not appear to be the 
only mode of regulation of mTOR activity. Whether the PI3K 
pathway also regulates S6K and 4E-BP1 phosphorylation 
independent of mTOR is controversial. 

Interestingly, mTOR autophosphorylation is blocked by wort- 
mannm but not by rapamycin (34). This s^ming inconsistency 
suggests that mTOR-responsive regulation of 4E-BP1 and S6K 
activity occurs through a mechanism other than htrinsicmTOR 
kinase activity. An alternate pathway for 4E-BP1 and S6K phos- 
phorylation by mTOR activity is by the inhibition of a phospha- 
tase. Treatment with carycuBn A, an inhibitor of phosphatases 1 
and 2A, reduces raparnycin-induced dephosphorylatfon of 4E- 
BP1 and S6K by rapamycin 05). PP2A interacts with fufMength 
S6K but not with a S6K mutant that is resistant to dephospho- 
rylalion resulting from rapamycin. mTOR phosphorylates PP2A 
>n vitro; however, how this process alters PP2A activity is not 
known. These results are consistent with the model that phos- 
- phorylat/on of a phosphatase by mTOR prevents dephospho- 
rybtron of 4E-BP1 and S6rC and conversely, that nutrient dep- 
nvaton and rapamycin block inhibition of the phosphatase by 

Polyadenylation. The poty(A) tail in eukaryotic mRNA is 
*mportant in enhancing translation initiation and mRNA sta- 
bility. Poryadbnylation plays a key role in regulating gene 
express™ during oogenesis and early embryogenesis 
Some mRNA that are translationalfy inactive in the oocyte are 
polyadenylated concomitantly with transitional activation in 
oocyte maturation, whereas other mRNAs that are transit 
honally aclive during oogenesis are deadenylated and trans- 
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Cap-dependent Translation 

(mRNAs with highly structured 5' UTRs) 

Ctf <> Re9 ? aU °? rf*** 1 *" W^tion by signal transaction path- 
^ agnate* v,a p38. extraedtubr signal-rented kinase, PKJK, and 
mTOR can afl actfvate translation initiation. 



lationally silenced (36-58). Thus, control of porv(A} tail syn- 
thesis is an important regulatory step in gene expression 
The 5' cap and pory(A) tail are thought to function synergis- 
ttcally to regulate mRNA translational efficiency (39, 40) 

RNA Packaging. Most RNA-binding proteins ar e assem- 
bled on a transcript at the time of transcription, thus deter- 
mining the translational fate of the transcript (41) A highly 
conserved family of Y-box proteins is found in cytoplasmic 
messenger ribonucleoprotein particles, where the proteins 
are thought to play a role in restricting the recruitment of 
mRNA to the translational machinery (41-43). The major 
mRNA-associated protein, VB-1, destabilizes the interaction 
of elF4E and the 5' mRNA cap in vitro, and overexpression of 
YB-1 results in translational repression in vivo (44) Thus 
alterations in RNA packaging can also play an important role 
in translational regulation. 

Translation Alterations Encountered in Cancer 
Three main alterations at the translational level occur in cancer 
variations in mRNA sequences that increase or decrease trans- 
anal effioency, changes in the expression or availability of 
components of the translational machinery, and activation of 
translatJon through abenantly activated signal transduction 
pathways. The first alteration affects the translation of an indi- 
vidual mRNA that may play a role in carcinogenesis. The sec- 
ond and third alterations can lead to more global changes such 
as an increase in the overall rate of protein synthesis and the 
translational activation of several mRNA species. 

Variations in mRNA Sequence 

Variations in mRNA sequence attect the translational effi- 
cency of the transcript. A brief description of these variations 
and examples of each mechanism follow 

Motions. Mutations in tho mRNA sequence, especially 
in the 5 UTR, can alter its translational efficiency, as seen in 
the following examples. 
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c-myc. Sarto et at proposed that translation of full-length 
c-myc is repressed, whereas in several Burkitt lymphomas 
that have deletions of the mRNA 5' UTR, translation of c-myc 
is more efficient (45). More recently, it was reported that the 
5' UTR of c-myc contains an IRES, and thus c-myc transla- 
tion can be initiated by a cap-independent as well as a 
cap-dependent mechanism (46, 47). | n patients with multiple 
myeloma, a C-^T mutation in the c-myc IRES was identified 
(48) and found to cause an enhanced initiation of translation 
via internal ribosomal entry (49). 

BRCA 1. A somatic point mutation (1 17 G-*Q in position 
-3 with respect to the start codon of the BRCA1 gene was 
identified in a highly aggressive sporadic breast cancer (50). 
Chimeric constructs consisting of the wild-type or mutated 
BRCA1 5' UTR and a downstream luciferase reporter dem- 
onstrated a decrease in the translation^ efficiency with the 5* 
UTR mutation. 

.Cyclin-dependent Kinase Inhibitor 2A. Some inherited 
melanoma kindreds have a G-*T transversion at base -34 
of cydin-dependent kinase inhibitor-^, which encodes a 
cyan-dependent kinase 4/cycIin- dependent kinase 6 kinase 
mhfbitor important in G, checkpoint regulation (51). This 
mutation gives rise to a novel AUG translation initiation 
codon, creating an upstream open reading frame that com- 
petes for scanning ribosomes and decreases translation 
from the wild-type AUG. 

Alternate Splicing and Alternate Transcription Start 
Sites. Alterations in splicing and alternate transcription sites 
can lead to variations in 5' UTR sequence, length, and second- 
ary structure, ultimately impacting translation^ efficiency. 

ATM. The ATM gene has four noncoding exons in its 5' 
UIR that undergo extensive alternative splicing (52) The 
contents of 12 different 5' UTRs that show considerable . 
diversity m length and sequence have been identified These 
divergent 5' leader sequences play an important role in the 
translations! regulation of the ATM gene. 

mdm. In a subset of tumors, overexpression of the onco- 
protem mdm2 results in enhanced translation of the mdm2 
mRNA Use of different promoters leads to two mdm2 tran- 
scripts that differ only in their 5' leaders (53). The longer 5' 
UTB conta f ns two upstream open reading frames, and this 
mRNA is loaded with ribosomes inefficiently compared with 
the short 5' UTR. 

BRCA1. In a normal mammary gland. BRCA1 mRNA is 
expressed with a shorter leader sequence (5'UTRa) whereas 
in sporadrc breast cancer tissue, BRCA1 mRNA is expressed 
' with a longer leader sequence (5' UTRb); the translations 
emc.ency of transcripts containing 5' UTRb is 10 times lower 
man that of transcripts containing 5' UTRa (54) 

TGF-p3> TGF-03 mRNA includes a 1.1 -kb 5' UTR which 
exerts an inhibitory effect on translation. Many human breast 
cancer ceil lines contain a novel TGF-03 transcript with a 5' 
UTR hat B 870 nucleotides shorter and has a 7-fold greater 
translahonal efficiency than. the normal TGF-03 mRNA (55) 
Alternate Polyadenylation Sites. Multiple polyadenyl- 
^on signals leading ,o the generation ol several transcripls 
w.th diffenng 3* UTR have been described lor several mRNA - 
spec.es. such as the RET proto-oncogene (56). ATM gene 
i^l tissue mlnb.tor of metalloproteinases-3 (57). RHOA 



protc-oncogene (58), and calmodulin-l (59). Although the 
effect of these alternate 3' UTRs on translation is not yet 
known, they may be important in RNA-protein interactions 
that affect translational recruitment- The role of these alter- 
ations in cancer development and progression is unknown. 

Alterations in the Components of the 
Translation Machinery 

Alterations in the components of translation machinery can 
take many forms. 

Overexpresssion of elF4E. Overexpression of elF4E 
causes malignant transformation in rodent cells (60) and the 
deregulation of HeLa cell growth (61). Polunovsky etal. (62) 
found that elF4E overexpression substitutes for serum and 
individual growth factors in preserving viability of fibroblasts, 
which suggests that elF4E can mediate both proliferative and 
survival signaling. 

Elevated levels of elF4E mRNA have been found in a broad 
spectrum of transformed ceil lines (63). elF4E levels are 
elevated in all ductal carcinoma in situ specimens and inva- 
sive ductal carcinomas, compared with benign breast spec- 
imens evaluated with Western blot analysis (64, 65). Prelim- 
inary studies suggest that this overexpression is attributable 
to gene amplification (66). 

There are accumulating data suggesting that e!F4E overex- 
pression can be valuable as a prognostic marker. eIF4E over- 
expression was found in a retrospective study to be a marker of 
poor prognosis in stages I to III breast carcinoma (67). Verifica- 
tion of the prognostic value of elF4E in breast cancer is now 
under way in a prospective trial (67). However, in a different 
study, elF4E expression was correlated with the aggressive 
behavior of norvHodgkins lymphomas (68). In a prospective 
analysis of patients with head and neck cancer, elevated levels 
of e!F4E in histologically tumor-free surgical margins predicted 
a significantly increased risk of local-regional recurrence (9). 
These results all suggest that elF4E overexpression can be 
used to select patients who might benefit from more aggressive 
systemic therapy. Furthermore, theheadand neck cancer data 
suggest that e!F4E overexpression is a field defect and can be 
used to guide local therapy. 

Alterations in Crtter Initiation Factors. Alterations in a 
number of other initiation factors have been associated with 
cancer. Overproduction of e(F4G, similar to elF4E, leads to 
malignant transformation in vitro (69). e(F-2a is found in 
increased levels in bronchioloatveolar carcinomas of the lung 
(3). Initiation factor elF-4A1 is overexpressed in melanoma 
(70) and hepatocellular carcinoma (71). The p40 subunil of 
translation initiation factor 3 is amplified and overexpressed 
in breast and prostate cancer (72), and the e»F3-pt 10 subunit 
is overexpressed in testicular seminoma (73). The role that 
overexpression of these initiation factors plays on the devel- ' 
opment and progression of cancer, if any, is not known. ; 

Overexpression of S6K. S6K is amplified and highly 
overexpressed in the MCF7 breast cancer cell tine, com- 
pared with normal mammary epithelium (74).. In a study by 
Barlund et at (74), S6K was amplified in 59 of 668 primary 
breast tumors, and a statistically significant association was 
observed between amplification and poor prognosis. 
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Overexpression of PAP. PAP catalyzes 3' poly(A). syn- 
thesis. PAP is overexpressed in human cancer cefls com- 
pared with norma) and viralfy transformed cells (75). PAP 
enzymatic activity in breast tumors has been correlated with 
PAP protein levels (76) and, in mammary tumor cytosols, was 
found to be an independent factor for predicting survival (76). 
Little is known, however, about how PAP expression or ac- 
tivity affects the translational profile. 

Alterations in RNA-binding Proteins. Even less is known 
about alterations in RMA packaging in cancer. Increased ex- 
pression and nuclear localization of the RNA-binding protein 
YB-1 are indicators of a poor prognosis for breast cancer (77), 
non-small cell lung cancer (78), and ovarian cancer (79). How- 
ever, this effect may be mediated at least in part at the level of 
transcription, because YB-1 increases chemoresistance by en- 
hancing the transcription of a multidrug resistance gene (80). 



Activation of Signat transduction Pathways 
Activation of signal transduction pathways by loss of tumor 
suppressor genes or overexpression of certain tyrosine kinases 
can contribute to the growth and aggressiveness of tumors. An 
important mutant in human cancers is the tumor suppressor 
gene P7EW, which leads to the activation of the PI3K/Akt path- 
way. Activation of P13K and Akt induces the oncogenic trans- 
formation of chicken embryo fibroblasts. The transformed cells 
show constitutive phosphorylation of S6K and of 4E-BP1 (81). 
A mutant Akt that retains kinase activity but does not phos- 
pborylate S6K or 4E-BP1 does not transform fibroblasts, which 
suggests a correlation between the oncogenicity of PJ3K and 
Akt and the phosphorylation of S6K and 4E-BP1 (81). 

Several tyrosine kinases such as platelet-derived growth 
factor, insulin- like growth factor, HER2/neu, and epidermal 
growth factor receptor are overexpressed in cancer. Be- 
cause these kinases activate downstream signal transduc- 
tion pathways known to alter translation initiation, activation 
of translation is likety to contribute to the growth and aggres- 
siveness of these tumors. Furthermore, the mRNA for many 
of these kinases themselves are under translational control 
For example, HER2/neu mRNA is translattonally controlled 
both by a short upstream open reading frame that represses 
HER2/neu translation in a cell type-independent manner and 
by a distinct cell type- dependent mechanism that increases 
translational efficiency (82). HER2/neu translation is different 
in transformed and normal cells. Thus, it is possible that 
alterations at the translational level can. in part account for 
the discrepancy between HER2/neu gene amplification de- 
tected by fluorescence in situ hybridization and protein levels 
detected by immunohistochemical assays. 



Translation Targets of Selected Cancer Therapy 
Components of the translation machinery and signal path- 
ways involved in the activation of translation initiation repre- 
sent good targets for cancer therapy. 

Targeting the mTOR Signaling Pathway: Rapamycin 
and Tumstattn 

Rapamycin inhibits the proliferation of lymphocytes. It was 
initially developed as an immunosuppressive drug lor organ 



transplantation. Rapamycin with FKBP 12 (FK506-binding 
protein, U t 12,000) binds to mTOR to inhibit Hs function. 

Rapamycin causes a small but significant reduction in the 
Initiation rate of protein synthesis (83). It blocks cell growth in 
part by blocking S6 phosphorylation and selectively sup- 
pressing the translation of 5' TOP mRNAs, such as ribosorria! 
proteins, and elongation factors (83-85). Rapamycin also 
blocks 4E-BP1 phosphorylation and inhibits cap-dependent 
but not cap- independent translation (17, 86). 

The rapamveirvserertive signal transduction pathway, acti- 
vated during malignant transformation and cancer progression, 
is how being studied as a target for cancer therapy (87). Pros- 
tate, breast, small cell lung, glioblastoma, melanoma, and T-cell 
leukemia are among the cancer lines most sensitive to the 
rapamycin analogue CCI-779 (Wyeth-Ayerst Research; Ref. 
87). In rhabdomycosarcoma cell fines, rapamycin is either cyto- 
static or cytoddal, depending on the p53 status of the cell; p53 
wild-type cells treated with rapamycin arrest in the G t phase 
and maintain their viability, whereas p53 mutant cells accumu- 
late in G, and undergo apoptosis (88, 89). In a recently reported 
study using human primitive neuroectodermal tumor and 
medulbbfastonria models, rapamycin exhibited more cytotox- 
icity in combination with dsplatin and camptotbecin than as a 
single agent In vivo, CCI-779 delayed growth of xenografts by 
1 60% after 1 week of therapy and 240% after 2 weeks. A single 
high-dose administration caused a 37% decrease in tumor 
volume. Growth inhibition in wvo was 1.3 times greater, with 
cisplatin in combination with CCI-779 than with cispfatin alone 

(90) . Thus, preclinical studies suggest that rapamycin ana- 
logues are useful as single agents and in combination with 
chemotherapy; 

Rapamycin analogues CCI-779 and RAD001 (Novartis, 
Basel, Switzerland) are now in clinical trials. Because of the 
known effect of rapamycin on lymphocyte proliferation, a 
potential problem with rapamycin analogues is immunosup- 
pression. However, although prolonged immunosuppression 
can result from rapamycin and CCI-779 administered on 
continuous-dose schedules, the immunosuppressive effects 
of rapamycin analogues resolve in -24 h after therapy 

(91) . The principal toxicities of CCI-779 have included der- 
matological toxicity, myelosuppiession, infection, mucositis, 
diarrhea, reversible elevations in liver function tests, hyper : 
glycemia, hypokalemia, hypocalcemia, and depression (87, 
92-94). Phase II trials of CCI-779 have been conducted in 
advanced renal cell carcinoma and in stage lll/IV breast 
carcinoma patients who failed with prior chemotherapy. In 
the results reported in abstract form, although there were no 
complete responses, partial responses were documented in 
both renal cell carcinoma and in breast carcinoma (94, 95). 
Thus, CCI-779 has documented preliminary clinical activity in 
a previously treated, unselected patient population. 

Active investigation is under way into patient selection for 
mTOR inhibitors. Several studies have found an enhanced 
efficacy of CCI-779 in PTEN-null tumors (30, 96). Another 
study found that six of eight breast cancer cell lines were 
responsive to CCI-779, although only two of these lines 
lacked PTEN (97) There Was, howevei, a positive correlation 
between Akt activation and CCI-779 sensitivity (97). This 
correlation suggests that activation of the PI3K-AW pathway, 
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regardless of whether it is attributable to a PTEN mutation or 
to ovef expression of receptor tyrosine kinases, makes can- 
cer cell amenable to mTOR-directed therapy. In contrast, 
lower levels of the target of mTOR, 4E-BP1, are associated 
with rapamycin resistance; thus, a lower 4E-BP1/elF4£ ratio 
may predict rapamycin resistance (98). 

Another mode of activity for rapamycin and its analogues 
appears to be through inhibition of angiogenesis. This activ- 
ity may be both through direct inhibition of endothelial cell 
proliferation as a result of mTOR inhibition in these cells or by 
inhibition of translation of such proangiogenic factors as 
vascular endothelial growth factor in tumor cells (99, 100). 

The angiogenesis inhibitor tumstatin, another anticancer 
drug currently under study, was also found recently to inhibit 
translation in endothelial cells (101). Through a requisite in- 
teraction with integrin, tumstatin inhibits activation of the 
P13K/Akt pathway and mTOR in endothelial cells and pre- 
vents dissociation of elF4E from 4E-BP1, thereby inhibiting 
cap-dependent translation. These findings suggest that en- 
dothelial cells are especially sensitive to therapies targeting 
the mTOR- signaling pathway. 



ment afeo reduces the expression of angiogenic factors (115) 
and has been proposed as a potential adjuvant therapy for head 
and neck cancers, particularly when elevated etF4E is found in 
surgical margins. Small molecule inhibitors that bind the eWAGJ 
4E-BP1 -binding domain of elF4E are proapoptotic (116) and 
are also being actively pursued. 



Exploiting Selective Translation for Gene Therapy 
A different therapeutic approach that takes advantage of the 
enhanced cap-dependent translation in cancer cells is the use 
of gene therapy vectors encoding suicide genes with highly 
structured 5' UTFL These mFlNA would thus be at a competitive 
disadvantage in rorrnal cells and not translate weH, whereas in 
cancer cells, they would translate more efficiently. For example, 
the introduction of the 5' UTFt of fibroblast growth factor-2 5' to 
the coding sequence of herpes simplex virus type- 7 thymidine 
kinase gene, allows for selective translation of herpes simplex 
virus type-1 thymidine kinase gene in breast cancer cell lines 
compared with normal mammary cefl fines and results in se- 
lective sensitivity to ganciclovir (1 17). 



Targeting elF2cc EPA, Clotrimazole, mda-7, 
and Fiavonoids 

EPA is an n-3 polyunsaturated tatty acid found in the fish- 
based diets of populations having a low incidence of cancer 
(102). EPA inhibits the proliferation of cancer cells (103), as 
well as in animal models (104, 105). It blocks cell division by 
inhibiting translation initiation (105). EPA releases Ca 2 * from 
intracellular stores while inhibiting their refilling, thereby ac- 
tivating PKR. PKR, in turn phosphorylates and inhibits elF2a, 
resulting in the inhibition of protein synthesis at the level of 
translation initiation. Similarly, clotrimazole, a potent antipro- 
liferative agent in vitro and in vivo, inhibits cell growth through 
depletion of Ca 2 ^ stores, activation of PKR, and phospho- 
rylation of elF2a (106). Consequently, clotrimazole preferen- 
tially decreases the expression of cyclins A, E, and D1, 
resulting in blockage of the cell cycle in G v 

mda-7 is a novel tumor suppressor gene being developed 
as a gene therapy agent. Adenoviral transfer of mda~J (Ad- 
mda7) induces apoptosis in many cancer cells including 
breast, colorectal, and lung cancer (107-109). Ad-mda7 also 
induces and activates PKR, which leads to phosphorylation 
of elF2a and induction of apoptosis (110). 

Fiavonoids such as genistein and quercetin suppress tu- 
mor cell growth. All three mammalian elF2a kinases, PKR, 
heme- regulated inhibitor, and PERK/PEK, are activated by 
fiavonoids, with phosphorylation of elF2a and inhibition of 
protein synthesis (111). 



Targeting elF4A and e)F4E: Antisense RNA 
and Peptides 

Antisense expression of elF4A decreases the proliferation rate 
of melanoma cells (112). Sequestration of elF4E by ovcrexpres- 
sion ot 4E-BP1 is proapoplolic and decreases tumorigenicrty 
(113, 114). Reduction of elF4E with antisense RNA deer eases 
soft agar growth, increases tumor latency, and increases the 
rates of tumor doubling times (7). Antisense e!F4E RNA treat- 



Toward the Future 

Translation is a crucial process in every cell. However, several 
alterations in translationat control occur in cancer. Cancer cells 
appear to need an aberrantly activated translational state for 
survival, thus allowing the targeting of translation initiation with 
surprisingly low toxicity. Components of the translational ma- 
chinery, such as elF4E, and signal transduction pathways in- 
volved in translation initiation, such mTOR, represent promising 
targets for cancer therapy. Inhibitors of the mTOR have already 
shown some preliminary activity in clinical trials. It is possible 
that with the development of better predictive markers and 
better patient selection, response rates to single-agent therapy 
can be improved. Similar to other cytostatic agents, however, 
mTOR inhibitors are most likely to achieve clinical utility in 
combination therapy. In the interim, pur increasing understand- 
ing of translation initiation and signal transduction pathways 
promise to lead to the identification of new therapeutic targets 
in the near future. 
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Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in 
human membranous nephropathy. 

Mezzano SA , Proguett MA, Burgos ME . Ardiles LG . Arcs CA . Caorsi I . Egido J . 

Division of Nephrology, School of Medicine, Universidad Austral, Valdivia Chile 
smezzano@uach.cl ' ' 

Overexpression of chemokines, fibrogenic cytokines, and myofibroblasts in human 
membranous nephropathy. BACKGROUND: Proteinuria plays a central role in the 
progression of glomerular disease, and there is growing evidence suggesting that it may 
determine tubular cell activation with release of chemokines and fibrogenic factors 
leading to interstitial inflammatory reaction. However, most studies on this subject have 
been performed in experimental models, and the experience in human kidney biopsies 
has been scarce. We analyzed the tissue sections of patients with idiopathic membranous 
nephropathy (DVTN), a. noninflammatory glomenilar disease that may follow a progressive 
disease w lt h heavy persistent proteinuria, interstitial cell infiltration, and decline of renal 
function. METHODS: Paraffin-embedded biopsy specimens from 25 patients with DMN 
(13 progressive and 1 2 nonprogressive) were retrospectively studied by 
immunohistochemistry [monocyte chemoattractant protein- 1 (MCP- 1 ), regulated on 
activation normal T-cell expressed and secreted chemokine (RANTES), osteopontin 
(OPN), platelet-derived growth factor-BB (PD-GF-BB)] and in situ hybridization [MCP- 
I , RANTES, PDGF-BB, transforming growth factor-beta I (TGF-beta 1 )]. Moreover we 
studied the presence of myofibroblasts, which were identified by the expression of alpha- 
smooth muscle actin (alpha-SMA), the monocytes/macrophages (CD68-positive cells) 
and T-cell infiltration (CD4+ and CD8+ cells). All of the patients were nephrotic and * 
without treatment at time of the biopsy. RESULTS: A strong up-regulation of MCP-1 , 
RANTES, and OPN expression was observed, mainly in tubular epithelial cells with a 
significant major intensity in the progressive IMN patients. A strong correlation between 
the mRNA expression and the corresponding protein was noted. The presence of these 
chemokines and OPN was associated with interstitial cell infiltration. TGF-beta and 
PDGF were also up-regulated, mainly in tubular epithelial cells, with a stronger 
expression in the progressive IMN, and an association with the presence of 
myofibroblasts was found. CONCLUSIONS: Patients with severe proteinuria and 
progressive IMN have an overexpression in tubular epithelial cells of the chemokines 
MCP-1, RANTES, and OPN and the proftbrogenic cytokines PDGF-BB and TGF-beta 
Because this up-regulation was associated with an interstitial accumulation of 
mononuclear cells and an increase in myo fibroblastic activity, it is suggested that those 
mediators are potential predictors of progression in IMN. Finally, based on experimental 
data and the findings of this article, we speculate that severe proteinuria is the main factor 
responsible for the up-reguiation of these factors in tubular epithelial cells. 
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Decreased uncoupling protein expression and intramyocytic triglyceride 
depiction in formerly obese subjects. 
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™ CTIVE: T ° e ? mine the mUscuIar unc °upling protein expression 2 (UCP2) and 

g^e expression in morbid obese subjects before and after bariatric surgery [bilio- 
pancreat.cdwers.onCBPD)]. RESEARCH METHODS AND PROCEDURES Eleven 

BPD e sS e M (BM : = r^" 2 ^ m(2)) ^ Stud ^"^-BPD and 24 monSer 
BPD. Skeletal muscle UCP2 and UCP3 mRNA was measured using reverse 
rransenptase-competitive polymerase chain reaction and UCP3 protein by Western 
Wotting Intramyocytic triglycerides were quantified by high-performance liquid 
chromatography. Twenty-four-hour energy expenditure and respiratory quotient (RO) 
were measured in a respiratory chamber. RESULTS: After BPD, die average weUt loss 

TTalTT^, T' 0 - N ° npr0tein RQ WaS inCreaSed in the P-^ese sub e^ 73 
t 0 ?6 to 0 ^ "nnr 1 ,ntram ^ ic trigl^ride level dropped (3 66 

UC?2 tnd licit If J**™ m f ° f frCSh tiSSUC ' P < 0 0 ° 01 > after BPD " Ex P r ^n of 
UCP2 and UCP3 mRNA was significantly reduced (from 35.9 +/- 6.1% to 18 6 +/- 4 5% 

VCnl^T* f ' ^ t H0% 10 33 4 +/ " 85% > P = °- 03 = -pectivelyX 
a\ rfi P™ temcoatent was 3330 significantly reduced (272.19 +/- 84.13 vs 175 78 +/ 

levelsVo - 0 P nn?; 04) ' A ^Fu TGSression m&l y sis = 0.90) showed that DvITG 

pp * " r ^P resented the most Powerful independent variable for predicting 
UCP3 i vanahon. DISCUSSION: The strong correlation of UCP expression^ decrease 

% S ^ g f S ** tHg,yCeride C ° ntent ^ 311 eve " more ^Portant role m 
die regulation of UCP gene expression than the circulating levels of free fatty acids or the 
achieved degree of weight loss. "yauuborine 



PMID: 12740453 [PubMed - indexed for MEDLINE] 



435: Brain Res. Mol Brain.Res- 1994 Jul;24(l -4); 129-39. 



Related Articles, Links 



Differential expression of the short and long forms of the gamma 2 subunit 
of the GABAA/benzodiazepine receptors. 

Mealies CP, Gutierrez A . Khan ZU . Vitoi ica J . De Bias AL 

Division of Molecular Biology and Biochemistry, School of Biological Sciences, 
University of Missouri-Kansas City 641 10-2499. 

The distribution of the mRNAs encoding the gamma 2S and gamma 2L subunits of the 
GABAA receptor in the rat brain has been revealed by in situ hybridization, northern blot 
and dot blot analysis using specific antisense oligonucleotides. In addition, the 
quantitative distribution of the gamma 2S and gamma 2L subunit peptides participating in 
the fully assembled GABAA receptors/benzodiazepine receptors has been mapped by 
immunoprecipitation with specific anti-gamma 2S and anti-gamma 2L antibodies. 
Several neuronal types and brain regions are enriched in gamma 2L such as neurons of 
the layer H of striate cortex and cerebellar Purkinje cells as well as the inferior colliculus, 
superior colliculus, deep cerebellar nuclei, medulla and pons. Other neuronal types and 
regions are enriched in gamma 2S such as the mitral cells of the olfactory bulb, pyramidal 
neurons of the pyriform cortex, layer VI of the neocortex, granule cells of the dentate 
gyrus and pyramidal cells of the hippocampus. Other cortical areas and cerebellar granule 
cells express both gamma 2S and gamma 2L in comparable amounts: There is a good 
correlation between the relative expression of gamma 2S and gamma 2L mRNAs and the 
relative presence of these protein subunits in fully assembled and mature receptors in the 
studied brain regions. The differential distribution of gamma 2S and gamma 2L might 
result in. differentia! ethanol sensitivity of the neurons expressing these GABAA receptor 
subunits. 
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Follicle-Stimulating Hormone Receptor and Its Messenger Ribonucleic Acid Are 
Present in the Bovine Cervix and Can Regulate Cervical Prostanoid Synthesis 1 

D. Mizrachi and M. Shemesh 2 
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ABSTRACT 

The hypothesis that FSH regulates the bovine cervical pros- 
taglandin E 2 (PGE 2 ) synthesis that is known to be associated with 
cervical relaxation and opening at the time of estrus was inves- 
tigated. Cervical tissue from pre-estrous/estrous, luteal, and 
postovulatory cows were examined for 1) the presence of bovine 
(b) FSH receptor (R) and its corresponding mRNA and 2) the 
effect of FSH on the PGE 2 regulatory pathway in vitro. The pres- 
ence of bFSHR mRNA in the cervix (maximal during pre-estrus/ 
estrus) was demonstrated by the expression of a reverse tran- 
scription (RT) polymerase chain reaction (PCR) product (384 
base pairs) specific for bFSHR mRNA and sequencing. Northern 
blotting revealed three transcripts (2.5, 3.3, and 3.8 kilobases 
[kb]) in cervix from pre-estrous/estrous cows. The level of FSHR 
(75 kDa) was significantly higher (p < 0.01) in Western blots of 
pre-estrous/estrous cervix than in other cervical tissues. There 
was a good correlation between the 75-kDa protein expression 
and its corresponding transcript of 2.55 kb throughout the es- 
trous cycle as described by Northern blot analysis as well as RT- 
PCR. Incubation of FSH (10 ng/ml) with pre-estrous/estrous cer- 
vix resulted in a 3-fold increase in the expression of FSHR and 
a 2-fold increase in both G protein (a s ) and cyclooxygenase II. 
FSH (5-20 ng/ml) significantly increased (p < 0.01) cAMP, ino- 
sitol phosphate (p < 0.01), and PGE 2 (p < 0.01) production by 
pre-estrous/estrous cervix but not by cervix at the other stages. 
We conclude that bovine cervix at the time of the peripheral 
plasma FSH peak (pre-estrus/estrus) contains high levels of 
FSHR and responds to FSH by increasing the PGE 2 production 
responsible for cervical relaxation at estrus. 

INTRODUCTION 

FSH and LH are the two known pituitary hormones that 
regulate gonadal functions. FSH is essential for female fer- 
tility, specifically for folliculogenesis, and human ovarian 
failure has been related to mutations both in the FSH re- 
ceptor (FSHR) and in the FSHf3 gene [1]. However, unlike 
LH, for which extra-gonadal binding has been documented 
for the uterus in the cow [2], pig [3], and human [4], no 
extra-gonadal effects of FSH have been reported. The uter- 
ine LH receptors have been shown to be physiologically 
active in the cow [2. 5]. The concentration of uterine LH 
receptors is related to the phase of the cycle, and binding 
of LH to uterine receptors increases uterine prostaglandin 
synthesis [2, 5]. Preliminary observations in our laboratory 
indicated that both FSH and LH receptors are also present 
in the cervix. We therefore investigated whether FSHRs are 
associated with the regulation of cervical prostaglandin, 
particularly around the time of estrus, when both cervical 
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PGE 2 production [6] and peripheral FSH concentrations are 
maximal. 

The present investigation was therefore conducted to de- 
termine the presence of FSHR and its mRNA, and its phys- 
iological activity, in the bovine reproductive tract. Since the 
cervix is a prostaglandin Ei (PGE|)- and PGE 2 -sensitive 
tissue and responds to hormonal stimulation, we also de- 
termined whether FSH could increase cervical cyclooxy- 
genase expression and production of PGE 2 in vitro, partic- 
ularly at the time near estrus. 

MATERIALS AND METHODS 

Animals 

Cervical and ovarian tissues were collected from Hol- 
stein cows at a local abattoir. The stage of the cycle was 
determined according to signs of ovulation and the status 
of corpora lutea (CL; weight, color). These parameters al- 
lowed the classification of the cervix into three groups: 1) 
pre-estrous/estrous (follicular, 18-20 days postovulation; 
regressed CL < 1.0 g; presence of cervical mucus; n = 
54); 2) postovulatory (1-4 days postovulation; presence of 
corpora hemorrhagica; signs of follicular rupture; n = 35), 
and 3) luteal (12-16 days postovulation; CL 4-6 g; n = 
44). Bovine granulosa cells were collected from pre-es- 
trous/estrous follicles and used as a positive control for 
FSHR expression [7]. 

The cervix external os segment was taken as the initial 
3 cm of the cervix (total length of cervix is about 9 cm). 
The cervical os segments, primarily luminal ly oriented 
muscularis with epithelium, were sliced into horizontal 
strips (1 cm in length), pooled, and minced finely with a 
scalpel. The Kimron Veterinary Institute Animal Care and 
Use Committee approved all procedures. 

FSHR Gene Expression 

RNA isolation. Bovine cervical or ovarian tissues were 
rapidly dissected and frozen in liquid nitrogen (within 20 
min after slaughter). Total RNA was extracted using the 
acid phenol, guanidine thiocyanate technique [8], using 
TriReagent (Molecular Research Center, Inc., Cincinnati, 
OH) according to the manufacturer's instructions. 

Reverse transcription (RT)-polymerase chain reaction 
(PCR). Bovine FSHR mRNA was detected as previously 
described for the LH receptor mRNA [5]. Oligonucleotides 
corresponding to the published sequence of the bovine 
FSHR [7] were synthesized and used as primers in a PCR 
reaction. Bovine ovarian and cervical tissue cDNA ob- 
tained by RT of 2 fig RNA was used as a template in the 
PCR reaction using a pair of primers corresponding to the 
transmembrane segments of the FSHR. The primers, 20- 
mer each, were selected using the OL1GO Program (Oligo 
Rl primer analysis software; National Biosciences, Plym- 
outh. MN). The forward primer corresponded to position 
1399-1419 (5'CGGCTTTTTCACTGTCTTTG3') on the 
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bovine FSHR mRNA [7]. The reverse primer was a 20-mer 
oligonucleotide complementary to position 1763-1783 
(5'CGCTTGGCTATCTTGGTGTC3'). The predicted size 
of the RT-PCR product was 384 basepairs (bp). Four mi- 
croliters of the RT reaction were used as a template for 
PCR; the reaction was allowed to proceed 35 cycles using 
2 U Termus Aquaticus DNA polymerase (recombinant) 
provided by MBI Fermentas, Vilnius, Lithuania, and 200 
pmol of each primer. The cycling parameters of the PCR 
were 98°C for 20 sec, 56°C for 45 sec, and 72°C for 45 
sec. After amplification the samples were separated on a 
1% agarose gel, stained with ethidium bromide, and pho- 
tographed under UV light. The 384-bp fragment was ex- 
tracted using the Wizard PCR Preps Kit (Promega, Madi- 
son, WI) and sequenced with the upper primer using an 
automatic sequencer (Applied Biosystems, Foster City, 
CA). For internal control, a bovine p-actin fragment of 890 
bp was produced from an upper primer (5'ACCA- 
ACTGGGACGACATGG AG3 ' ; 21 mer) and a lower prim- 
er (5'GCATTTGCGGTGGACAATGGA3'; 21 mer) as pre- 
viously described [5]. Each PCR amplification was stan- 
dardized using ovarian granulosa cDNA to produce a spe- 
cific band of the predicted size. Standardization was based 
on temperature, cycle number, Mg level, and pH. 

Western Blots 

Cervical, bovine seminal vesicle, or muscle minces were 
sonicated for 45 sec, lysed in 500 pi lysing buffer (1% 
Nonidet-40 [Sigma, St. Louis, MO], 2 mM EGTA, 5 nM 
MgCl2, 1 mM PMSF in PBS) and incubated for 2 h at 4°C. 
After centrifugation, aliquots were taken for protein deter- 
mination using gamma globulin (Sigma) as reference stan- 
dard [9] and a protein dye binding method (Bio-Rad Lab- 
oratories, Richmond, CA). Sixty micrograms of protein of 
the soluble tissue extract was then separated by electropho- 
resis on 5-9%-gradient SDS-PAGE and electroblotted onto 
nitrocellulose paper (Amersham, Little Chalfont, Bucks, 
UK) as described previously [9]. The nitrocellulose mem- 
brane was washed with PBS containing 0.05% TWEEN-20 
and blocked with 10% horse serum (Kimron Veterinary In- 
stitute) in washing solution. The nitrocellulose membrane 
was then treated with either 1) an anti-peptide antibody 
(code name 179) raised against human (h) FSHR peptide 
265-295, diluted 1:500 (donation of Dr. J.A. Dias, Wads- 
worth Center, New York State Department of Health); 2) 
rabbit antiserum for bovine G protein (a ? ) diluted 1:1000; 
UBI, Lake Placid, NY); or 3) rabbit anti-bovine cycloox- 
ygenase polyclonal antiserum (diluted 1:200; Kimron Vet- 
erinary Institute) [9]. Different dilutions were used for each 
antibody, and the final dilution used was 75% of the dilu- 
tion, which gave a maximal signal as previously described 
[9]. The nitrocellulose paper was then incubated with horse- 
radish peroxidase-conjugated goat anti-rabbit lgG (Sigma 
Israel, Rehovot, Israel; diluted 1:2000 in washing buffer) 
for 1 h at room temperature. The presence of FSHR, G 
protein (ot s ), or cyclooxygenase was then visualized by 
means of a color reaction as follows. The nitrocellulose 
paper was incubated in a substrate solution containing 3'3'- 
diaminobenzidine (0.5 mg/ml; Sigma) in a mixture of PBS 
containing 0.5% CaCl 2 and 6% H 2 0 2 . The antibody to 
hFSHR recognized the 75-kDa protein of the FSHR. The 
antibody to G (ct s ) recognized both the 42-kDa and 87-kDa 
forms of this protein, and the antibody to cyclooxygenase 
recognized the 72-kDa form (cyclooxygenase II). Extract 
of bovine seminal vesicles that are known to have a high 



content of cyclooxygenase was used as positive control as 
previously described [9]. The densitometric scans were ob- 
tained using a bio-imaging system (B.I.S. 2020; Rhenium 
Dingo, Jerusalem, Israel) and processed with Tina 2.0 soft- 
ware (Fuji, Japan). Linearity of detection [9] was deter- 
mined for densitometry for both Western and Northern 
blots. Each Western blot was evaluated in the absence of 
the first antibody, and no signal was detected. 

Northern Blot Analysis 

For Northern blots, 10 u,g of total RNA was denatured 
in 20 pj 50% formamide/2.2 M formaldehyde in single- 
strength 3-(«-morpholino) propanesulfonic acid (MOPS) 
buffer (0.04 M MOPS, 10 mM sodium acetate, 1 mM 
EDTA) at 65°C for 10 min. Samples were placed on ice, 
and 5 uJ of loading buffer (0.5% SDS, 0.25% bromophenol 
blue, 25% glycerol, 25 mM EDTA) was added. Samples 
were subjected to electrophoresis through a 1.2% agarose 
formaldehyde gel and transferred by capillary blotting to 
nylon membranes (Nytran; Schleicher and Schuell, Keene, 
NH). RNA was subsequently UV cross-linked to mem- 
branes. The RT-PCR product fragments of 384 bp for 
FSHR and 890 bp for p-actin were used to generate bioti- 
nylated probes using random primer biotin labeling of DNA 
for chemiluminescence (NEBlot phototype kit; Biolabs, 
Beverly, MA) according to the manufacturer's instructions. 
The blotted membrane was prehybridized at 65°C for 1 h 
in prehybridization* solution (6-strength SSC [3 M NaCl, 
0.3 M sodium citrate], 5-strength Denhardt's reagent [1% 
ficoll, 1% polyvinylpyrrolidone, 1% BSA], 0.5% SDS, and 
100 (xg/ml denatured salmon sperm DNAs) and hybridized 
overnight in the same solution with the denatured biotiny- 
lated probe to the target RNA. The membrane was then 
washed in double-strength SSC, 0.1% SDS at room tem- 
perature for 10 min and washed again in 0.1 -strength SSC, 
0.1% SDS at 68°C for 15 min followed by subsequent 
chemiluminescence, development, and detection on x-ray 
film (RX; Fuji Film, Tokyo, Japan). To obtain quantitative 
data for the specific mRNA, we used the densimetric value 
for p-actin mRNA to normalize each specific mRNA value. 
To remove FSHR probe, membranes were stripped using 
water for 15 min, then incubated in 0.4 M NaOH, 0.1% 
SDS at 80°C for 30 min. The membrane was rinsed again 
in 0.2 M Tris-HCl, 0.1 -strength SSC for 30 min at 25°C. 
After stripping, prehybridization and hybridization with 
actin probe were performed as described for FSHR mRNA. 

Cyclic AMP Determinations 

Cervical tissue minces (50 mg/ml) were incubated for 
10 min in wells of 1.5 ml containing 1.0 ml of Tissue Cul- 
ture Medium- 199 (TCM-199 without serum (Biological In- 
dustries, Beit Haemek, Israel) in the absence or presence 
of physiological (5, 10 ng/ml) and pharmacological doses 
(20, 40 ng/ml) of FSH (USDA-bFSH-8-1; no further in- 
crease in cAMP production was seen at 50 or 100 ng/ml) 
or forskolin, a stimulator of adenylate cyclase (10 pjvl; Sig- 
ma). The dose of forskolin was selected to give a 3-fold 
increase in cAMP. At the end of the incubation, tissues were 
removed, blotted on filter paper to remove mucus, and in- 
cubated overnight at 4°C with 400 jjlI of 3% HC10 4 . The 
solution was then neutralized with 150 |xl of K.HCO3 and 
centrifuged, and 50-pJ aliquots were taken for the radiore- 
ceptor assay as described by Brown et al. [10] and modified 
for endometrium by Miyazaki et al. [II]. Standards (0- 
1000 pg of cAMP) or samples were incubated at room tern- 
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FIG. 1 . RT-PCR amplification of bovine cervix FSHR and p-actin mRNA. 
RT-PCR was performed as described in Materials and Methods for FSHR 
and p-actin. Separate RT-PCR amplification products of FSHR (estimated 
384 bp) and 0-actin (estimated 890 bp) were combined and separated on 
agarose gel electrophoresis and stained with ethidium bromide. RT-PCR 
was performed using 2 jxg RNA of bovine granulosa cells as a positive 
control (lane 1 ), cervical tissue at the pre-estrous/estrous (lanes 3-4), post- 
ovulatory (lanes 5-6), and luteal phases (lanes 7-8). Lanes 9-10 represent 
reamplification, using as template the PCR product, cDNA, from the lu- 
teal-phase amplification. Lane 2 represents negative control in the ab- 
sence of RNA. The tissue was taken from 4 cows at each stage of the 
estrous cycle, and two representative samples from each stage are shown. 
SM, size markers. 



peratiire with 100 \x\ of cAMP binding protein prepared 
from bovine adrenal extracts as described [10]. The buffer 
used for the assay was added to form a final reaction vol- 
ume of 400 |xl. After 2-h incubation in a cold room, the 
reaction was stopped by adding 300 u-1 of a charcoal-dex- 
tran solution; centrifugation followed, and the supernatant 
was removed for counting in a scintillation counter. The 
sensitivity of the assay was 60 pg/tube. Quadruplicate as- 
says were made for each value determined. The within- 
assay and between-assay coefficients of variance were 8% 
and 10%, respectively. 



1401 

BTFSH GCTTTTTCAC TGTCTTTGCC AGTGAGCTCT CAGTCTACAC TCTGACCGCC 
Cervix GCTTTTTCAC TGTCTTTACI AGTGAGCTCT CAGTCTACAC TCTGACCGCC 

BTFSH ATCACGCTGG AAAGATGGCA TACCATCACC CATGCCATGC AGCTCGAATG 
Cervix ATCACGCTGG AAAGATGCCA TCCCATCACC CGTGCCATGC AGCTCGAATG 

BTFSH CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 
Cervix CAAAGTGCAG CTCCGCCATG CTGCCAGCAT CATGCTGGTG GGCTGGATCT 

BTFSH TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 
Cervix TTGCTTTTGC AGTTGCCCTT TTTCCCATCT TTGGCATCAG CAGCTACATG 

BTFSH AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 
Cervix AAGGTGAGCA TCTGCCTGCC CATGGACATT GACAGCCCCT TGTCACAACT 

BTFSH CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 
Cervix CTATGTCATG TCCCTCCTTG TGCTCAATGT CCTGGCCTTT GTGGTCATCT 

BTFSH GTGGCTGCTA CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 
Cervix GTGGCTXCTT CACTCACATC TACCTCACGG TGAGGAACCC CAACATCACA 

. 1761 

BTFSH TCCTCTTCTA GTG 
Cervix TCCTCTTCTA GTG 

FIG. 2. Automated nucleotide sequencing and homology between Bos 
taurus mRNA FSHR (GenBank accession number L22319) from nucleo- 
tides 1401 to 1 763 and the PCR products obtained using the upper primer 
as described in Materials and Methods. A homology of 97.5% was found 
between Bos tnurus and our amplified cDNA obtained by RT-PCR from 
cervical mRNA at pre-estius/estrus, suggesting that they are complemen- 
tary. 



Measurement of Phospholipase C (PLC) Activity 

Cervical minces (30 mg) were incubated in 1 ml TCM- 
199 containing 5 mCi [ 3 H]myo-inositol for 90 min. Slices 
were rinsed with 1 ml TCM-199 and incubated for another 
60 min to remove unincorporated [ 3 H]myo-inositoI. At the 
end of the preincubations, slices were incubated in 1 ml 
TCM-199 containing 10 mM LiCl to inhibit inositol phos- 
phate (IP) hydrolysis, allowing 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 to 
accumulate in the tissue after stimulation of PLC, in ab- 
sence or presence of 0-20 ng/ml of FSH or 0-20 ng/ml 
LH (USDA bLH-I-1). Incubations were terminated by add- 
ing 2 ml ice-cold chloroform/methanol/hydrochloric acid 
(5:10:0.1, v:v:v) and then 1 ml chloroform and 1 ml EDTA 
(5 mM) and extracted as described by Kisielewska et al. 
[12]. 3 H-IP, 3 H-IP 2 , and 3 H-IP 3 were separated from the 
labeled inositol compounds by using Dowex AG 1-X8 (for- 
mate form). 3 H-Labeled IP was eluted from the column by 
sequential elution into 10 fractions. Aliquots of each frac- 
tion were added to scintillation vials containing 5 ml scin- 
tillation fluid, and radioactivity was determined using a 
scintillation counter. Total activity of PLC is expressed as 
cpm of total IPs/30 mg/30 min. 

RIA for PGE 2 and PGF 2a 

Aliquots of 100 |xl were taken at the end of the incu- 
bation period for specific RIA of PGE 2 and PGF 2? , which 
were performed without chromatographic separation. The 
antisera for PGF-> a (Sigma) reacts preferentially with PGF 2 „ 
but cross-reacts with PGF la (60%) and to a negligible ex- 
tent (< 0.1%) with prostaglandins of the A, B, and E series. 
The antisera to PGE? (Sigma Israel) reacts preferentially 
with PGE 2 but cross-reacts with PGE, (20%), PGA,, PGA 2 , 
PGF] , and PG"F, Q (< 10%) and to a negligible extent with 
PGB^and PGB 2 (< 0.1%). The intraassay coefficients of 
variation were 9% and 11%, and the interassay coefficients 
of variance were 12.3% and 13% for PGF 2 „ and PGE 2 . 

Statistical Analysis 

ANOVA was performed with a significance level of/? 
< 0.05. Data were further analyzed using Tukey's proce- 
dure (p < 0.05 or p < 0.01) to assess significance between 
treatments. Student's /-test was used where appropriate. 
Values are expressed as means ± SEM. 

RESULTS 

FSHR mRNA Expression in the Cervix 

RT-PCR demonstrated that the FSHR gene is expressed 
in bovine cervix as indicated by the presence of the 384- 
bp fragment corresponding to bFSHR mRNA. The mRNA 
was maximally expressed in the cervix during the pre-es- 
trous/estrous phase. In contrast, the expression of the gene 
was reduced in postovulation cervix and not detectable in 
luteal-phase cervix (Fig. 1). However, reamplification of the 
PCR product, cDNA, from the luteal phase produced a de- 
tectable signal (Fig. 1 ). 

To demonstrate that the absence of FSHR mRNA in lu- 
teal-phase cervix was specific, bovine p-actin mRNA was 
iised as internal control. PCR demonstrated that the specific 
890 bp of the p-actin cDNA band was produced by the 
cervical tissues of all the stages. The band for p-actin was 
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FIG. 3. Northern blot for analysis of total RNA (10 p.g) extracted from 
bovine cervical tissues collected from pre-estrous/estrous, postovulatory, 
and luteal-phase cows, and bovine granulosa cells as a positive control. 
Relative migration of 28S and 18S ribosomal RNA is indicated. To localize 
mRNA for FSHR and (S-actin, our RT-PCR products (384-bp and 890-bp 
fragments, respectively) were used to generate biotinylated probes using 
biotin random primer. Figure shows results for FSHR mRNA in bovine 
granulosa cells as a positive control (lane A); and in cervical tissues at 
pre-estrus/estrus (lanes B-C), postovulation, (lanes D-E) and the luteal 
phase (lanes F-G). The corresponding (3-actin band was obtained by re- 
probing of the membranes previously tested for FSHR mRNA as described 
in Materials and Methods. Columns show mean ± SEM of the optical 
density of the ratio FSFlR:0-actin, in which the' luteal phase is given an 
arbitrary value of 1.0. "Significantly different from luteal-phase value (p 
< 0.01); n = 5 for each stage. 
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FIG. 4. Presence of FSHR protein in cervical tissues as determined by 
Western blot. Soluble cell extracts (60 p,g protein) were used to determine 
the amount of FSHR as described in Materials and Methods, Western blot 
proteins were transferred to nitrocellulose paper and probed with anti- 
serum to peptide hFSHR, 265-296. Western blot displays a representative 
result for bovine granulosa cells, used as positive control (lane A); bovine 
seminal vesicles (lane B) and muscle, used as a negative control (lane C); 
and cervical tissues from three stages of the estrous cycle: pre-estrous/ 
estrous (lane D), postovulatory (lane E), and luteal (lane F). Histogram 
summarizes the results for the three stages of the estrous cycle. Columns 
show mean ± SEM of the relative optical density in which the luteal phase 
is given an arbitrary value of 1 .0. Significantly different from luteal-phase 
value (p < 0.01); n = 7 for each stage. 



present even though the 384-bp band corresponding to the 
FSHR was absent (Fig. 1). 

The nucleotide sequence (Fig. 2) of the 384-bp fragment 
obtained from pre-estrous/estrous cervix was compared 
with that for the Bos taunis FSHR reported by Houde et 
al. [7]. It was found that there was a 97.5% homology be- 
tween the 384-bp fragment and the nucleotide range 1410- 
1763 found in the Bos taurus FSHR. 

The Northern blot contained the expected three tran- 
scripts of 2.55, 3.3. and 3.8 kilobases (kb) present in the 
bovine granulosa. However, the high (6.8-kb) and low (1.6- 
kb) transcripts found in the ovary were not found in the 
cervix. The three transcripts were found in pre-estrous/es- 
trous cervix, but only one transcript (2.55 kb) was seen in 
luteal cervix (1/8 of the pre-estrus/estrus level; n = 5) or 
postovulatory cervix (1/3 of the pre-estrus/estrus level; n = 
5) (Fig. 3). 
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FIG. 5. Effect of incubation of cervical tissues with FSH on the expres- 
sion of the FSHR protein. Cervical tissue from 5 pre-estrous/estrous cows 
was incubated in the absence or presence of FSH (0 or 1 0 ng/ml). Western 
blot is representative for FSH protein concentration after 3 h of incuba- 
tion, with granulosa cell extract serving as positive control. Lane A, Gran- 
ulosa cells (CC); lane B, cervical tissue at Time 0; lane C, cervical tissue 
after 3 h of incubation in TCM-199; lane D ( cervical tissue after 3 h of 
incubation with 10 ng/ml of FSH. . Lane E represents muscle tissue incu- 
bated with FSH (10 ng/ml) for 3 h as a negative control. Histogram sum- 
marizes the results of six experiments. Columns are means ± SEM. *Sig- 
nificantly different (p < 0.01) from the column representing the control 
at 3 h (a given value of 1). 



FSHR Protein in the Cervix 

The presence of FSHR protein in the cervix was dem- 
onstrated as determined by Western blot analysis using a 
specific antipeptide antibody for hFSHR. The antibody rec- 
ognized a 75-kDa protein in both bovine granulosa and 
cervical preparations identical with the predicted molecular 
size of FSHR. The signal for the 75-kDa protein was stron- 
gest in pre-estrous/estrous cervix when compared with 
postovulation- (3-fold) or luteal-phase (6-fold) cervix (Fig. 
4). With the use of 60 jxg protein in each lane, this protein 
was observed to be expressed throughout the estrous cycle. 

In Vitro Effect of FSH on Induction of FSHRs 

A time-course (0.75, 1.5, and 3 h) and dose-response (0, 
10, and 20 ng/ml) study was carried out to determine in 
vitro effect of FSH on its own receptor. Cervical minces 
(100 ing) from cows in the pre-estrous/estrous, postovula- 
tory, and luteal phases were used. It was found, as deter- 
mined by Western blot, that FSH induced its own receptor 
in cervical tissues at pre-cstrus/estrus, with a 3-fold increase 
(p < 0.01) observed at 10 ng/ml after 3 h of incubation 
(Fig. 5). However, no effect of FSH was seen at earlier 
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FIG. 6. Effect of incubation of cervical tissues with FSH on the synthesis 
of G protein. Cervical tissues from 5 pre-estrous/estrous cows were in- 
cubated in the absence or presence of FSH (10 ng/ml). Both the 87-kDa 
G protein (a, p, y complex) and 42-kDa protein (aj are elevated at 3 h 
of incubation. Granulosa cell extract was used as a positive control. Lane 
A, Granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, cervical 
tissue after 3 h of incubation in TCM-199; lane D, cervical tissue after 3 
h of incubation with 10 ng/ml of FSH. Columns are means ± SEM of 42- 
kDa protein. *Significantly different (p < 0.05) from the column repre- 
senting the control at 3 h (with an arbitrary value of 1.0). 



times at the 10 ng/ml level. When 20 ng/ml was used, a 
stimulatory effect (2-fold) was seen after 1.5 h. No signif- 
icant response was observed when tissues of the postovu- 
latory or luteal phase were used at any dose or time tested 
(data not shown). 

Effect of FSH Treatment on G Protein (a J 

A time-course and dose-response study was carried out 
to determine whether cervical FSHR was coupled to G pro- 
tein. Cervical minces (100 mg) from cows in the pre-es- 
trous/estrous, postovulatory, and luteal phases were incu- 
bated in the presence of FSH (0-20 ng/ml) for 0.75, 1 ; 5, 
and 3 h. It was found in pre-estrous/estrous cervical tissue 
that, in the presence of FSH (10 ng/ml), there was a sig- 
nificantly (p < 0.05) higher level of G protein (a s ) at the 
level of both the 87-kDa^a, (3, y complex) and the 42-kDa 
(a s ) proteins after 3 h of incubation (Fig. 6). FSH, at either 
10 or 20 ng/ml, did not have a stabilizing effect on the 
level of G protein (a s ) at shorter times of incubation (0.75 
or 1.5 h). No significant response was observed when tis- 
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FIG. 7. Effect of FSH on IP production in cervical tissue. Cervical tissues 
from six cows at each of three stages of the cycle were incubated for 30 
min in the absence or presence of FSH (10 ng/ml) or LH (ng/ml). IPs 
produced were determined in four replicates by amount of labeled com- 
pound (cpm) produced. Columns are means ± SEM (*p < 0.05; **p < 
0.01 ; significantly different from control value for each stage of the cycle). 



sues of the postovulatory or luteal phase were used at any 
dose or time tested (data not shown). 

IP Pathway Activation by FSH 

For IP measurement, cervical tissue minces (30 mg) 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated for 30 min in the absence or presence 
of FSH (10 ng/ml) or LH (10 ng/ml). FSH significantly (p 
< 0.01) increased the level of IP, IP 2 , and IP 3 at pre-estrus/ 
estrus. However, FSH significantly (p < 0.05) inhibited IP 
level in the postovulatory stage. In contrast, LH, but not 
FSH, stimulated IPs significantly (p < 0.01) at the luteal 
phase (Fig. 7). 

Adenylate Cyclase Activation by FSH 

A dose-response analysis was used to determine whether 
the cervical FSHR was coupled to adenylate cyclase. In 
these experiments, cervical tissue minces (50 mg) obtained 
from pre-estrous/estrous, postovulatory, and luteal-phase 
cows were incubated with FSH (0-40 ng/ml). FSH in- 
creased cAMP accumulation in a dose-dependent manner 
when incubated with pre-estrous/estrous cervical tissues, 
with maximal stimulation 2.5 times (p < 0.01) that of con- 
trol in the presence of 10 ng/ml (Fig. 8 A). In contrast, cer- 
vical tissue from luteal or postovulation did not respondjto 
FSH even though tissues from all phases of the cycle re- 
sponded to forskolin with a significant (p < 0.01) increase 
in cAMP (Fig. 8B). 

Effect of FSH on PCE Production by Cervical Tissues 

A dose-response analysis was carried out to determine 
whether FSHR was associated with cervical prostaglandin 
production. PGE 2 and PGF 2u production by cervical minces 
(100 mg/ml) in the presence of FSH (0-20 ng/ml) was de- 
termined by R1A. FSH stimulated (p < 0.01) basal PGE 2 
production in a dose-dependent manner and caused a 3-fold 
increase in PGE 2 production at 10 'ng/ml in pre-estrous/ 
estrous cervical tissues (n = 8, Fig. 9). FSH had no effect 
on the small amount of basal PGF 2m (< 1 ng/100 mg) pro- 
duced by cervical tissue (data not shown). FSH did not 
elevate PGE 2 in cervical tissues from the luteal phase; fur- 
thermore, a small but significant (p < 0.05) inhibition was 
observed at postovulation (Fig. 9). 
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FIG. 8. Effect of FSH on adenylate cyclase production in cervical tissue. 
A) Cervical tissues from six cows at pre-estrus/estrus were incubated (4 
replicates) for 10 min in the absence or presence of FSH (0, 5, 10, 20, or 
40 ng/ml) or forskolin (10 (xmol/L), and the amount of cAMP produced 
was measured by a protein binding assay. B) Cervical tissues from six 
cows at each of the three stages of the estrous cycle were incubated (4 
replicates) for 10 min in the absence or presence of FSH (10 ng/ml) or 
forskolin (10 jjlM). The amount of cAMP produced was measured by a 
protein binding assay. Columns are means ± SEM. Columns with asterisks 
were statistically different from their own control (*p < 0.01 ; **p < 0.05). 
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FIG. 9. FSH dose response on production of PGE 2 by cervical tissues. 
Cervical tissues from 8 pre-estrous/estrous, postovulatory, or luteal-phase 
cows were incubated in the absence or presence of FSH (5, 10, or 20 ng/ 
mi) for 6 h, and the effect on the production of PGE, was measured by 
RIA. *p < 0.01; **p < 0.05. 
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FIG. 10. Effect of incubation of cervica! tissues with FSH on the ex- 
pression of cyclooxygenase. Cervical tissues from 5 pre-estrous/estrous 
cows were incubated in the absence or presence of FSH (10 ng/ml). The 
72-kDa cyclooxygenase II protein was elevated at 3 h of incubation. Lane 
A, bovine granulosa cells (GC); lane B, cervical tissue at Time 0; lane C, 
cervical tissue after 3 h of incubation in TCM-1 99; lane D, cervical tissue 
after 3 h of incubation with 10 ng/ml of FSH. Columns are means ± SEM 
of 72-kDa protein. *Significantly different (p < 0.01) from the column 
representing the control at 3 h (with an arbitrary value of 1.0). 



Cyclooxygenase Activation by FSH 

To determine whether the presence of FSHR was asso- 
ciated with the expression of cyclooxygenase, cervical 
minces were incubated for 3 h in the presence or absence 
of FSH (10 ng/ml). Tissues from pre-estrous/estrous, post- 
ovulatory. and luteal-phase cows were extracted, separated 
on SDS-PAGE, and tested for the cyclooxygenase II (72 
kDa) using a specific antibody. It was found that in cervical 
tissues from six pre-estrous/estrous cows, FSH induced a 
200% increase in the expression of cyclooxygenase after 3 
h of incubation (Fig. 10). In contrast, no effect on cyclo- 
oxygenase expression was seen in cervical tissues from lu- 
teal-phase cows, and an insignificant (p > 0.05) elevation 
at the postovulatory phase was observed (data not shown). 

DISCUSSION 

Results of this study demonstrate by both PGR amplifi- 
cation and Northern blot analysis that FSHR mRNA is pres- 
ent in the bovine cervix. The bands corresponding to the 
receptor (75 kDa) and its mRNA were maximally expressed 
during the pre-estrous/estrous phase. Incubation of cervical 
tissues with FSH increased the expression of cyclooxygen- 



ase and the production of PGE 2 (and possibly PGEj) at the 
pre-estrous/estrous phase. The stimulation of cervical cy- 
clooxygenase was associated with activation of both the 
PLC and adenylate cyclase second messenger G-protein re- 
lated pathways. 

Northern blot analysis of bovine cervical tissues from 
pre-estrous/estrous cows revealed multiple transcripts for 
bovine cervical FSHR. The sizes of the major transcripts 
were 2.5 kb, 3.3 kb, and 3.8 kb, which were similar to those 
obtained in bovine granulosa cells [7], rat testis and ovary 
[13, 14], and human ovary [15] and myometrium [16]. Oth- 
er minor transcripts have been reported in all of these spe- 
cies. The RT-PCR product yielded a single band of 384 bp, 
which was the expected molecular size, so the different 
iso forms of the FSHR are probably the result of differential 
splicing of the same transcript. Iso forms of FSHR have 
been characterized in human [17] and primate [18] ovary 
and in ovine testis [19]. Since only a single 2.5-kb isomer 
was present in luteal-phase cervix (which did weakly pro- 
duce FSHR), it would appear that there are different iso- 
forms present in the bovine cervix as well. 

The nucleotide sequence homology between our RT- 
PCR 384-bp product was 97.3% identical to the comparable 
region of the bovine FSHR, showing that our amplified 
cDNA was complementary to the Bos taurus mRNA FSHR. 
The PCR product of 384-bp cDNA was found primarily in 
the pre-estrous/estrous cervix. 

Using specific antibody raised in rabbits against hFSHR 
(amino acid sequence 265-296) for Western blot analysis 
resulted in a major signal for a 75 -kDa protein. A protein 
of similar molecular mass has been reported in rat and hu- 
man ovary [20]. A strong signal for this protein was seen 
in the pre-estrous/estrous cervix compared with cervix from 
the other stages of the cycle. This cervical receptor was 
regulated by FSH itself, similar to the regulation of the 
FSHR in the rat ovary [14]. 

FSH increased cAMP production by cervical tissue from 
the pre-estrous/estrous phase but did not elevate cAMP in 
the postovulatory and luteal phases. FSH elevation of IPs 
displayed the same pattern. It would therefore appear that 
the cervical FSHR is associated with signal transduction 
pathways in a way similar to the accepted mechanism of 
LH action in the ovary and testes, i.e., activation of the 
second messenger pathways — adenylate cyclase [21] and 
phosphatidyl -inositol (PLC) [22, 23]. The inhibitory effect 
of FSH on IPs and cAMP at postovulation occur when 
FSHR is low. Since FSH activity is biphasic, i.e., is stim- 
ulated by a low dose and inhibited by a high dose, an in- 
hibitory effect at low concentrations could be the result of 
the change in the ligand-receptor ratio. 

FSH induced about a 200% increase in the expression 
of cyclooxygenase at pre-estrus/estrus but had no effect on 
the two other stages of the cycle. This was similar to the 
response of cyclooxygenase to LH that was observed for 
the bovine endometrium [2] and uterine vein [6]. However, 
in the endometrium, LH increased cyclooxygenase during 
both the luteal and postovulatory phases but not at estrus 
[2]. Furthermore, the induction of cyclooxygenase by go- 
nadotropin in the endometrium was associated with an in- 
creased level of PGF 2(1 , while in the cervix PGE 2 was the 
major product. 

The bFSH was of immunological grade and devoid of 
residua] biological contamination of LH, thyroid-stimulat- 
ing hormone, growth hormone, prolactin, or ACTH. The 
effects on increasing cAMP. PGE 2 , and cyclooxygenase as 
well as induction of FSHR were therefore specific. Fur- 
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thermore, it was found that highly purified hFSH (10-20 
ng/ml) (donation of J.A. Dias) had the same effects as 
bFSH when incubated with cervical tissue under the same 
conditions (2.5-fold increase in cAMP; 3-fold increase in 
PGE 2 ; 2-fold increase in cyclooxygenase). 

The relaxation and opening of the gravid cervix is due 
to active biochemical and structural changes in the cervical 
connective tissue that are mediated in part by prostaglan- 
dins [24]. It is thought that stretching of the cervix is a 
factor in causing the release of these prostaglandins [25, 
26]. The effect of PGE in causing cervical softening in the 
ewe is also well documented [27-29]. In both nonpregnant 
and pregnant cows, PGE 2 causes an increase in cervical 
opening within 3 h of treatment [30]. PGE! administered 
intra-cervically in a jelly was shown to decrease cervical 
resistance within 24 h [31]. 

Recently, we reported [6] that oxytocin caused a signif- 
icant stimulation of PGE 2 production in vitro in cervical 
tissues from pre-esrrous/estrous cows but had no effect on 
PGE 2 production in cervical tissue from other stages of the 
estrous cycle of the cow. Similarly, in an initial report [31], 
it was shown that administration of oxytocin to pre-estrous 
cows increased the concentration of PGE 2 in the cervical 
exudate. However, peripheral oxytocin concentrations dur- 
ing estrus are lower than during the luteal phase [32, 33], 
and elevations of both peripheral oxytocin and cervical 
oxytocin receptor concentration are necessary for oxytocin 
to cause cervical softening towards parturition [24]. Fur- 
thermore, progesterone in vitro induced a dose-dependent 
inhibition of PGE 2 release by cervical tissues from pre- 
estrous/estrous cows, and this was associated with a de- 
crease in both basal and oxytocin-stimulated PGE 2 produc- 
tion [6]. It would therefore seem that hormones other than 
oxytocin are responsible tor the increase in cervical PGE 2 
in the pre-estrous/estrous cow. The present report indicates 
that FSH, which has its peak peripheral concentration at the 
time of estrus, could be the hormone that increases cervical 
PGE 2 , as the FSHR expression is maximal at this time and 
FSH in vitro increases PGE 2 production by the cervix. 

The bovine cervix at pre-estrus/estrus has high levels of 
FSHR protein and its corresponding mRNA. Activation of 
the receptor by FSH is associated with the G-protein-cou- 
pled receptor family that mediates the cAMP and IP sig- 
nalling pathways. These signalling pathways then increase 
the expression of cyclooxygenase and production of PGE 2 . 
The expression of the FSHR was maximal at the time of 
the FSH peak in the blood and suggests a physiological 
role for FSH in the relaxation and opening of the cervix at 
estrus. 
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The alpua(v)bcta<> iutegrin receptor for Foot-and-mouth disease virus is 
expressed coustitutively on the epithelial cells targeted iu cattle. 

M 2gg^£ . GojdS, ^souj, ZhangZ, Weio^, VjojetteSM, AleMnder^n 

Field strains of Foot-and-mouth disease virus (FMDV) use a number of alpha(v)- 
integnns as receptors to initiate infection on cultured cells, and integrins are bel eved to 
be the receptors used to target epithelial cells in animals. In this study 
immunofluorescence confocal microscopy and real-time RT-PCR were used to 

rha?vS Pre ?° n ° f • tW ° ° f * C integrfn reCCpf0rS ° f FMDV - alpha(v)be t a6 and 
tox in^Z^ Van0US 7 ltheha ^« ^ this virus in cattle. Thie studies show 
where it 1 ' - ex P ressed ; onstitu «vely on the surfaces of epithelial cd.s at sites 

^ not IrllL eS1 °^ ° K CCUr dUm,g a natUraI ,nfeCti0n ' but not at site * where lesions 
are not normally formed. Express.on of al P ha(v)beta6 protein at these sites showed a 
good co.Telat.on with the relative abundance of beta6 mRNA. In contrast alphaMbeta3 

l?^onlt y detected t at ,ow , ! r ls on the vasculature and not ° n the 

of any of the ^t.ssues mvest.gated. Together, these data suggest that in cattle 

^ofFMUV? T a, P ha S V ) beta3 '— - ^ -ajor receptor that determmes tl ie 
tropism of FMDV for the ep.theha normally targeted by this virus. 

PMID: 1618623 1 [PubMed - in process] 
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Urokinase-mediated posltranscriptioua! regulation of urokinase-rcceptor 
expression in uon small cell lung carcinoma. 

Montuori N , Mattiello A, Mancini A . Taglialarcla p . Caput! M . Rossi G. Ragno P . 

Istituto di Endocrinologia ed Oacologia Sperimentale, Consiglio Nazionale delle 
Ricerche, Naples, Italy. 

The urokinase-type plasminogen activator (uPA) and its cellular receptor (uPAR) are 
involved in the proteolytic cascade required for tumor cell dissemination and metastasis, 
and are highly expressed in many human tumors. We have recently reported that uPA 
independently of its enzymatic activity/is able to increase the expression of its own 
receptor m uPAR-transfected kidney cells at a positranscriptional level. In fact uPA 
upon binding uPAR, modulates the activity and/or the level of a mRNA-stabili'zing factor 
that binds the coding region of uPAR-mRNA. We now investigate the relevance of uPA- 
mediated positranscriptional regulation of uPAR expression in non small cell lung 
carcinoma (NSCLC), in which the up-regulation of uPAR expression is a prognostic 
marker. We show that uPA is able to increase uPAR expression, both at protein and 
mRNA levels, in primary cell cultures obtained from tumor and adjacent normal lung 
tissues of patients affected by NSCLC, thus suggesting that the enzyme can exert its 
effect in lung cells. We investigated the relationship among the levels of uPA uPAR and 
uPAR-mRNA binding protein(s) in NSCLC. Lung tissue analysis of 35 NSCLC patients 
shows an increase of both uPA and uPAR in tumor tissues, as compared to adjacent 
normal tissues, in 27 patients (77%); 19 of these 2.7 patients also show a parallel increase 
ot die level and/or binding activityof a cellular protein capable of binding the coding 
region of uPAR-mRNA. Therefore, in rumor tissues, a strong correlation is observed 
among these 3 parameters, uPA, uPAR and the level and/or the activity of a uPAR- 
mRNA binding protein. We then suggest that uPA regulates uPAR expression in NSCLC 
at a posttranscnptional level by increasing uPAR-stability through a cellular factor that 
binds the coding region of uP AR-mRNA. Copyright 2003 Wiley-Liss, Inc. 
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Vascular endothelial growth factor expressioD correlates with matrix 
metalloproteinases MT1-MMP, MMP-2 aud MMP-9 in human 
glioblastomas. 

Munaut C, Noel A, Hougrand O , Foidart JM. Boniver J. DcprczM' 

Laboratory of Tumour and Development Biology, University of Liege, Liege, Belgium. • 

Vascular endothelial growth factor (VEGF) is the major endothelial mitogen in central 
nervous system neoplasms and it is expressed in 64-95% of glioblastomas (GBMs) 
Tumour cells are the main source of VEGF in GBMs whereas VEGF receptors (VEGFR- 
1, its soluble form sVEGFR-l, VEGFR-2 and neuropilin-l) are expressed predominantly 
by endothelial cells. Infiltrating rumour cells and newly-formed capillaries progress 
through the extracellular matrix by local proteolysis involving matrix metalloproteinases 
(MMPs). Recent studies have shown that VEGF expression and bioavailability can be 
modulated by MMPs. We reported previously that the expression ofMTl-MMP in 
human breast cancer cells was associated with an enhanced VEGF expression We used 
quantitative RT-PCR, Western blot, gelatin zymography and immunohistochemistry to 
S .!. eXpreSSi ° n ° f VEGF ' VEGFR "^ VEGFR-2, sVEGFR-l, neuropilin-l r MTl- . 
MMP, MMP-2, MMP-9 and TIMP-2.in 20 human GBMs and 5 normal brains The 
express.on of these MMPs was markedly increased in most GBMs with excellent 
correlation between mRNA and protein levels; activated forms of MMP-2 and MMP-9 
werepresent in 8/18 and 7/18 of GBMs. A majority of GBMs (17/20) also expressed high 
aS P"™ 0 " 5 ^ reported, with strong correlation between VEGF and MT1 - 
MMP gene expression levels, and double immunostaining showed that VEGF and MT1- 
MMP peptides co-localize in tumour and endothelial cells. Our results suggest that the 
interplay between metalloproteinases and VEGF previously described in experimental 
rumours may also be operative in human GBMs. Because of its dual ability to activate 
MMP-2 and.to up-regulate VEGF, MT1-MMP might be of central importance in the 
growth of GBMs and represent an interesting target for anti-cancer treatments. Copyright 
2003 Wiley-Liss, Inc. 
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Vascular endothelial growth (actor (VEGF) is the maior 
endo hehal m ,toge„ i„ central nervous system neophX 

C U 'r celh arf ^ d **** °' * B <*'*t™* (GBMs^Tu- 

T olv "? roatrix ""^proteinases (MMpJl ^ 

celh was associated with an enhanced VEGF expression 
used quantitative RT-PCR. Western blot, gX zym J£ 
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tumour and endothelia. cells. OuTres^s t h 't*Z 
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and A S « S ™f '°' deve, <*™' of "0™al .issue 
ana solid tumours. This process includes ihe degradation of Ihe 
extracellular «,» (ECM) and .he proliferation! mi™"on and 

f„h^? 2 ^ end °"' e,ia ' Ce " 5 - and i$ fin"y"S°cd by 
inhibitory and promoting factors.. .» Among positive faCrs vas 
cular endo.hehal growth facor ( VEGF) ha! ^ proposed ^ 
rnapr endothelial mitogen j„ central nervous 
phsrns.3 Strong VEGF expression has been de.e ed J y 
h«.o=hem,s.ry ,„ W-95% of glioblastomas (GBMs) ™GBMs 
^ be most common malignant primi , ive , um l ours 0 / lhe ^ BMs 
adults.' Microvascular proliferation is charac.eris.ic of thefe W 
mours and ,s an essen.ial WHO diag„os.ic criteria.' Tumour ceMs 

e the mam source of VEGF in GBMs whereas VEGF re" p,o 
are P"dorr,ma„.ly expressed by endo.helial cells." These receo- 

VEGFR- I (fli-|) and VEGFR-2 fflk/KDR) belono I0 c lhe ,, 
tyrosine-kinase receptor fnmilv VEGFR I h,« , i - , 
(sVFfiFB n ,h« ■ v cUIK-l has a soluble isoforro 

NRPrf£ , r" VEGF •»«a»Mfi«y. , - u Neuropilin™ 
aimuty ol the VEGF 1M isoforro for VEGFR-2 •- NRPl « ,),„„„>„ 
to moduJa.e VEGF-mcdia.ed rumour aneiogenesi^n human 
bgnan, as.rocy.omas.-o The coordinalcd up^reguJa.i^n^T VEGF 



_ Infiltrating ; rumour cells and newly-formed capillaries process 
tough rbe ECM by local proteolysis involvingmMrtx Ko- 
SS^'" KmPS ^.eolydcLymesTa, ^ 
r™?^ """^ J?***™- Their ac.iva.ion requ ir« ,he 
removal of a propepude by proteinase cleavage and can be inhib- 
..ed b, mow ussue .nhibi.ors of MMPs CTMPs). Most mm 

■n Ae ceD membrane by a transmembrane domain or by a glyco- 
sylphospha.audyl-inosi.ol anchor and are classified as merobr^nt 
type MMPs (MT-MMPs). '?»»» Jn cultured hC^cT m- 
MMPs .end to accumulate on.. be cyioplasmic membranTof 
mvado^dia where ,hey selecjively mediate local ^eSuW 

MMP^.- ^Among ^ MMPs - MT«"MMP migh. play a cen- 
" m rem0d - KnS ° f ,he ECM as ,hij »™br^oo^ 
MTf MMP 2 ,, aCU r' e MMP " 2 3nd MMP-13. 2 . « Moreover! 

IT.U " 0 by " S ab,,i '' 10 cleave a major 

■?S ^r em memb ^- 2 " 4 l-ough .he' proc« S - 
infegrin^ J ° f reCep,OT f ° r ^"'O"^)" and of a^ 3 

MTl-MMP isinvolved in bom development arid rumour an- 

trrt,^ assoc,a,ed w„h enhanced in vitro invasion and 
increased m vivo tumour growth and vascularization » We have 
shown previously thai in MCF-7 breast cancer cells, VEGF tran- 
scripnon is upregula.ed when MTl-MMP is overexpressed." Up- 
regulation of VEGF by MTl-MMP has also been reported in'a 
model of human ghoma xenograft by Deryugina et o/» These 
«pe„men.al data suggest a link between MTl-MMP and (he 
VEGF network. We have tested for .he presence of such a link in 
human ^oblas.omas^We compared the expression of VEGF and 
its receptors (VEGFR- 1, sVEGFR-l, VEGFR-2, NRPl) with 
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RNA extraction and cDNA synthesis 

Total RNA was extracted from cryosections with RNeasy Mini 
Kit (QIAGEN GmbH, Hilden, Germany) according to the manu- 
facturer's protocol. Total RNA (1 p.g) was reverse transcribed with 
a ThermoScript reverse transcriptase (ThermoScript RT-PCR Sys- 
tem, Invitrogen, Carlsbad, CA) and random hexamers s£s primers. 

Primers 

■ Primers pairs used in our study are described in Table I. Primers 
for the VEGF gene were chosen to distinguish between VEGF I8 ?> 
VEGF, ^ VEGF 145 and VEGF, 2I mRNA isoforms. Jntron-span- 
ning primers and probes for the TaqMan system (primers for ■ 
VEGFR-1 (Fh-l)„sVEGFR-l, VEGFR-2 (KDR/Flk-1) and NRPl) 
were designed to meet specific criteria by using Primer Express 
software (Perkin Elmer, Foster City, CA> All primers were syn r 
thesized by Eurogentec (Liege, Belgium). The 5'- and 3'-end ■ 
nucleotides of the probe were labeled with a reporter (FAM = 
6-carboxy-fluorescein) and a quencher dye (TAMRA — 6-car- 
boxy-tetramethylrhodaroine). We conducted BLASTn (National 
Center for Biotechnology Information; Be'thesda) searches against 
dbEST and the non redundant set of GenBank, EMBL, and DDBJ 
database sequences to confirm the total gene specificity of the 
nucleotide sequences chosen for the primers. The specificity of the 
amplified PCR productswas confirmed either by restriction digest 
or by sequencing. The 18S ribosomal RNA was measured using 
the Pre-Developed TaqMan Assay Reagents Endogenous control 
kit from Applied Biosystems (Foster City, CA). 



TABLE 1 - SEQUENCE OF PRIMERS AND TaqMan PROBES USED FOR RT-PCR STUDIES 


Gene and accession 
number 


Position 


Sequence 


Size 


Cycles 


MMP-2FP 
MMP-2-RP 
NMJXM530 


1740F 
1964R 


5'-AGATCTTCTTCTTCAAGGACCGGTr-3' 
5%CGCTGGTCAGTGCCITGGGGTA-3' 


225 bp 


33 


MMP-9-FP 
MMP-9-RP 
JQ5070 


1592F 
1800R 


5'"GCGGAGATTGGGAACCAGCTGTA-3' 
5'-GACGCGCCTGTGTACACCCACA-3' 


208 bp 


37 


MMP-14-FP * 
MMP-J4-RP / 
NM_004995 


1288F 
I508R 


5;-GGATACCCAATGCCCATTGGCCA-3' 
5*-CCATTGGGCATCCAGAAGAGAGC-3' 


221 bp 


32 . 


T1MP1-FP 
TIMP1-RP 
M12670 


' : 78F 
245R 


5'-CATCCTGjTGTTGCTGTGGCTGAT-3' 
5'-GTCATCTTGATCTCATA ACGCTGG-3 ' 


168 bp 


33 


TJMP-2-FP 

TTMP-2-RP 

NM 003255 

VEGF-FP 

VEGF-RP 

AH0O1553 


78F 
245R 

1208F 
1687R 


5'-CTCGCTGGACGTTGGAGGAAAGAA-3' 
5 r -AGCCCATCTGCTACCTGTGGTTCA-3' 

5'-CCTGGTGGACATCTTCCAGGAGTA-3' 
5'-CrCACCGCCTCGGCrTGTCACA-3' 


155 bp 

479 bp 
407 bp 
347 bp 
275 bp 


30 
33 


28S rRNA-RP 
28S rRNA-RP 
U 13369 


12403F 
I26J4R 


5'- GTTC ACCC ACTA ATAGGG A ACGTG A - 3 ' 
5 ; GATTCTGACTTAGAGGCGTTCAGT-3' 


212 bp 


.19 ' 


VEGFR1-FP 
VEGFR1-RP 
VEGFRI Probe 
AF063657 


2438F 
251 6R 
2469 


5 '-TCCCTT ATG ATGCCA GC A A GT- 3 ' 

5'-CCAAAAGCCCCTCTTCCAA-3' 

S'-CCGGGAGAGACTTAAACTGGGCAAATCAO' 


79 bp 


40 


sVEGFRl-FP 
sVEGFRl-RP 
sVEGFR) Probe 
U0JI34 


2209F 
2388R 
2257 


y-ACAATCAGAGGTGAGCACTGCAA-3' 
5*-TCCGAGCCTGAAAGTTAGCAA-3' 

5'-TCCAAATTTAAAAGCACAAGGAATGATTGTACCAC-3' 


180 bp 


40 


VEGFR2-FP 
VEGFR2-RP 
VEGFR2 Probe 
AF063658 


79JF 
946R 
820 


5'-CTTCGAAGCATCAGCATAAGAAACT-3' 

5'-TGCTCATCAGCCCACTGGAT-3' 

5'-AACCGAGACCTAAAAACCCAGTCTGGGAGT-3' 


156 bp 


40 


NRP1-FP 
NRPI-RP 
NRPl Probe 
XM 034725 


)83)>F 
1942R 
J 883 


5'- CAC AGTGG A ACAGGTG ATG ACTTC- 3 ' 

5^AACCATATGTTGGAAACTCTGATTGT-3' 

5'-CCACAGAAAAGCCCACGGTCATAGACA-3' 


112 bp 


40 



MT1-MMP, MMP-2, MMP-9 and TTMP-2 in a series of 20 GBMs 
and 5 normal brains. Using quantitative RT-PCR,. gelatin zymog- 
raphy, Western blot and immunohistochemistry, we showed a 
strong correlation between the expression of VEGF, MT1-MMP, 
MMP-2 and MMP-9 in GBMs. These results are in accordance 
with previous in vitro studies and add to the evidence of an 
interplay between VEGF and MMPs in the progression of human 
GBMs. 

MATERIAL AND METHODS 

Patients 

We studied 20 GBMs diagnosed at the Laboratory of Neuropa- 
thology-CHU Liege between 1997 and 2001. The series included 
17 primary GBMs (te., no previous history, of lower grade diffuse 
astrocytoma) and 3 secondary GBMs (i.e., previous history of 
lower grade diffuse astrocytoma). Clinical information on these 20 
cases have been reported previously as part of a larger series. 31 The 
gender ratio was 1/1, and the age at time of diagnosis ranged from 
41-79 years (mean 56 years). Normal brain cortex and white 
matter were obtained from 5 patients with intractable epilepsy 
treated by partial temporal lobectomy. Histological examination of 
these specimens showed severe hippocampal sclerosis; frozen tis- 
sue was sampled from microscopically normal inferior temporal 
gyri. Our study was approved by the Ethical Committee of the 
Faculty of Medicine of the University of Liege. 
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VEGF VEGFR1 sVEGFRI VEGFR2 NRP 

JtfX J" ^GF and VEGF receptors mRNA quantification: scat- 

spote) mRNA levels are expressed as normalized values (as described 
■n Material and Methods: A.U = arbitrary units). Each riinT^ore 
sents the mean of 3 separate experiments. ^ P 

TJMP-2 mRNA and, VEGF mRNA isoforms 
MTl-MMP, MMP-2, MMP-9, TIMP-2 and VEGF mRNA iso- 
V EGF 165l VEGF )45 and VEGF..,) were mea- 

^ '? ? al j ( i UOtS ° f CDNA UsIn 8 Tac » Polymerase (Takara 
Shiga, Japan) and 5 pmo] of each primers (Tabte 1). TT* thermal 
cycling condnmns included 2 min a, 95X for denaturationTnd 
b ;"S ,5c f°^ scc * 94*C, 20 sec a. 66"C and 20 sec at 

72«P C f ° r / EGF iS ° f0ims) Wi,h 3 finaJ inc ">»*» 2 min at 
t ,\>- ClS Were reso ^ed O" 2% Nusieve 3:1 agarose 
gels BroWhittaker, Rockland, MD) and analyzed using a IWS 
^Imager (Bio-Rad Hercules, CA) af J etbidium bVom^e 
stammo Specific mRNA levels were expressed as the ratio of 

^SfSS^ E >« - W af 

J?ifcST^ * Vfc ™' ; ' V£G ™' 2 

,vprrU in ? e v2^ latiVe RTPCR ana,V5es ^ VEGFR-J 
sVECFR-1, VEGFR-2, NRPI mRNAs and 18S rRNA were car' 
ne ou, using tbe ABJ PRISM 7700 Sequence Detectn Sys em 

of ncR a D nd ? ° f,Ware (P !> ,ied Biosystems). 7he sequence" 
ol the PCR primer patrs and fluorogenic probes that were used for 
each gene are shown m Table I. A standard curve was generated by 

For^h ^ ™ m dUp,icate dmin 6 cve ^ "Penmen.. 

For each exper.menial sample, the amount of target £ne was 
denned from this standard curve. The relative ex^es^n leve 
of the target gene was normalized against 18S rRWA io compen- 



S ^w Var J atl01 ! ,n ^ quaVity ofmA ™* the ^ount of input 
cDNA (as desenbed by the manufacturer PE Applied Biosystems 

10 V TJ?lT 2) ' rcR was dmfcd "* ^ «* TaoMan 
"SS T*? 4 ? tCr Mh (Appiied biosystems) US mg5 pJ of diluted 
^ x, (Cqi ? ValCn - n0 JOng ,otal 200ru\f of the probe, and 

400 nM primers in a 25 pJ finaj reaction mixture. After a 2 min 
incubation at 50*C to allow for UNG cleavage, AmpliTaa Gold 
was activated by an incubation for 1 0 nun ai 95*C Each of the 40 
PGR ^ cycles consisted of 15 sec of denaturation at 95°C and 
hybridization of probe and primers for 1 min at 60°C 

To confirm amplification specificity, the PCR products were 
also examined- by subsequent 2% agarose gel elmrophoresis. 
Experiments were repeated at least 3 times in duplicate. 

Jmmunohistochemisiry for VEGF and MTJ-MMP 

k CC ^ S (4 ^ ^ Ck) were cut Tlom formalin-fixed, paraffin 
embedded ttmiour tissue. TTley were hydrated through graded 
alcohols and jncubated m H 2 0 2 (0.3% 15 min). SeXifwere 

re^eval (Dako, Glostrup, Denmark). For double immunoslainL 
aH? ; D ^^ in Plimaiy mo ^lonal Ab antiW 
^5 K h,0 °. (Oncogene Research Products, San DfeU 
CA) followed by peroxidase-conjugated EnVision (Dako); Imn? u - 
SSS! V,SUaI ? Cd 3 >3' d ^nobenridine (DAB+, 
VEGF iTsnT, TU*"" i nCUbmed Clonal Ab anti- 
VEGF 1.150 (Santa Cruz, Santa Cruz, CA) for 1 hr at room 
SEES followed by alkaline phosphatase-conjugated^ 
Vis on (Dako). Immunoreac.ivaty for VEGF was visualized with 
Fast Red chromogemc substrate (Dako). Single immunostaining 
was also carried out on serial sections using each primary anbbody 
alone with me corresponding enzyme- chromogene combination 
Negative controls were obtained by omitting the primary antibod- 

Gelaiin zymography assay 

MMP-2 and MMP-9 activities were quantified by gelatin zy- ' 
mography on 2 normal brains and 18 GBMs. Ten cryosections (10 
p.M) were homogenized in buffer (0.1 M Tris-HO pH 8.1, 0.4% 
Jriton X-100) and centrifuged for 20 min at 5,000f. The pellets 
were discarded. 25 u.g of total protein from homogenate superna- 
tants were mixed with non reducing sample buffer (62.5 mM 
Tns-HCl P H 6.8; 2% SDS: 10% glycerol; 0.1% bromophenol 

^!^ncDf^P h0rCSed direCt,y on J0% SDS-polyacrylamide 
ge/s (5DS-PAGE) containing 0.1% gelatin (w/v)^ After electro- 
phoresis gels were washed for 1 hr at room temperature in a 2% 

tin ;7£ n . X ; 2? S ° ,Uh0n to removc SDS > ^nsfened to a buffer 
(50 mM Tm-HQ pH 7.6, containing 10 mM CaCl,) and incu- 
bated for 18 hr at 37X. Gels were stained for 30 muTwith 0.1% 
w/v) Coomassie brilliant blue G250 in 45% (v/v) methanol/10% 
- (v/v) acetic acid and destained in 10% (v/v) acetic acid/20% (v/v) 
methanol. Gels were analyzed with Quantity One software (ver- 
sion 4.2.2, Bio-Rad Laboraiories, Hercules, CA) after densitomet- 
nc scanning of the gels using a Fluor-S Moltimager (BioRad). 
Western blot 

^^rJ^ M » PTOXcin ,CVe,S were in 2 normal brains and 

buffer [0.25 M Tns ( P H6.8). 10% SDS (w/v), 4% sucrose (v/v) T 
a V-™™?^*^ (v/v) and 0. 1 25% bromophenol blue (w/v)J 
and boiled for 5 mm. They were separated on 10% SDS-PACE 
gels and transferred to a PVDF hjier (NEN f Boston. MA). After 
blocking wjth 5% milk (w/v), 0.1% iween 20 (w/v) m PBS for 2 
hr at room tempejature. membranes were exposed to the primary 
ant.body (JO jxg/ml, clone I13-5B7, Ab-4, Oncogene Reseaich 
Products, San Diego, CA) at 4X overnight followed by incubation 
with a horseradish peroxidase-conjugaied rabbit anti-mouse anti- ' 
body (J.3 pg/ml, Dako, Glostrup, Denmark). Signals were de- 
jected vvMh an enhanced chemoiumJnescence (ECL) kit (NEN. 
Boston, MA). The relative intensities of (he immunoreaclive bands 
were analyzed with Quantity One software (version 4 *> 7 Bio-Rad 
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Figure 2- MMPs and T1MP-2 mRNA quantifica- 
tion, (a) Representative 2% agarose gels of RT-PCR 
products for MTI-MMP, MMP-2, HMP-2 and 
MMP-9 in 2 norma] brains (NB) and 10 GBMs. (5) 
Scatter plots (as described in Fig. 1). Experiment was 
repeated at least 3 times in duplicate. 
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Laboratories) after densitometry scanning of the X-ray films using 
a Fluor-S Multimager (Bio-Rad). 

Statistics 

VEGF, VEGFRs, MMPs and TIMP-2 expression values in 
GBMs were correlated using SpearroajTs test. Correlation was 
considered significant for 2-tailed p-value < 0.05. Statistical anal- 
ysis was carried out using the Prism 3.0 software (GraphPad, San 
Diego, CA). 

RESULTS 

Expression of VEGF and VEGF receptors 

VEGF mRNA was present in normal brains (295-375, arbitrary 
units; mean = 322) and in all GBM samples (217-5,1 12; mean = 
1/774) as reported previously (Fig. I). 3 ' In most GBMs, VEGF 
mRNA levels were raised 2- 1 5- fold above normal brain values. 
The most abundant isoform in all cases was VEGF )65 , followed by 
VEGF 121 , VEGF 189 and VECF M5 (data not shown). VECFR-J 
expression was found at similar levels in GBMs (89-357; mean.= 
182) and normal controls (154-198; mean = 181). There was no 
correlation between VEGFR-J and VEGF mRNA levels ip = 
0.35) in GBMs. VEGFR-2 was expressed in all GBMs (48-582: 



mean = 210) and in 8/20 cases at least twice normal values 
(87-111; mean = 103). VEGF and VEGFR-2 expressions were 
correlated significantly ip = 0.0035) in GBMs; NRP1 expression 
varied broadly between GBMs (75-3,260; mean = 1.061) con- 
trasting with a constant baseline expression in normal controls 
(383-397; mean = 390). In tumours, NRP1 correlated with 
VEGFR-2 ip = 0.0119) but not with VEGF ip = 0:084), nor 
VEGFR-1 [p = 0.066). s VEGFR-I was expressed at low levels 
both in normal brains (84-92; mean = 87) and GBMs (25-208; 
mean = 101). sVEGFR-1, however, was found to correlate with 
VEGFR-) [p = 0.0289). VEGFR-2 ip = 0.0029). and NRP1 ip - 
0.0027) but not with VEGF ip = 0.053). 

Expression of MMPs and TIMP-2 

MTI-MMP, MMP-2 and MMP-9 were expressed in both nor- 
mal brains and GBMs but at much higher levels in the latter (Fig. 
2n). MTI-MMP mRNA levels were constantly higher in GBMs 
(34-202; mean = 106) than in normal controls (3-29; mean - 
15). MMP-2 and MMP-9 mRNA levels were higher than controls 
in 1 8/20 and 14/20 cases respectively (Fig. 2b) and correlated with 
each other ip = 0.0187). MTl-MMP.mRNA levels correlated with 
MMP-2 ip = 0.0008) and MMP-9 ip = 0.005). TIMP-2 had a 
non-discriminative distribution in relation to the controls. TIMP-2 
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TABLE JH -CORRELATION BETWEEN VEGF VFPF Dtrrr^nn v 



MTl-MMP ' MMP-2 



VEGF 

VEGFR-1 

VEGFR-2 

NRPJ 

sVECFR-1 



0.0250 
0.0073 
<0.000l 
0.0053 
0.0313 



MMP-9.. 



TIMP-2 



0.0245 
0.07)0 
0.0168 
0J334 
0.0469 



proMMP-9 


Nl 


0.4 J 


N2. 


0.28 


GB1 


9.00 


GB2 


4.28 


GB3 


1.67 


GB4 


1.40 


GB5 


0.47 


GB7 


5.29 


GB8 
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Figure 5 - Immunohistocbemis- 
try (scale bar = 50 jim). (ajb) i VEGF 
positive tumour and endothelial cells 
(plain arrows) show granular red 
staining of the cytoplasm. Negative 
cells (empty arrow) are seen in their 
close vicinity, (c) MTI-MMP posi- 
tive cells show a strong brown cyto- 
plasmic staining. They include tu- 
mours cells (plain arrow) and • 
endothelial cells, (empty arrow), (d) 
MTI-MMP positive tumour cells 
(plain arrow) are mixed with nega- 
tive cells (empty arrow), (ej) Double 
staining with VEGF (red) and MTI- 
MMPI (brown). Double positive ru- 
mour cells (plain arrows) contrast 
with negative or single weakly pos- 
itive cell (empty arrow). 



pericellular proteolysis mediated by MTI-MMP in GBMs could 
also induce an aulocrihe loop resulting in enhanced VlEGF expres- 
sion. In turn, VEGF could act as a paracrine factor on endothelial 
cells to stimulate angiogenesis. or possibly as an autocrine factor 
promoting glioblastoma cells survival- migration and invasion as 
demonstrated recently in ihe various tumour cell culture models. 

We compared the expression of VEGF and its receptors with 
MTI-MMP, MMP-2 and MMP-9 in 20 GBMS and 5 normal brains. 
The expression of these MMPs was markedly increased in most 
GBMs with excellent correlation between mRNA and protein levels. 
MTI-MMP expression has been shown previously to correlate with 
glioma aggressiveness and its transfecrion in different tumour cell 
lines triggers an angiogenic phenotype and promotes tumour 
growth. 20 ^ 8 - 30 - 36 - 38 A majority of GBMs ( 17/20) also expressed high 
levels of VEGF, as previously reported, with a strong correlation 
between VEGF and MTI-MMP gene expression levels. Double in> 
munostaining studies showed co-expression of VEGF and MTI- 
MMP by the same tumour cells. These data suggest that the transcrip- 




tional control of VEGF by MTI-MMP could be operative not only in 
vitro but also in vivo in human GBMs. 

MTI-MMP could also promote the growth of GBMs by its 
ability to activate MMP-2 in the presence of low concentration of 
TIMP-2. J0 Pro-MMP-2 activation occurs after the formation of a 
ternary complex that contains pro-MMP-2 linked to cell surface 
MTI-MMP via a TJMP-2 bridge. In accordance with this hypoth- 
esis, we found that MMP-2 activation occurred in 8/ J 8 of our 
GBMs 20 - 41 among which 7/7 tested for MTI-MMP showed high 
contents of this protease. 

Activated MMP-9 was also found in 7/18 of our GBMs. This is 
an interesting finding as active MMP-9 is able to mobilize VEGF 
from its ECM reservoir. 3 * Therefore, MMPs could promote 
VEGF-mediated angiogenesis in GBMs by both transcriptional 
(MTI-MMP) and post translalional (MMP-9) mechanisms. 

VEGF binding to VEGFR-2 triggers the proliferation and mi- 
gration of endothelial cells whereas its binding to VEGFR-1 has 
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°oS?i le . eff ? ls °" g |ioblas, °ma«Il lines. '"2 In oor study VEGF 
mRNA levels were correlated wilh VEGFR-2 but not VEGFR-J 
NRPI and sVEGFR-l. Collectively our data suggest *a, GBMs 

£ri I vp^* C "? COmp,CX pat,em of VECFreceptors, trans- 
ducing VEGFs.gnalmg toward cell proliferation and migration 

hert^^T' ° Ur StUdy 0,6 evidence «*• » i»«erplay 

™" ********* and VEGF in human GBMs as previ- 

to activate MMP-2 and to up-regulaie VEGF, MT1 -MMP mH« be of 
'-T 0 ^ - *e growth of human ghobJas.omas^rf ' ete- 
sent an uteresung targe! for anti-cancer treatments. 
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DNA hypcrmetkylation is a mechanism for loss of expression of the HLA 
class I genes in human esophageal squamous cell carcinomas. 

£«£& XangG, SongY, ZbaoX, SoC, Ua^; Wan^JUO, Yang^. 

laboratory for Cancer Research, College of Pharmacy, Rutgers-The State University of 
New Jersey, 164 Frehnghuysen Road, Piscataway, NJ 08854-8020, USA. 

The three human leukocyte antigen (HLA) class I antigens, HLA-A, HLA-B and HLA-C 
play important rol es in the elimination of transformed cells by cytotoxic T cells Frequent 
loss of expression of these antigens at the cell surface has been observed in many human 
cancers. Venous mechanisms for post-transcriptional regulation have been proposed and 
jested uut the molecular mechanisms for transcriptional regulation are not clear We show 
by immunolustochemistry that the HLA class I antigens are absent in 26 of 29 (89%) 
samples of human esophageal squamous cell carcinomas (ESCC). Eleven of the 26 ESCC 
samples lost mRNA expression for at least one of the HLA genes, as shown by RT-PCR 
DNA from the 29 pairs of ESCC and neighboring normal epithelium were examined for' 
CpO island hypermethylation, homozygous deletion, microsatellite instability (MSI) and 
loss of heterozygosity (LOH). DNA from normal epithelial tissues had no detectable 
methylation of the CpG islands of any of these gene loci. Thirteen of 29 ESCC samples 
(43 /,) exhibited methylation of one or more of the three HLA loci and six samples (21%) 
exhibited methylation of all three loci. The HLA-B gene locus was most frequently 
methylated (38%). HLA-B mRNA expression in an ESCC cell line, where HLA-B was 
hypermethylated and did not express mRNA, was activated after treatment with 5-aza-2'- 
deoxycytidme. Homozygous deletion of these three gene loci was not observed 

S«L ^°o W ratCS ° f L ° H and MSI Were observed for microsatellite markers 
. D6S306, D6S258, D6S273 and D6S1666, close to the HLA-A, -B and -C loci, although a 
lugh rat 10 of LOH was observed at a nearby locus (represented by the markers D6S1 05 1 
and D6S1560), where the tumor suppressor gene p21(Wafl) resides. A strong correlation 
between genetic alterations and mRNA inactivation was observed in the ESCC samples 
Our results indicate that HLA class I gene expression was frequently down-regulated in' 
bi>CC at both the protein and mRNA levels and that hypermethylation of the promoter 
regions of the HLA-A, -B and -C genes is a major mechanism of transcriptional 
inactivation. 
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HERVd«. ^ u, soft tissue tun™, rem.ins poortj, under- 
stood. Tlus the present srudy iovestig,..* the expression of MERJ/ 

»e» mRNA lev,* w . m «asu,e<l to ft o Ia > tissue ample, usto, 1 
rename reverse b^nsoiprJon.pohynrer.se chair, reaction assay- pro. 

^pression w assessed by imxmurohistocbemis^ using „ 
" ra2/n ™ P-**"* a»-*bd/. Six „ mJ delera. „„sl speci. 
jnens vere used ,„ establish basal icels of HER2/neu mRNA. 
HERJ/neu rranseripts «ere detected in all normal tissues and SSs 
Four of 1, sarcomas (Sl%) demonstrated HER2/ne» mRNA levels 
above the mean value, **ereas 3 runrors displaved HER2/»e» 

pro.em overexpression. Bod. membranous and cytoplasmic patterns 
of unmunostaminB were observed, and a s^ong corre)a £ ^ 

lurao S ^°™ 1 Sarcom * tSS) » » aggressive so fl touc 
tumor thai account for up to 10% of sarcomas, with a 
pea* mc.cler.ee >n adolescents and young adults. This 
umor occurs ,„ 2 major forms, biphasic and tnonopha- 
NBYfen ,, 1 , 1 S , C ^ , °g= nelicall y characterized by the t(X; 
A.,K {?. '> found in >95% of cases. 

Although traditionally considered lo be a high- K rade 

£„T . .T mflucnce Prognosis, including morpho- 
ogical and cytogenetic features, u-eataen. strategies 
the plo,<ly status, and the apoplotic index ' ' 

...eh f cvelo P ,n r en ' of new ihenpculic advancement 
such as the specific targeting of molecular attentions 
present ,„ human malignancies, has brought to light the 
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found brtween protein expression and mRNA !ev<J (/> = 0 on 
Increased llERS/neu mHNA levels ^ c ^ 

.Wrnskof de^lopng recurrences <#>= 0.02). Moreover, none of 
^ P : &ente ^ overextension developed n^t^i, 

Our d,u demonstnue th a . HER2/neu is expressed fa, SS, aod tha, 
^r^r e i :m f i C> * t<>PWie HER »/»™ expression correlate 
l J^Slw 1 ' rCSU,tS ^ 1116 P»»*e «»' leased 

levels of HBR2/neu in SSs is associated with a rn^e favorable clinical 
ourse. Further smdies are needed to assess the role of this oncogene 

^; HER2/nen, svnoviaj sarcoma, real-time HT-TCR im- 
miinohjstocbemistry. ' 

AtorrviatioK: flS)I t fluorvcrw in jifa, hytnidiznH^ RT-PCR 
reverse txaT^oon^rvmerase chain reaction, SS, synovial sarcoma.' 

need to identify not only prognostic factors, but also 
tumor features that are predictive of response to therapy. 

One of the most exiensivdy studied molecular 
TK^urD'o^^ is thG H£R - 2 /«cu P rotcM>ncogene- 
HiLR " 2/neu oncogene (also kno>vn as c-erbB-2) 
located on chromosome I7q2l, is a member of the 
tyrosine kmase receptor family and encodes for a 185- 
Modalton protein that shows 50% homology wiih the 
epidermal growth factor receptor ^ This gene ts am- 
phhed, and/or over ex pressed "in "20% lo 30% of breast 
carcinomas ■ and m variom other tumors, 6 and usually 
is assoaaled >vitl, tumor aggressiveness and poor pro,-' 
2 0 / ^ StUdiCS have 5u PP<>ned (he value of 

HhK-2/nen topredid the response to chemotherapy in 
breast cancer, and the use of recombinant humanized 
anubod.es to HER-2/neu protein (Trastuiumabj in ifae 
care of patients with advanced, metastatic breast tumors 
nas been approved. 

The role of HER-2/neu acbvahon in soft tissue 
tumors remains pooily understood, and scarce molec- 
uJa, data backing immunohistochemical studies have 
been reported. HER-2/ncu protein expression was mv 
inunohwrochcmicaJly stt.rjicd in 20-1 sarcomas, mrlud- 
mg b S&, and over expression ^ absent in all these 
malignant mesenchymal neoplasms. *° 

Recently, HJ-R-2/neu alterabons have been dr- 
scJibcd m osteosarcoma, ^itb a high incidence of pio- 
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TABLE 1. Clinfcoporthologic Features of 13 Patients With SS 



No. 


Age(y)/G 


Site 


Subtype 


Surgery 


Che mo/Radio 


Rec 


Mets 


HER2 


FU^nio) 


1 


68/F 


XI 


MF 


RR 


No 


Yes 


Yes 


L 


108 


2 


71/F 


KE 


. BF 


WA 


No 


No 


' No 


- L 


12 


3 


27/M 


KI 


. BF 


" ME 


Yes 


Yes 


Yes 


L 


24 


1 


39/M 


T 


MF 


WE. 


Yes 


No 


No 


H 


48 


5 


15/M 


N 


MFJ*D 


WE 


Yes 


No 


No 


H 


48 


6 


41/F 


F 


MFJ*D 


WA 


Yes 


No 


* No 


H 


48. 


. 7 


57/F 


Kl 


MF 


WA . 


No 


N/A 


N/A 


L 


N/A 


8 


■ 48/M . 


F 


BF,PD 


ME ' 


Yes 


Yes 


No 


L 


36 


9 


29/F 


T 


PD 


WE 


Yes 


Yes 


Yes 


L 


36 


10 


27/M 


KE 


BF 


WE 


No 


No 


No 


II 


24 


11 


62/F 


A . 


MF 


WE 


No 


Yes 


No 


L 


24 ■ 


12 


62/F 


F 


MF 


WA 


No 


No 


No 


L 


2 


13 


34/F 


L 


MF 


RR 


No 


No . 


No 


L 


2 



Abbreviations: Age, age at diagnosis; G,. gender; M, male; F, female; Site, anatomic location; KJ, knee, intra-articular; KE, knee, 
extra-articular; T, thigh; N, neck; F. foot; A, arm; U leg; Subtype, histological subtype; MF, monophasic fibrous; BF, biphasic; MF-PD, 
monophasic fibrous with poorly differentiated areas; BF-PD, biphasic with poorly differentiated areas; PD, poorly differentia ted; Surgery! 
primary surgical therapy; RR, radical en bloc resection; WA. wide through bone amputation; ME, marginal en bloc excision; WE^widc en bloc 
excision; Chemo/Radio, adjuvant postoperative chemotherapy and/or radiotherapy, Rec, recurrence; Mets, presence of metastasis; HER2, 
HER2 mRNA expression; L, low expression; H, high expression; FU, follow-up status; N/A, not avaible. 1 



tein expression, r 2 ngirtgfrorrH2% to 61 %. 111s Indeed, 
despite limited information on Her2/ncu in this type 
of malignancy, based mostly on immunohistochemicaJ 
findings, 2 clinical trials of Trastuzumab have been 
initiated for recurrent and metastatic osteosarcoma pa- 
tients (http://\v W w.cancer.goVc)inical_ trials: MSKCC- 
99097/NCJ-T98-0083 and COG-AOST01 21). 

Therefore, we evaluated the mRNA expression and 
the gene product of HER-2/neu in 13 SS patients using 
real-time reverse tr a nsoipuon-polym erase chain reac- 
tion (RT-PCR) and immuriohistochemisiry. To the best 
of our knowledge, this is the first report documenting 
increased levels of HER-2/neu mRNA and protein 
in SS. 



MATERIALS AND METHODS 

Potient Population 

Thirteen patients with primary SSs, obtained from the 
files of the Department of Pathology, Caetano Pini Orthope- 
dic Institute, we i e inc hided in this stud)'. Cases were those d 
based on the availability of frozen primary rumor. Patient age 
ranged horn 15 to 71 years (mean, 44.6 years). Anatomic sites 
included the Xnee (5 patients, 3 intra-articular ). thigh (2 
patients), foot (3 patients), ann (1 patient), neck (1 patient), 
aod J<?»»er )rg (7 par/em). The hhto)ogtcal subrj-pes nvre 3 
biphasic (BF), 6 monophasic fibrous (MF), 2 monophasic 
fibrous with poorly differentiated areas (MF-PD), 1 biphasic 
with pooiry differentiated areas (BFPD), and 1 poorly differ- 
entiated (PD). Clinical staging was 1JB for all the patients. 
Local surgical excision was performed in 9 patients; amputa- 
tion, in 4 patients. Recurrence was observed in 5 cases with 
subsequent amputation (3 patients) and Joca) excision (2 
patients). Metastasis occurred to lung (2 patients) and ingui- 
nal lymph nodes (1 patients). Five patients received adjuvant 
postoperative chemotherapy, and 1 patient (case 4) was 
treated with chemotherapy associated with radiotherapy. Fol- 
low-up ranged from 2 to J 08 month* (mean, 3-1.3 month*). 
Clinicopaihologic dan are summarized in Table 1. 



Pathologic Studies and Frozen Tissue 
Selection • 

In all cases, the primary tumor was available - for study. 
Surgically Tesected tumor tissues were partly snap-frozen in 
liquid nitrogen and stored at -80°Cfor RNA extraction, and 
partly fixed in buffered formalin and embedded in paraffin 
blocks. Hematoxylin and eosin-stained sections were re-«yaJ- 
uated and graded according to FNCLCC grading system. 16 
Frozen tissue blocks were handled as follows: 4-ft- thick frozen 
sections were cut and stained with hematoxylin and eosin to 
determine the percentage of tumor cells present in the spec- 
imen. We used tissue blocks with tumor cells comprising 
more than 80% of the specimen. About 10 20-u-thkk sections 
were collected into Eppendorf rubes. Another 4-ix-thick fro- 
zen section was cut after the serial sections and examined by 
light microscopy to guarantee the percentage of rumor cells 
collected. Trizol (life Technologies; Gibco BRL, Gaithers- 
burg, MD) was used for RNA extraction, according to the 
manufacmrer's protocol. RNA was quantified sp ectroph ©to- 
rn etri rally. 



cDNA Synthesis 

Total RNA (200 ng) was re verse- Iran scribed in a total 
volume of 50 fxL containing 1 X TaqMan buffer, 5.5 mmol 
MgClj, 1 mmol deoxynucleo tides, 2.5 fimol random hexam- 
ers, 20 U RNase inhibitor, and 62.5 U MuLv reverse transcrip- 
tase. The samples were incubated at 25°C for 10 minutes, 
48°C for 30 minutes and 95°C for 5 minutes. 



PCP Amplification 

Amplification reactions were performed with the Univer- 
sal TaqMan 2 X PCR master mix in a volume of 25 uL con- 
taining 300 nmol of each primer, 100 nmol of probe, and 5 
i*L of cDNA. Both f^acon and HER2/neu amplification were 
done in duplicate for each sample. 

The thermal cycling conditions included 2 minutes at 
SOX and 10 minutes at 95 °C, followed by 40 cycles of 95 = C for 
15 seconds and 60°C for 1 minute. All icngents used for 
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RT-PCR were purchased from Applied Biosystems (Foster 
City, CA). 

Primers ond Probes 

Primers and probes for £-actin and HER2/neu mRNA 
were chosen using the computer program Primer Express 
(Applied Biosystems). Sequences of the forward primer for 
HER2/neu mRNA (CenBank accession number X03363) 
were 5'-TCC TCT CTC CAC CTG CAT CAC3' and the 
reverse primer 5*-CCA AAC ACC ACC CCC AAG A-3'; the 
sequence of the TaqMan probe was 5'(FAM)-AGC ACA ATC 
CCA ACC ACC CCA CA-<TAMRA)-3\ Sequences of the for- 
ward primer for ^actin mRNA (CenBank accession number 
X00351) were 5-TCC TTC CTG CCC ATG GAG-3' and the 
reverse pnmer 5'ACG AGG ACC AAT GAT CTT GAT CTT- 
3'; the sequence of the TaqMan probe was 5' (FAM)-CCT GTG 
GCA TCC ACG AAA CTA CCT TG(TAMRA)3\ Probeswere 
purchased from Applied Biosystems. 

Reol-Time RT-PCR 

To measure HER2/neu expression in these tumors we 
used a real-ume quantitative RT-PCR based on TaqMan meth- 
odology, as previously described,'' with minor modifications. 
Briefly, this technique allows, by means of fluorescence emis- 
sion, to find the cycling point when PCR product is detectable 
(Ct value or threshold cycle). As previously reported, the Ct 
value correlates to the starting quantity of the target mRNA ia 
To normalize the amount of total RNA present in each reac- 
tion, we amplified the housekeeping gene ^-actin, which is 
assumed to be constant in both normal samples and rumor 
tissues. 

Our results are expressed as relative levels of HER2/neu 
mRNA, referred to a sample, called a 'calibrator,- chosen to 
represent ]X expression of this gene. The calibrator was a 
breast cancer cellular line (MCF-7) 19 that was analyzed on 
every assay plate with the unknown samples. All of the ana- 
lyzed tumors expressed n-fold HER2/neu mRNA relative to 
the calibrator. 

The amount of target, normalized to an endogenous 
reference (f*actin) and relative to the calibrator, was defined 
by the AACt method as described by Livak K (Sequence 
Detector User Bulletin 2; Applied Biosystems). Specifically 
the formula is applied as follows: 



. 25.0 



or 
E 



target amount = 2 -AAC ' 

where AA C, - ||Ct (HER2/neu sample) - Ct (/J-actin 
sample)) ~ fC. <HER2/neu calibrator) - Ct (^ac tin calibra- 
tor)]). 

Immunohistochemistry 

Formalin-fixed, paraffin-embedded tissue sections were 
cleparaffimzed, lehydrated, and exposed to the primary anti- 
body using the EnVision-f system (Dako, Carpinteria CA) 
Primary ann-HER-2/neu antibody (rabbit polyclonal ami- 

^ y o^ ra,0E nmnber AtM85; D3ko) TO a PP ,ied in 3 <'i»"Don 
of 1:2000 for 60 mmuies at ioojd temperature. Before expo- 
sure to the primary antibody, sections were miriowave-pie- 
trca.cd in EDTA, pH S.0, to retrieve antigenicity-, and incu- 
bated wuh endogenous peroxidasc-blocXmg solution for 10 
mmu.es at room temperature. Positive control, constituted bv 
? ^^V""™ 01 " 3 showing more than 80% positive staining 
lor H£R2/ n «i, as wet! as negative control, in which the 
primary ann body was omiued, were stained in parallel 



15J0 




IS 
DC 



norma) fesue n=6 synovial sarcomas synovirf Wc«rrBs 
(fow expression) (high expresstori) : 

FIGURE l. Distribution of HEP2/neu mRNA levels' in normof 
muscle tissues, ond in low- ond high-expression sarcomas. 
Dcrto are expressed as mean and slandord error of the mean 
for eoch group. 



All cases were examined for both cytoplasmic and mem- 
brane immunoreactivity. Cytoplasmic staining was evaluated 
on a semiquantitative scale, according to Kilpamc* et al with 
minor modifications, 20 and reported as 0 (no staining or 
stammg in <10% of cells). 1+ (wea* staining in >W% of 
cells). 2+ (moderate staining in >10% of celb). or 3+ 
(strong staining in >J0% of celb). The presence of a mem- 
branous pattern of staining was recorded separately and 
scored as absent . (no staining or weak staining in <10% of 
cells) or present (complete and/or incomplete staining in 
>10% of cells). Tumors with a cytoplasmic score of 3f were 
considered to hare high HER-2/neu protein expression. 

Statisticoi Analysis 

Statistical differences were calculated by Fisher's exact 
test. The Mest method was used to evaluate the differences 
between groups. Differences were considered statistically sig- 
nificant when Pwas <0.05. 



RESULTS 

HER2/neu mRNA Evoluotion 

All of the tissues analyzed contained delectable 
)eve)s of H£H2-neu mRNA. Sbt norma) tissue sampJes 
(skeletal muscle) were used to establish basal level of 
HER2/neu mRNA. All tbe normal samples expressed 
very low levels of HER2/neu mRNA. ranging from 0.9 
to 1.9 n (mean, 1.4 n). Among ihe 13 tumor samples, 
H ER2/ neu levels varied greatly, ranging from 2.1 to 
24 n. Setting a cutofT level at 7.9 n (a value that repre- 
sents the mean value of expression distribution of the 
SSs), 9 cases (69%) had low HER2/nen expression and 
4 cases (31%) had high HER2/ncu expression (Fig 1; 
Table 1). The difference between the 2 groups (low 
and high I JER2/neu lumois) was statist ir ally significant 
IP= 0.0004). 
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FIGURE 2. Immunohistochemicor locoltzotfon of HER?/neu in SS. (A) Positive control (breast cancer) showing typical strong 
membrane pattern ot posrtrvity. (B) Case ?, o biphasic SS. displaying very local membrane staining limited to a gland. Inset: tocal 
weak cytoplasmic posifivity tn the epithelial component ot the same case. <C) Case 10. the epithelial component ot this biphasic 
SS displaying strong membrane pattern ot staining. (D) Case A, an epithelioid area in o monophaslc fibrous SS. showing strong 
cytoplasmic positively. . y y 



HE R2/neu Prolein Expression 

Staining with HER2/neu antibody revealed a vari- 
able cytoplasmic and membrane staining pattern. 
Three tumors (23%) showed strong staining involving 
both the cell membrane and the cytoplasm (cases 4, 5, 
and 10); weak to moderate, exclusively cytoplasmic 
staining was observed in 7 cases (cases ], 3, 6, 7, 8, 9, 
and J1). No staining was detected in 2 tumors (cases 12 
and 13). In 1 case (case 2), a cluster of glandular 
structures representing <5% of the tumor showed 
weak cytoplasmic and very- focal membiane staining, 
the latter Iiinited to a single gland. The epidielial/ 
epithelioid components exhibited snongti cytoplasmic 
staining compared with die spindle-cell component of 
the "tumors. Membrane staining was predominantly in- 
complete and limited to the epiibelial/epirbelioid ar- 
eas. All 3 cases with high inwninohistoc hemical expics- 
sion of HER2/ncu (cases \ t b, and 10) were giade 111 
sarcomas, including 1 \1F. 1 J>F, and ] MF-PD SSs. 



Examples of HER2/neu /cyjdplaiinfc "arid merhbrane 
staining are depicted in Figure 2. 

Correlation of Molecular and 
Imrnunohistochemicol Results 

A strong, statistically significant association was 
present between protein expression, for both mem- 
brane ajid cytoplasmic staining, and HER2/neu rnRNA 
levels {P = 0.0]), although J case (case 6) displayed 
discordant results. Interestingly, this neoplasm showed 
high >JER2/neu mRNA levels, whereas only weak stain- 
ing, limited to the cytoplasm of a minority of tumor 
cells, was detected by inununohistochemicaJ analysis. 

HER2/neu Expression and Clinicopathologic 
Parameters 

Both HFR2/neu protein expression and mRNA 
levels were evaluated to establish the relationships ro 
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TABLE 2. Correlation Behyeen Clinkrorxithologic 
Features and HER2/neu Expression as Delected by 
IHC and RT-PCR 



HER2/neu 
IHC PGR 



Variable 



L H Pvalue L H PvaJuc 



Age (years) 

<40 

>40 

Sex 

Feroale 

Male 

Tumor size (cm) 

<5 

>5 

Histological grade 
II 

Histological type 
MF 
BF 
PD 



3 3 3 3 

7 0 . NS 6 J 

8 0 7 1 

2 3 0.03 2 3 

3 2 3 2 
7 1 NS ; 6 2 

3 0 30 

7 3 NS 6 4 



NS 



NS 



NS 



NS . 



5 ] 5 ) 

2 1 2 J 

3 J NS 2 2 NS 
Chero o/Jtacb"otherapy* 

Yes 4 2 3 3 

*° 5 1 NS 5 1 NS 

Recurrence! 

Yes . 5 0 5 0 

N° 2 3 NS 1. 4 0.02 

Metastasis f 

Yes 3 o 

No 4 3 NS 



3 0 
3 4 



NS 



Abbrevjabons: U low expression; H, high expression; NS, not 
s.gT»6cnm; MF, monophask fibrous; BF, biphasic; PD, poorly diller- 
endaled (including MF and BF with poorly differentiated areas). 

Information not available for case 7. 
1 Cases 7, )2, and 33 were excluded from the analysis. 

clinicopathologic features, including local recurrence 
and metastatic disease. Two cases (cases 12 and J 3) with 
follow-up Jess thari 12 months and 1 case (case 7) for 
which clinical information was not available were ex- 
cluded from the analysis of recurrences and metastatic 
behavior. 

No correlation was observed between HER2/neu 
mRNA expression and age, sex, tumor size, tumor 
grade, histotype, and metastasis. A correlation between 
sex of the patients and HER2/neu protein expression 
was found. In fact, none or the female patients showed 
high HER2/nen protein expression [P = 0.03) Pa- 
tients xvith high Her2/neu mRNA levels had a lower 
nsfc of recurrence- than those with low Her2/neu 
mRNA levels (P = 0.02). None of the cases with high 
HER2/neu mRNA levels developed metastatic foci, al- 
though the small number of observations precluded 
reaching statistical significance (P = 0.1). Results are 
detailed in Table 2. 

DISCUSSION 

The present work provides the first combined mo- 
lecular by real-time RT-PCR and immunohistochernical 
ev,dence that HER2/neu over expression occurs in SSs 



Our results indicate that this parameter may provide 
prognostic information and suggest that a specific ther- 
apy with humanized monoclonal antibodies against 
HER2/neu may be considered in a significant number 
ofSSs. 

The HER2/neu oncogene has been, extensively ' 
investigated as.a prognostic factor and more recently as. 
a predictor of response to therapy. It has been demon- 
strated in breast cancer, where HER2/neu overexpres- 
sion is usually associated with gene amplification, 2 ^and 
in other epithelial tumors, mcluding ovarian, gastric, 
lung, and urinary bladder carcinomas. 

HER-2/ neu amplification/ overexpression appears 
to be an early event in oncogenic transformation by 
interacting with other members of the HER family. 3 In 
breast cancer, it is involved in cell cycle and apoptotic 
pathways through the antiapoptotic effects mediated bv 
p53 and p21 deregulation/ 2 '^ . 

Whether HER2/neu overexpression plays an im- 
portant role in mesenchymal neoplasms . remains con- 
troversiaJ. An immunohistochernical study of sarcomas, 
using a monoclonal antibody, reported no evidence of 
immunoreactivity for HER^2/neu in 6 SSs as well as in 
other 197 mesenchymal tumors, with cytoplasmic reac- 
tivity observed only in 1 case of peripheral neuroepi- 
thelioma. A recent investigation reported gene ex- 
pression profiles of 41 soft tissue tumors with cDNA ' 
microarray analysis. Among these sarcomas, 6 
monophasic SSs were characterized by a unique expres- 
sion pattern of a cluster of 104 genes, induding the 
epidermal growth factor receptor, which shows 50% 
homology with the HER2/ neu gene. 24 These data also 
suggest that the erb-B receptor family plays a significant 
role in SS. It has been demonstrated that a variable 
number of osteosarcomas over express HER2/neu. JM5 
However, more recent studies 20 '^ 5 ' 26 were unable to 
detect any HER2/neu gene amplification and/or over- 
expression using fluorescence in situ hybridization 
(FISH), RT-PCR, and immunohistochemistry. 

Differences in the techniques used may play an 
important role and explain (at least in part) these 
discrepancies. HER2/neu alterations can be evaluated 
using different techniques including immunohisto- 
chemistry, FISH, Southern hybridizauon, Northern 
blot, and competitive, differential, or real-time PCR 27 
Immunohistochemistry is the most common method 
for detection of HER2/neu overexpression, but it is 
significantly affected by the sensitivity and specificity of 
the antibodies used, the type of tissue {frozen versus 
formaIir>fixed), and die various interpretative criteria 
and scoring systems used to evaluate cases. Indeed, 
most studies of HER2/neu expression in osteosarcoma 
used immunohistochemica] techniques, with different 
monoclonal or polyclonal antibodies. The discrepancy 
in results may stem from the use of different antibodies, 
as well as a lack of standardized evaluation. 

For these reasons, to evaluate H£R2/neu immu- 
noreactivity in our study, ive used a polyclonal antibody 
(Dako, Carpinteria, CA), arguably the most ditTuse and 
thoroughly tested antibody for HER2/neu assessment. 
Furthermore, we investigated HER2/ne\> mRNA ex- 
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pression with real-time RT-PCR, because it has been 
demonstrated that mRNA levels correlates tightly with 
protein expression. 28 At present, real-time RT-PCR 
probably represents the most powerful tool for quanti- 
tative analysis, because it allows better internal control 
and reduction of sample contamination, and provides 
more objective results. 18 

We analyzed HER2/neu gene expression at the 
mRNA and protein level in 13 cases of SS. HER2/neu 
expression was found in all of the cases investigated, 
and mRNA content in the tumors varied from 2.1 to 
24 n. . 

The variability of mRNA levels in SSs is reflected 
on heterogeneity of protein expression pattern as de- 
tected by immunohistochemistry- We found that 
HER2/neu immunbreactivity correlates strongly with 
mRNA levels. A convincing cytoplasmic immunoreac- 
tivity was documented with the polyclonal antibody in 
10 of 13 sarcomas. Distinct membranous staining was 
observed in 3 cases, although it was never comparable 
to the positive breast control. It was predominantly 
incomplete and identified in the epithelial/epithelioid . 
component of SS. Interestingly, all of the cases with 
strong cytoprasmic staining also exhibited a membrane- 
staining pattern. 

In breast cancers, a membranous pattern, of stain- 
ing is thought to be specific for HER2/neu protein 
expression and correlated with gene amplification, 
whereas cytoplasmic staining is usually considered non- 
specific. 29 However, cytoplasmic posirivity for HER2/ 
neu has been reported to be prognostically significant 
in other tumor rypes ; including bladder, colon, pan- 
creas, thyroid, and nasopharyngeal carcinomas, and 
even in breast cancer. 3 

Patients with high HER2/neu mRNA expression 
had a significantly lower risk of recurrence. Similarly, 
all of the cases wjih high HER2/neu expression did not 
metastasize, although this correlation did not reach 
statistical significance due to the small number of cases 
studied. These data suggest that HER2/neu plays a role 
in the biology of SS and thai HER2/neu overexprcssion 
may be linked to a less-aggressive clinical behavior. 
Indeed, unlike many cancers where HER2/neu overex- 
pression has been shown to correlate wiih poor prog- 
nosis, HER2/nen levels are linked to a more favorable 
clinical course in other malignant rumors, such as thy- 
roid carcinoma and osteosarcoma. 15 34 

The molecular mechanisms responsible for the 
action of HER-2/neu in SSs are unknown. A possible 
interaction between HER- 2/ neu and the other mem- 
bers of HER family could be important in tumoiigen : 
esis. Derangements of other oncogenes, tumor suppres- 
sors, and apoptosis regulators have been described in 
SSs. For instance, many SSs have he en shown to be 
diffusely positive for lxl-2 family proteins (be 1-2, bax, 
bcl-x, and bak). These members of the be 1-2 family rue 
involved in the regulation of apoptosis in SS. 36 This 
raises the hypo diesis that complex alterations in apop- 
tosis-c on trolling mechanisms nre present in these neo- 
plasms, with HER-2/neu interacting with lie 1-2 family 



members. Further studies, are needed to clarify the 
mechanisms of apoptosis in SS. 

Depending on the size and location, the therapy of 
choice for SS is radical local exdsion or amputation.. 
Whenever radical surgery cannot be_perfonned, radio- 
therapy in concert with local excision is suggested in an 
attempt to avoid the need for amputation. Only re- 
cently has a study evaluated the possible role of chemo- 
therapy in the treatment of SS. 

The role of molecular markers in predicting treat- 
ment responsiveness is currently the focus of extensive 
investigation. Breast cancer patients with high HER2/ 
neu expression .appeared to benefit from high-dose 
GAF (cyclophophamide, adriamycin, and 5-fluoroura- 
cil) therapy. 38 In our study, 3 of 4 patients with high 
HER2/neu expression received adjuvant chemother- 
apy with ifosfamide; these patients had a favorable clin- 
ical outcome. These data raise the possibility that 
HER2/heu may have value in predicting. which patients 
are likely to respond to specific adjuvant chemotherapy 
regimens. Whether the favorable significance of 
HER2/neu expression depends on predicting clinical 
recurrence, response to chemotherapy, or both re- 
mains to be fully elucidated in SS patients. » 

To the best of our knowledge, this is the first 
report that shows expression of HER2/neu in primary 
SS by reaJ-rime RT-PCR. Elevated levels of HER2/neu 
mRNA and protein are found in a significant group of 
SS patients, and these levels appear to correlate with 
features of good prognosis. Furthermore, our results 
suggest that this mechanism of disease in SS may be the 
target of specific inhibitory therapies based on human- 
ized monoclonal antibodies. Considering the small 
number of patients examined, further investigation is 
needed to confirm these preliminary findings. 
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Genome-wide Study of Gene Copy Numbers, 
Transcripts, and Protein Levels in Pairs of 
Non-invasive and Invasive Human Transitional 
Cell Carcinomas* 

Torben F. 0rntoftf§, Thomas ThykjaerU, Frederic M. Wa!dman||, Hans Wolf**, 
and Julio E. Celistt 



Gain and loss of chromosomal material is characteristic 
of bladder cancer, as well as malignant transformation in 
general. The consequences of these changes at both the 
transcription and translation levels is at present unknown 
partly because of technical limitations. Here we have at- 
tempted to address this question in pairs of non-invasive 
and invasive human bladder tumors using a combination 
of technology that included comparative genomic hybrid- 
ization, high density oligonucleotide array-based monitor- 
ing of transcript levels (5600 genes), and high resolution 
two-dimensional gel electrophoresis. The results showed 
that there is a gene dosage effect that in some cases 
superimposes oh other regulatory mechanisms. This ef-: 
feet depended (p < 0.015) on the magnitude of the com- 
parative genomic hybridization change. In general (18 of 
23 cases), chromosomal areas with more than 2-fold gain 
of DNA showed a corresponding increase in mRNA tran- 
scripts. Areas with loss of DNA, on the other hand, 
showed either reduced or unaltered transcript levels. Be- 
cause most proteins resolved by two-dimensional gels 
are unknown it was only possible to compare mRNA and 
protein alterations in relatively few cases of well focused 
abundant proteins. With few exceptions we found a good 
correlation (p < 0.005) between transcript alterations and 
protein levels. The implications, as well as limitations, 
of the approach are discussed. Molecular & Cellular 
Proteomics 1:37-45, 2002. 



Aneuploidy is a common feature of most human cancers 
(1), but little is known about the genome-wide effect of this 
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phenomenon at both the transcription and translation levels. 
High throughput array studies of the breast cancer cell line 
BT474 has suggested that there is a correlation between 
DNA copy numbers and gene expression in highly amplified 
areas (2), and studies of individual genes in solid tumors 
have revealed a good correlation between gene dose and 
mRNA or protein levels in the case of c-erb-B2, cyclin d1, 
ems 7, and N-myc (3-5). However, a high cyclin D1 protein 
expression has been observed without simultaneous am- 
plification (4), and a low level of c-myc copy number in- 
crease was observed without concomitant c-myc protein 
overexpression (6). 

In human bladder tumors, karyotyping, fluorescent in situ 
hybridization, and comparative genomic hybridization (CGH) 1 
have revealed chromosomal aberrations that seem to be 
characteristic of certain stages of disease progression. In the 
case of non-invasive pTa transitional cell carcinomas (TCCs), 
this includes loss of chromosome 9 or parts of it, as well as 
loss of Y in males. In minimally invasive pT1 TCCs, the fol- 
lowing alterations have been reported: 2q-. 11p-, 1q + , 
11qt3+, 17q+, and 20q+ (7-12). It has been suggested that 
these regions harbor tumor suppressor genes and onco- 
genes; however, the large chromosomal areas involved often 
contain many genes, making meaningful predictions of the 
functional consequences of losses and gains very difficult. 

In this investigation we have combined genome-wide tech- 
nology for detecting genomic gains and losses (CGH) with, 
gene expression profiling techniques (microarrays and pro- 
teomics) to determine the effect of gene copy number on 
transcript and protein levels in pairs of non-invasive and in- 
vasive human bladder TCCs. 

EXPERIMENTAL PROCEDURES 

Material— Bladder tumor biopsies were sampled after informed 
consent was obtained and alter removal of tissue for routine pathol- 
ogy examination. By light microscopy tumors 335 and 532 were 
staged by an experienced pathologist as pTa (superficial papillary), 

1 The abbreviations used are; CGH, comparative genomic hybrid : 
ization; TCC, transitional celt carcinoma; LOH, toss of heterozygosity; 
PA-FABP, psoriasis- associated fatty acid- binding protein; 2D, 
two-dimensional. 
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Fig. 1. DNA copy number and mRNA expression level. Shown from left to right are chromosome (Chr.), CGH profiles, gene location and 
expression level of specific genes, and overall expression level along the chromosome. A, expression of mRNA in invasive tumor 733 as 
compared with the non-invasive counterpart tumor 335. B, expression of mRNA in invasive tumor 827 compared with the non-invasive 
counterpart tumor 532. The average fluorescent signal ratio between tumor DNA and normal DNA is shown along the length of the chromosome 
{left). The bold curve in the ratio profile represents a mean of tour chromosomes and is surrounded by thin curves indicating one standard 
deviation. The central vertical line (broken) indicates a ratio value ol 1 (no change), and the vertical lines next to it {dotted) indicate a ratio of 
0.5 (/eft) and 2.0 (right). In chromosomes where the non-invasive tumor 335 used for comparison showed alterations in DNA content, the ratio 
prpfile of that chromosome is shown to the right ol the invasive tumor profile. The colored bars represents one gene each, identified by the 
running numbers above the bars (the name of the gene can be seen at www.MDL.OK/sdata.htmi). The bars indicate the purported location of 
the gene, and the colors indicate the expression level of the gene in the invasive tumor compared with the non-invasive counterpart; > 2- fold 
increase (black), >2-fold decrease (6/ue), no significant change (orange). The bar to the far right, entitled Expression shows the resulting change 
in expression along the chromosome; the colors indicate that at least half of the genes were up-regulated (black), at least half of the genes 
down-regulated (b/ue), or more than half of the genes are unchanged (orange). It a gene was absent in one of the samples and present in 
another, it was regarded as more than a 2- fold change. A 2- fold level was chosen as this corresponded to one standard deviation in a double 
determination of -1800 genes. Centromeres and heterochromatic regions were excluded from data analysis. 



grade I and II, respectively, tumors 733 and 827 were staged as pT1 
(invasive into submucosa), 733 was staged as solid, and 827 was 
staged as papillary, both grade III. 

mRNA Preparation— Tissue biopsies, obtained fresh from surgery, 
were embedded immediately in a sodium-guanidinium thiocyanate 
solution and stored at -80"C. Total RNA was isolated using the 

, RNAzol B RNA isolation method (WAK-Chemie Medical GMBH). 
poly(A)* RNA was isolated by an oligo(dT) selection step (Oligotex 

. mRNA kit; Oiagen). 

cfiNA Preparation— 1 ol mRNA was used as starting material. 
The .first and second strand cDNA synthesis was performed using the 
Superscript® choice system (Invitrogen) according to Ihe manufac- 
turer's instructions but using an oligo(dT) primer containing a T7 RNA 
polymerase binding site. Labeled cRNA was prepared using the ME- 
GAscrip® in vitro transcription Kit (Ambion). Biot in -labeled CTP and 



UTP (Enzo) was used, together with unlabeled NTPs in the reaction. 
Following the in vitro transcription reaction, the unincorporated nu- 
cleotides were removed using RNeasy -columns (Oiagen). 

Array Hybridization and Scanning — Ar ray hybridization and scan- 
ning was modified from a previous method (13). 10 /ig ol cRNA was 
fragmented at 94 **C for 35 min in buffer containing 40 mM Tris 
acetate, pH 8.1, 100 mM KOAc, 30 mM MgOAc. Prior to hybridization, 
the fragmented cRNA in a 6x SSPE-T hybridization buffer (1 m NaCI, 
10 mw Tris, pH 7.6, 0.005% Triton), was heated to 95 °C for 5 min, 
subsequently cooled to 40 °C, and loaded onto the Affymetrix' probe 
array cartridge. The probe array was then incubated for 16 h at 40 C C 
at constant rotation (60 rpm). The probe array was exposed to 10 
washes in 6x SSPE-T at 25 "C followed by 4 washes in 0.5 x SSPE-T 
at 50 °C. The btotinylated cRNA was stained with a streptavidin- 
phycoerylbrin conjugate, 10 ^g/m\ (Molecular Probes) in 6x SSPE-T 
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Fig. 1— continued 



for 30 min at 25 °C followed by 10 washes in 6x SSPE-T at 25 °C. The 
probe arrays were scanned at 560 nm using a conlocal laser scanning 
microscope (made lor Alfymetrix by Hewlett-Packard). The readings 
irom the quantitative scanning were analyzed by Alfymetrix gene 
expression analysis software. 

Microsatellite Analysis— Micros at ellite Analysis was performed as 
described previously (14). Microsatellites were selected by use of 
www.ncbi.nlm.nih.gov/genemap98, and primer sequences were ob- 
tained from the genome data base at www.gdb.org. DNA was extracted 
from tumor and blood and amplified by PCR in a volume of 20 for 35 
cycles. The ampficons were denatured and electrophoresed for 3 h in an 
ABI Prism 377. Data were coRected in the Gene Scan program for 
fragment analysis. Loss of heterozygosity was defined as less than 33% 
of one allele detected in tumor amplicons compared with blood. 

Proteomic Analysis— YcCs were minced into small pieces and 
homogenized in a small glass homogenizer in 0.5 ml of lysis solution. 
Samples were slored at -20°C until use. The procedure for 2D gel 
electrophoresis has been described in detail elsewhere (15. 16). Gets 
were stained with silver nitrate and/or Coomassie Brilliant Blue. Pro- 
teins were identified by a combination of procedures lhat included 
microsequencing, mass spectrometry, two-dimensional gel Western 
immunoblotting, and comparison with the masler two-dimensional gel 
image of human keratinocyte proteins; see biobase.dk/cgi-bin/celis. 

CGH— Hybridization ot differentially labeled tumor and normal DNA 
\o normal metaphase chromosomes was performed as described 
previously (10). Fluorescein- labeled lumor DNA (200 ng), Texas Red- 



labeled reference DNA (200 ng), and human Got-1 DNA (20 ^g) were 
denatured at 37 °C for 5 min and applied lo denatured normal met- 
aphase slides. Hybridizalion was at 37 °C for 2 days. After washing, 
(he slides were count erst a ined with 0.15 MO^ml 4,6-diamkJino-2-phe- 
nylindole in an anti-fade solution. A second hybridization was per- 
formed for all tumor samples using fluorescein- labeled reference DNA 
and Texas Red-labeled tumor DNA (inverse labeling) lo confirm the 
aberrations delected during the initial hybridization. Each CGH ex- 
periment also included a normal control hybridization using fluores- 
cein- and Texas Red- labeled normal DNA. Digital image analysis was 
used lo identify chromosomal regions with abnormal fluorescence 
ratios, indicating regions o( DNA gains and losses. The average 
green:red fluorescence intensity ratio profiles were calculated using 
tour images of each chromosome (eight chromosomes total) with 
normalization of the greenrred fluorescence intensity ratio for the 
entire metaphase and background correction. Chromosome identifi- 
cation was performed based on 4,6-diamidino-2-phenylindole band- 
ing patterns. Only images showing uniform high intensity fluores- 
cence with minimal background staining were analyzed. All 
centromeres, p arms of acrocentric chromosomes, and heterochro- 
malic regions were excluded from the analysis. 

RESULTS 

Comparative Genomic Hybridization— The CGH analysis 
identified a number of chromosomal gains and losses in the 
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Table I 

. Correlation between alterations detected by CGH and by expression monitoring 
Top. CGH used as independent variable (if CGH alteration - what expression ratio was found); bottom, altered expression used as 
independent variable {if expression alteration - what CGH deviation was found). • ■ ' 



CGH alterations 



Tumor 733 vs. 335 
Expression change.clusters 



Concordance 



CGH alterations 



Tumor 827 vs. 532 
Expression change clusters 



Concordance 



13 Gain 



10 Loss 



10 Up-regulation 

0 Down- regulation 

3 No change 

1 Up-regulation 

5 Down-regulation 

4 No change 



77% 



50% 



10 Gain 



12 Loss 



8 Up-regulation 
0 Down-regulation 

2 No change 

3 Up-regulation 
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7 No change 



80% 



17% 
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CGH alterations 
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16 Up-regulation 
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17 Up-regulation 


10 Gain 
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3 No change 






2 No change 


33% 


21 Down-regulalion 


1 Gain 


38% 


9 Down-regulation 
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3 Loss 






12 No change 






6 No change 


81% 


15 No change 


3 Gain 


60% 


21 No change 


1 Gain 


3 Loss 

9 No change 
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17 No change 





two invasive tumors (stage pT1 , TCCs 733 and 827), whereas 
. the two non- invasive papillomas (stage pTa, TCCs 335 and 
532) showed onfy 9p-, 9q22-q33-, and X-, and 7-f ; 9q-, 
and Y-, respectively. Both invasive tumors showed changes 
(1q22-24 + , 2q14.1-qter- t 3q12-q13.3-, 6q12-q22-, 
9q34 + , 11q12-q13+; 17 + , and 20q11.2-q12+) that are typ- 
ical for their disease stage, as well as additional alterations, 
some of which are shown in Fig. 1. Areas with gains and 
losses deviated from the normal copy number to some extent, 
and the average numerical deviation from normal was 0.4-fold 
in the case of TCC 733 and 0.3-fold for TCC 827. The largest 
changes, amounting to at least a doubling of chromosomal 
content, were observed at 1q23 in TCC 733 (Fig. 1A) and 
20q12inTCC827(Fig. 1B). 

mRNA Expression in Relation to DNA Copy Number —The 
mRNA levels from the two invasive tumors (TCCs 827 and 
733) were compared with the two non-invasive counterparts 
(TCCs 532 and 335). This was done in two separate experi- 
ments in which we compared TCCs 733 to 335 and 827 to 
532, respectively, using two different scaling settings for the 
arrays to rule out scaling as a confounding parameter. Ap- 
proximately 1,800 genes that yielded a signal on the arrays 
were searched in the Unigene and Genemap data bases for 
chromosomal location, and those with a known location 
(1096) were plotted as bars covering their purported locus. In 
that way it was possible to construct a graphic presentation of 
DNA copy number and relative mRNA levels along the indi- 
vidual chromosomes (Fig. 1). 

For each mRNA a ratio was calculated between the level in 
the invasive versus the non- invasive counterpart. Bars, which 
represent chromosomal location of a gene, were color-coded 
according to the expression ratio, and only differences larger 



than 2-fOld were regarded as informative (Fig. 1). The density 
of genes along the chromosomes varied, and areas contain- 
ing only one gene were excluded from the calculations. The 
resolution of the CGH method is very low, and some of the 
outlier data may be because of the fact that the boundaries of 
the chromosomal aberrations are not known at high resolution. 

Two sets of calculations were made from the data. For the 
first set we used CGH alterations as the independent variable 
and estimated the frequency of expression alterations in these 
chromosomal areas. In general, areas with- a strong gain of 
chromosomal material contained a cluster of genes having 
increased mRNA expression. For example, both chromo- 
somes 1q21-q25, 2p and 9q, showed a relative gain of more 
than 100% in DNA copy number that was accompanied by 
increased mRNA expression levels in the two tumor pairs (Fig. 
1). In most cases, chromosomal gains detected by CGH were 
accompanied by an increased level of transcripts in both 
TCCs 733 (77%) and 827 (80%) (Table I, top). Chromosomal 
losses, on the other hand, were not accompanied by de- 
creased expression in several cases, and were often regis- 
tered as having unaltered RNA levels (T able I, fop). The inabil- 
ity to detect RNA expression changes in these cases was not 
because of fewer genes mapping to the lost regions (data not 
shown). 

In the second set of calculations we selected expression 
alterations above 2- fold as the independent variable and es- 
timated the frequency of CGH alterations in these areas. As 
above, we found that increased transcript expression corre- 
lated with gain of chromosomal material (TCC 733, 69% and 
TCC 827, 59%), whereas reduced expression was often de- 
tected in areas with unaltered CGH ratios (Table I, bottom). 
Furthermore, as a control we looked at areas with no alter- 
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FiG.. 2. Correlation between maximum CGH aberration and the ability to detect expression change by oligonucleotide array 
monitoring. The aberration is shown as a numerical -fold change in ratio between invasive tumors 827 (A) and 733 (♦ } and their non-invasive 
counterparts 532 and 335^ The expression change was taken from the Expression line to the right in Fig. 1, which depicts the resulting 
expression change lor a given chromosomal region. At least half of the mRNAs from a given region have to be either up- or down-regulated 
to be scored as an expression change. All chromosomal arms in which the CGH ratio plus or minus one standard deviation was outside the 
ratio value of one were included. 



ation in expression. No alteration was detected by CGH in 
most of these areas (TCC 733, 60% and TCC 827, 81%; see 
Table I, bottom). Because the ability to observe reduced or 
increased mRNA expression clustering to a certain chromo- 
somal area clearly reflected the extent of copy number 
changes, we plotted the maximum CGH aberrations in the 
regions showing CGH changes against the ability to detect a 
change in mRNA expression as monitored by the oligonucleo- 
tide arrays (Fig. 2). For both tumors TCC 733 (p.< 0.015) and 
TCC 827 (p < 0.00003) a highly significant correlation was 
observed between the level of CGH ratio change (reflecting 
the QNA copy number) and alterations detected by the array 
based technology (Fig. 2). Similar data were obtained when 
areas with altered expression were used as independent vari- 
ables. These areas correlated best with CGH when the CGH 
ratio deviated 1.6- to 2.0-fold (Table I, bottom) but mostly did 
not at lower CGH deviations. These data probably reflect that 
loss of an allele may only lead to a 50% reduction in expres- 
sion level, which is at the cut-off point for detection of expres- 
sion alterations. Gain of chromosomal material can occur to a 
. much larger extent. 

Microsatellite- based Detection of Minor Areas of Loss- 
es—in TCC 733, several chromosomal areas exhibiting DNA 
amplification were preceded or followed by areas with a nor- 
mal CGH but reduced mRNA expression (see Fig. 1, TCC 733 
chromosome 1q32, 2p21, and 7q21*and q32, 9q34, and 
10q22). To determine whether these results were because of 
undetected loss of chromosomal material in these regions or 



because of other non-structural mechanisms regulating tran- 
scription, we examined two microsatellites positioned at chro- 
mosome 1q25-32 and two at chromosome 2p22. Loss of 
heterozygosity (LOH) was found at both 1q25 and at 2p22 
indicating that minor deleted areas were not detected with the 
resolution of CGH (Fig. 3). Additionally, chromosome 2p in 
TCC 733 showed a CGH pattern of gain/no change/gain of 
DNA that correlated with transcript increase/decrease/in- 
crease. Thus, for the areas showing increased expression 
there was a correlation with the DNA copy number alterations 
(Fig. 1/4). As indicated above, the mRNA decrease observed in 
the middle of the chromosomal gain was because of LOH, 
implying that one of the mechanisms for mRNA down-regu- 
lation may be regions that have undergone smaller losses of 
chromosomal material. However, this cannot be detected with 
the resolution of the CGH method. 

In both TCC 733 and TCC 827, the telomeric end of chro- 
mosome 11 p showed a normal ratio in the CGH analysis; 
however, clusters of five and three genes, respectively, lost 
their expression. Two microsatellites (D1 1 S1 760, D1 1 S922) 
positioned close to MUC2, IGF2, and cathepsin D indicated 
LOH as the most likely mechanism behind the loss of expres- 
sion (data not shown). 

A reduced expression of mRNA observed in TCC 733 at 
chromosomes 3q24, 11p11, 12p12.2, 12q21.1, and 16q24 
and in TCC 827 at chromosome 11p15.5, 12p11, 15q11.2, 
and 18q12 was also examined for chromosomal losses using 
microsatellites positioned as close as possible to the gene loci 
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Fig. 3. Microsatellite analysis of loss of heterozygosity. Tumor 
733 showing loss of heterozygosity at chromosome 1q25, detected 
(a) by D1S215 close to Hu class I histocompatibility antigen (gene 
number 38 in Fig. 1), (6) by D1S2735 close to cathepsin E (gene 
number 41 in Fig. 1), and (c) at chromosome 2p23 by D2S2251 close 
to general ^-spectrin (gene number 11 on Fig. 1} and of (d) lumor 827 
showing loss of heterozygosity at chromosome 18q12 by S18S1118 
close to mitochondrial 3-oxoacyf-coenzyme A thiolase (gene number 
12 in Fig*. 1). The upper curves show (he electropherogram obtained 
from normal DNA from leukocytes (N), and the tower curves show the 
eleclropherogram trom lumor ONA (7). In all cases one allele is 
partially losl in the tumor amplicon. 

showing reduced mRNA transcripts. Only the microsatellite 
positioned at 18q12 showed LOH (Fig. 3), suggesting that 
transcriptional down-regulation of genes in the other regions 
may be controlled by other mechanisms. 

Relation between Changes in mRNA and Protein Levels — 
2D-PAGE analysis, in combination with Coomassie Brilliant 
Blue and/or silver staining, was carried out on all tour tumors 
using fresh biopsy material. 40 well resolved abundant known 
proteins migrating in areas away from the edges of the pH 
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Fig. 4. . Correlation between protein levels as judged by 20- 
PAGE and transcript ratio. For comparison proteins were divided in 
three groups, unaltered in level or up- or down-regulated (honzon/a/ 
axis). The mRNA ratio as determined by oligonucleotide arrays was 
plotted for each gene (vertical axis). A, mRNAs that were scored as 
present in both tumors used for the ratio calculation; A, mRNAs that 
were scored as absent in the invasive tuniors (along horizontal axis) or 
as absent in non-invasive reference (top of Figure). Two different 
scalings were used to exclude scaling as a confounder, TCCs 827 
and 532 (A A) were scaled with background suppression, and TCCs 
733 and 335 (#0) were scaled without suppression. Both compari- 
sons showed highly significant (p < 0.005) differences in mRNA ratios 
between the groups. Proteins shown were as follows: Group A (from 
left), phosphoglucomutase 1, glutathione transferase class fx number 
4, fatty acid-binding protein homologue,. cylokeratin 15, and cylo- 
keratin 13; 0 (from /eft), fatty acid- binding protein homologue, 28-kDa 
heat shock protein, cylokeratin 1 3, and calcyclin; C (from left), o-eno- 
lase, hnRNP B1, 28-kDa heat shock protein, 14-3-3-€, and 
pre-mRNA splicing (actor; D, mesothelial keratin K7 (type II); E {trom 
top), glutathione S-transf erase- ti and mesothelial keratin K7.(type II); 
F(from fop arid /eft), adenylyl cyclase- associated protein, E-cadherin, 
keratin 19, calgtZ2arin, phosphoglycerate mutase, annexin IV, cy- 
toskeletal 7-actin, hnRNP A1, integral membrane protein calnexin 
(IP90), hnRNP H, brain-type clathrin light chain-a, hnRNP F, 70-kDa 
heat shock protein, heterogeneous nuclear ribonucleoprotein A/B, 
translationally controlled tumor protein, liver glyceraldehyde-3-phos- 
phate dehydrogenase, keratin 8, aldehyde reductase, and Na.K- 
ATPase 0-1 subunit; G, (from top and left), TCP20, calgizzarin, 70- 
kDa heal shock protein, calnexin, hnRNP H, cylokeratin 15, ATP 
synthase, keratin 19, triosephosphate isomerase, hnRNP F, liver glyc- 
eraldehyde-3- phosphatase dehydrogenase, glutathione S-transfer- 
ase--<x, and keratin 8; H (from left), plasma gelsolin, autoantigen cal- 
reticulin, thioredoxin, and NAD+-:dependent 15 hydroxyprostaglandin 
dehydrogenase; / (from fop), prolyl 4-hydroxylase /3-subunit, cylo- 
keratin 20, cylokeratin 17, prohibition, and fructose 1,6-biphos- 
phatase; J annexin II; K, annexin IV; L (from fop and left), 90-kDa heat 
shock protein, prolyl 4-hydroxylase /3-subunit, o-enolase, GRP 78, 
cyclophilin, and cofilin. 

gradient, and having a known chromosomal location, were 
selected for analysis in the TCC pair 827/532. Proteins were 
identified by a combination of methods (see "Experimental 
Procedures"). In general there was a highly significant corre- 
lation (p < 0.005) between mRNA and protein alterations (Fig. 
4). Only one gene showed disagreement between transcript 
alteration and protein alteration. Except for a group of cyto- 
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keratins encoded by genes on chromosome 17 (Fig. 5) the 
analyzed proteins did not belong to a particular family. 26 well 
focused proteins whose genes had a know chromosomal 
location were detected in TCCs 733 and 335, and of these 19 
correlated (p < 0.005) with the mRNA changes detected using 
the arrays (Fig. 4). For example, PA-FABP was highly ex- 
pressed in the non-invasive TCC 335 but lost in the invasive 
counterpart (TCC 733;. see Fig. 5). The smaller number of 
proteins detected in both 733 and 335 was because of the 
smaller size of the biopsies that were available. 

1 1 chromosomal regions where CGH showed aberrations 
that corresponded to the changes in transcript levels also 
showed corresponding changes in the protein level (Table II). 
These regions included genes that encode proteins that are 
found to be frequently altered in bladder cancer, namely 
cytokeratins 17 and 20, annexins II and IV, and the fatty 
acid-binding proteins PA-FABP and FBP1. Four of these pro- 
teins were encoded by genes in chromosome 17q, a fre- 
quently amplified chromosomal area in invasive bladder 
cancers. 

DISCUSSION 

Most human cancers have abnormal DNA content, having 
lost some chromosomal parts and gained others. The present 
study provides some evidence.as to the effect of these gains 
and losses on gene, expression in two pairs of non-invasive 
and invasive TCCs using high throughput expression arrays 
and proteomics, in combination with CGH. In general, the 
results showed that there is a clear individual regulation of the 
mRNA expression of single genes, which in some cases was 
superimposed by a DNA copy number effect. In most cases, 
genes located in chromosomal areas with gains often exhib- 
ited increased mRNA expression, whereas areas showing 
losses showed either no change or a reduced mRNA expres- 
sion. The latter might be because of the fact that losses most 
often are restricted to loss of one allele, and the cut-off point 
for detection of expression alterations was a 2-fold change, 
thus being at the border of detection. In several cases, how- 
Table K 



Proteins whose expression level correlates with both mRNA and gene dose changes 


Protein 


Chromosomal location 


Tumor TCC 


CGH alteration 


Transcripl alleration" 


Protein alleration 


Annexin II 


1q21 


733 


Gain 


Abs to Pres" 


, Increase 


Annexin IV 


2pl3 


733 


Gain 


3.9-Fold up 


Increase 


Cytokeratin 17 


17q12-q21 


827 


Gain 


3. 8- Fold up 


Increase 


Cytokeratin 20 


17q21.1 


827 


Gain 


5.6-Fold up 


Increase 


(PA-)FABP 


8q21.2 


827 


Loss 


10-Fold down 


Decrease 


FBP1 


9q22 


827 


Gain 


2.3-Fold up 


Increase 


Plasma gelsolin 


9q31 


827 


Gain 


Abs to Pres 


Increase 


Heat shock protein 28 


15q12-q13 


827 


Loss 


2.5-Fold up 


Decrease 


Prohibiten 


17q21 


827/733 


Gain 


3.7-/2.5-Fo!d up° 


Increase 


Prolyl- 4 -hydroxy! 


17q25 


827/733 


Gain 


5.7-/1.6-Fo!d up 


Increase 


hnRNPBt. 


7pt5 


827 


Loss 


- 2.5- Fold down 


Decrease 



3 Abs, absent; Pres, present. 

6 tn cases where the corresponding altera lions were found in both TCCs 827 and 733 Ihese are shown as 827/733. 




Fig. 5. Comparison of protein and transcript levels in invasive 
and non-invasive TCCs. The upper pari ol the figure shows a 2D gel 
{left) and the oligonucleotide array {right) of TCC 532. The red rectan- 
gles on the upper gel highlight the areas that are compared below. 
Identical areas of 2D gets of TCCs 532 and 827 are shown below. 
Clearly, cytokeratins 13 and 15 are strongly down-regulated in TCC 
827. {red annotation). The tile on the array containing probes for 
cytokeratin 15 is enlarged below the array {red arrow) from TCC 532 
and is compared with TCC 827. The upper row ol squares in each tite 
corresponds to perfect match probes; the lower row corresponds to 
mismatch probes containing a mutation {used for correction lor un- 
specific binding). Absence of signal is depicted as- black, and the 
higher the signal the lighter the color. A high transcript level was 
detected in TCC 532 (6151 units) whereas a much lower level was 
detected in TCC 827 (absence of signals). For cytokeratin 13, a high 
transcript level was also present in TCC 532 (15659 units), and a 
much lower level was present in TCC 827 (623 units). The 2D gels at 
the boitom of the figure {left) show levels ol PA-FABP and adipocyte- 
FABP in TCCs 335 and 733 (invasive), respectively- Both proteins are 
down-regulated in the invasive tumor. To the right we show the array 
tiles for the PA-FABP transcript. A medium transcript level was de- 
tected in the case of TCC 335 (1277 units) whereas very low levels 
were detected in TCC 733 (166 units). 1EF. isoelectric focusing. 
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ever, an increase or decrease in DNA copy number was 
associated with de novo occurrence or complete loss of tran- 
script, respectively- Some of these, transcripts could not be 
detected in the non-invasive tumor but were present at rela- 
tively high levels in areas with DNA amplifications in the inva- 
sive tumors {e.g. in TCC 733 transcript from cellular ligand of 
annexin II gene (chromosome tq21) from absent to 2670 
arbitrary units; in TCC 827 transcript from small protine-rich 
protein 1 gene (chromosome 1q12-q21.1) from absent to 
1326 arbitrary units). It may be anticipated from these data 
that significant clustering of genes with an increased expres- 
sion to a certain chromosomal area indicates an increased 
likelihood of gain of chromosomal material in this area. 

Considering the many possible regulatory mechanisms act- 
ing at the level of transcription, it seems striking that the gene 
dose effects were so clearly detectable in gained areas. One 
hypothetical explanation may lie in the loss of controlled 
methytation in tumor cells (17-19). Thus, it may be possible 
that in chromosomes with increased DNA copy numbers two 
• or more alleles could be demettiylated simultaneously leading 
to a higher transcription level, whereas in chromosomes with 
losses the remaining allele could be partly methylated, turning 
. off the process (20, 21). A recent report has documented a 
ploidy regulation of gene expression in yeast, but in this case all 
the genes were present in the same ratio (22), a situation that is 
not analogous to that of cancer cells, which show marked 
chromosomal aberrations, as well as gene dosage effects. 

Several CGH studies of bladder cancer have shown that 
some chromosomal aberrations are common at certain 
stages of disease progression, often occurring in more than 1 
of 3 tumors. In pTa tumors, these include 9p-, 9q-, 1q+, Y- 
(2, 6), and in pTt tumors, 2q-,11p~, 11q-, 1q+, 5p+, 8q+, 
17q+, and 20q+ (2-4, 6, 7). The pTa tumors studied here 
showed similar aberrations such as 9p- and 9q22-q33- and 
9q- and Y-, respectively. Likewise, the two minimal invasive 
pT1 tumors showed aberrations that are commonly seen at 
that stage, and TCC 827 had a remarkable resemblance to the 
commonly seen pattern of losses and gains, such as 1q22-24 
amplification (seen in both tumors), 1 1q14-q22 loss, the tatter 
often linked to 17 q+ (both tumors), and 1q+ and 9p-, often 
linked to 20q+ and 11 q13+ (both tumors) (7-9). These ob- 
servations indicate that the pairs of tumors used in this study 
exhibit chromosomal changes observed in many tumors, and 
therefore the findings could be of general importance for 
bladder cancer. 

Considering that the mapping resolution of CGH is of about 
20 megabases it is only possible to get a crude picture of 
chromosomal instability using this technique. Occasionally, 
we observed reduced transcript levels close to or inside re- 
gions with increased copy numbers: Analysis of these regions 
by positioning heterozygous rnicrosatellites as close as pos- 
sible to the locus showing reduced gene expression revealed 
loss ot heterozygosity in several cases. It seems likely that 
multiple and different events occur along each chromosomal 



arm and that the use of cDNA microarrays for analysis of DNA 
copy number changes will reach a resolution that can resolve 
these changes, as has recently been proposed (2). The outlier 
data were not more frequent at the boundaries of the CGH 
aberrations. At present we do not know the mechanism be- 
hind chromosomal aneuploidy and cannot predict whether 
chromosomal gains will be transcribed to a larger extent than 
the two native alleles. A mechanism as genetic imprinting has 
an impact on the expression level in normal celts and is often 
reduced in tumors. However, the relation between imprinting 
and gain of chromosomal material is not known. 

We regard it as a strength of this investigation that we were 
able to compare invasive tumors to benign tumors rather than 
to normal urothelium, as the tumors studied were biologically 
very close and probably may represent successive steps in 
the progression of bladder cancer. Despite the limited amount 
of fresh tissue available it was possible to apply three different 
state of the art methods. The observed correlation between 
DNA copy number and mRNA expression is remarkable when 
one considers that different pieces of the tumor biopsies were 
used for the different sets of experiments. This indicate that 
bladder tumors are relatively homogenous, a notion recently 
supported by CGH and LOH data that showed a remarkable 
similarity even between tumors and distant metastasis (10, 23). 

In the few cases analyzed, mRNA and protein levels 
showed a striking correspondence although in some cases 
we found discrepancies that may be attributed to translations! 
regulation, post-translational processing, protein degrada- 
tion, or a combination of these. Some transcripts belong to 
undertranslated mRNA pools, which are associated with few 
translationalty inactive ribosomes; these pools, however, 
seem to be rare (24). Protein degradation, for example, may 
be very important in the case of polypeptides with a short 
half-life (e.g. signaling proteins). A poor correlation between 
mRNA and protein levels was found in liver cells, as deter- 
mined by arrays and 2D- PAGE (25), and a moderate correla- 
tion was recently reported by Ideker ef a/. (26) in yeast. 

Interestingly, our study revealed a much better correlation 
between gained chromosomal areas and increased mRNA 
levels than between loss of. chromosomal areas and reduced 
mRNA levels. In general, the level of CGH change determined 
the ability to detect a change in transcript. One possible 
explanation could be that by losing one allele the chanc-e in 
mRNA level is not so dramatic as compared with gain of 
material, which can be rather unlimited and may lead to a 
severalfold increase in gene copy number resulting in a much 
higher impact on transcript level. The latter would be much 
easier to detect on the expression arrays as the cut-off point 
was placed at a 2- fold level so as not to be biased by noise on 
the array. Construction of arrays with a better signal to noise 
ratio may in the future allow detection ot lesser than 2-fold 
alterations in transcript levels, a feature that may facilitate the 
analysis of the effect of loss of chromosomal areas on tran- 
script levels. 
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In eleven cases we found a significant correlation between 
DNA copy number, mRNA expression, and protein level. Four 
of these proteins were encoded by genes located at a fre- 
quently amplified area in chromosome 17q. Whether DNA 
copy number is one of the mechanisms behind alteration of 
these eleven proteins is at present unknown and will have to 
be proved by other methods using a larger number of sam- 
ples. One factor making such studies complicated is the large 
extent of protein modification that occurs after translation, 
requiring immunoidentification and/or mass spectrometry to 
correctly identify the proteins in the gels. 

In conclusion, the results presented in this study exemplify 
the large body of knowledge that may be possible to gather in 
the future by combining state of the art techniques that follow 
the pathway from DNA to protein (26). Here, we used a tradi- 
tional chromosomal CGH method, but in the future high reso- 
lution CGH based on microarrays with many thousand radiation 
hybrid-mapped genes will increase the resolution and informa- 
tion derived from these types of experiments (2). Combined with 
expression arrays analyzing transcripts derived.from genes with 
known locations, and 2D gel analysis to obtain information at 
the post-translational level, a clearer and more developed un- 
derstanding of the tumor genome wilt be forthcoming. 
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Expression of bcr-abl niRNA in individual chronic myelogenous leukaemia 
cells as determined by in situ amplification, 

PachmannlC Zhao S , Schcnk T . Kantarjian II El-Naggar AIC Sicitiano MJ Guo 
JQ, Arlinghaus RB , Audreeff M . 

The University of Texas M.D. Anderson Cancer Center, Department of Molecular 
Haematology and Therapy, 1515 Holcombe Boulevard, Houston, TX 77030, USA. 

We present the results of a novel method developed for evaluation of in situ 
amplification, a molecular genetic method at the cellular level. Reverse transcription 
polymerase chain reaction (RT-PCR) was used to study bcr-abl transcript levels in 
individual cells from patients with chronic myelogenous leukaemia (CML). After 
hybridizing a fluorochrome-Iabelled probe to the cell-bound RT-PCR product, bcr-abl 
raRNA-positive cells were determined using image analysis. A dilution series of bcr-abl- 
positive BV173 into normal cells showed a good correlation between expected and actual 
values. In 25 CML samples, the percentage of in situ PCR-positive cells showed an 
excellent correlation with cytogenetic results (r = 0.94, P < 0:0001), interphase 
fluorescence in situ hybridization (FISH) (r - 0.95, P = 0.001) and hypermetaphase FiSH 
(r= 0.81, P < 0.001). The fluorescence intensity was higher in residual CML cells after 
interferon (IFN) treatment than in newly diagnosed patients (P = 0.004), and was highest 
in late-stage CML resistant to IFN therapy and lowest in CML blast crisis (P = 0.00 1). 
Mean fluorescence values correlated with bcr-abl protein levels, as determined by 
Western blot analysts (r = 0.62). Laser scanning cytometry allowing automated analysis 
of large numbers of cells confirmed the results. Thus, fluorescence in situ PCR provides a 
novel and quantitative approach for monitoring tumour load and bcr-abl transcript levels 
in CML. 
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Correlative iiiimunohistoclieiuical and reverse transcriptase polymerase 
chain reactiou analysis of somatostatin receptor type 2 in neuroendocrine 
tumors of the lung. 

Papotti M , Croce S , Macri L , Funaro A , Pccchioni C , Schindler M , Dussolati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Italy. 7 

Somatostatin receptors type 2 (sst2) have been frequently detected in neuroendocrine 
rumors and bind somatostatin analogues, such as octreotide, with high affinity. Receptor 
autoradiography, specific mRNA detection and, more recently, antisst2 polyclonal 
antibodies are currently employed to reveal sst2. The aim of the present study was to 
investigate by three different techniques the presence of sst2 in a series of 26 
neuroendocrine tumors of the lung in which fresh frozen tissue and paraffin sections were 
available- It was possible, therefore, to compare, in individual cases, RNA analysis 
studied by reverse transcriptase polymerase chain reaction (RT-PCR), in situ 
hybridization (ISH), and immunohistochemistry. A series of 20 nonneuroendocrine lung 
carcinoma samples served as controls. RT-PCR was positive for sst2 in 22 of 26 samples, 
including 15 of 15 typical carcinoids, 5 of 6 atypical carcinoids, and 2 of 5 small-cell 
carcinomas- The sst2 mRNA signal obtained by RT-PCR was strong in the majority 
(87%) of typical carcinoids and of variable intensity in atypical carcinoids and small-cell 
carcinomas. A weakly positive signal was observed in 5 of 20 control samples. In 
immunohistochemistry, two different antibodies (ariti-sst2) were employed, including a 
monoclonal antibody, generated indie Department of Pathology, University of Turin. In 
the majority of samples a good correlation between sst2 mRNA (as detected by RT-PCR) 
and sst2 protein expression (as detected by immunohistochemistry) was observed. 
However, one atypical carcinoid and one small-cell carcinoma had focal immunostaining 
but no RT-PCR signal. ISH performed in selected samples paralleled the results obtained 
with the other techniques. A low sst2 expression was associated with high grade 
neuroendocrine tumors and with aggressive behavior. It is concluded that 1) 
neuroendocrine tumors of the lung express sst2, and there is a correlation between the 
mRNA amount and the degree of differentiation; 2) immunohistochemistry and ISH are 
reliable tools to demonstrate sst2 in these tumors; and 3) ssl2 identification in tissue 
sections may provide information on the diagnostic or therapeutic usefulness of 
somatostatin analogues in individual patients with neuroendocrine tumors. 
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Correlative Inamunbhistochemical and Reverse 
Transcriptase Polymerase Chain Reaction Analysis of 
Somatostatin Receptor Type 2 in Neuroendocrine . 
; Tumors of the Lung 

T^M auiu Pap o tti, mjx, S 3brira£roce;-MTl^ 

; Aola Fuiiaro, Ph.D:, Carla Pecchioni, Marcus Schindler, M.D., and 

7* Gianni Bussolati, m.d., RR.c.Path. * 



Somatostatin receptors type 2 (sst2) have been frequently de- 
" tected in neuroendocrine tumors and bind somatostatin ana- 
- logues, such as octreotide, with high affinity. Receptor autora- 
" diogiaphy, specific mRNA detection and, more recently* anri- 
1 sst2 polyclonal antibodies arc currently employed to reveal 

• sst2rThe aim of the present study was to investigate by three 
: differ ent techniques the presence of sst2 in a series of 26 neu- 

• roendocrine tumors of the Jung in which fresh frozen tissue aDd 
paraffin sections were available. It was possible, therefore, to 
compare, in individual cases. RNA analysis srudied by reverse 
transcriptase polymer a se chain reaction (RT-PCR), in situ hy- 
bridization (1SH), and irrununoWslocheriustry. A series of 20 
nonneirrbcndocrine lung carcinoma samples served as controls. 
RT-PCR was positive for ssl2 in 22 of 26 samples, including 15 
of. 15 typical carcinoids* 5 of 6 atypical carcinoids, and 2 of 5 
small- cell carcinomas. Tbe sst 2 mRNA signal obtained by RT- 
PCR was strong in the majority (87%) of typical carcinoids and 
of variable intensity in atypical carcinoids and small-cell car- 
cinomas. A weakly positive signal was observed in 5 of 20 
control samples. In irorounohistochemistry, two different anti- 
bodies (anti-s$t2) were employed, including a monoclonal an- 
tibody, generated in the Department of Pathology, University 
of Turin. In the majority of samples a good correlation between 
s$t2 mRNA (as detected by RT-PCR) and sst2 protein expres- 
siob (as detected by immunohisiochernistry) was observed. 
However, one atypical carcinoid arid one small-eel) carcinoma 

had focal iromunostaining but no RT-PCR signal. ISH per- 
formed in selected samples paralleled tbe results obtained with 

ihe othei techniques. A low sst2 expression was associated with 
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high grade neuroendocrine tumors and with aggressive bchav- 
. ioi. It is concluded that 3) neuroendc>CTine tumors of the lung 
express sst2„ and there is a correlation between the mRNA 
amount and the degree of differentiation; 2) imrounohistochem- 
istry and JSH are reliable tools to demonstrate" sst2 in these 
tumors; and 3) ssi2 ideritificariorj in tissue sections may provide 
information on the diagnostic or therapeutic usefulness of so- 
matostatin analogues in individual patients with neuroendo- 
crine rumors. 

Key Words; Neuroendocrine — Lung — Tumors — Somato- 
statin receptors — Immunohistochemistry— -Small cell carci- 
noma — Reverse transcriptase polymerase chain reaction. 

Diagn Mol Pathol 9(1): 47-57, 2000. 



The somatostatin receptor family '(sst) includes at least 
five jsoforrns thai have been recently identified and char- 
acterized (18,32,41). Tbe ssts are widely distributed in 
normal human tissues and jd human rumors. Sst type 2 is 
more commonly detected in neuroendocrine tumors 
(32,37) and binds the somatostatin analogue octreotide 
with high affinity. 

Sst localization had originally been demonstrated by 
means of binding assays of radiolabeled somatostatin 
analogues (20,25,31). Subsequently, specific sst messen- 
ger RNA (mRNA) detection was obtained by means of in 
situ hybridization (]SH) and reverse transcriptase poly- 
merase chain reaction (RT-PCR) (14,32,37). Recently, 
polyclonal antibodies specific for different isoforms of 
sst were produced and used in irrununohisiochemisD'y 
(10,12,15,1830,35,36). Given the well-known beteroge- 
neiry of neoplastic populations, in situ methods (immu- 
nohistochemistry and JSH) allow a more definite map- 
pin" of the distribution of ihe receptor in .such tissues. 
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This is potentially useful for predicting the responsive- 
ness of a given neoplastic cell population to medical 
treatment with somatostatin analogues, which are used in 
the clinical setting for both diagnostic and therapeutic pur- 
poses with special reference to neuroendocrine tumors. 

The spectrum of neuroendocrine rumors of the lung 
includes well-differentiated neoplasms (so-called typical 
. carcinoids) and poorly differentiated small- cell carcino- 
mas (SCCs). Intermediate forms sharing features of both 
the aforementioned types also belong to this spectrum 
. (so-called atypical carcinoids or well-differentiated neu- 
roendocrine carcinomas). Finally, large-cell neuroendo- 
— Grine-carGinoma-has-beeii-identifjed and- induded-in-this- 
tumor group (4,40). The tissue distribution of sst2 in 
neuroendocrine tumors of the Jung has not been thor- 
oughly characterized, although individual samples of 
: bronchial carcinoids were found to express sst (30)^ 
SCCs (but riot non-small-cell types) were also shown to 
be sst2 positive by receptor binding assay (33). More- 
over, sst2 has been detected in in vitro cell cultures of 
human SSC of the lung (39,42). No study on a series of 
neuroendocrine rumors of the lung including all neuro- 
endocrine lung tumor types has been reported to date. 

The aim of this study was therefore to investigate the 
presence of sst2 mRNA and protein in a series of 26 
neuroendocrine tumors of the King, employing different 
technical approaches, such as RT-PCR, JSH, and immu- 
nohistochemistry. To this purpose a monoclonal anti- 
body to sst2 (N-tenninaJ) was generated in the Depart- 
ment of Pathology, University of Turin. The results were 
then compared and related to the rumor grade and to 
, other clinic opathologic parameters. 

MATERIALS AND METHODS 

Case Series and RNA Extraction 

Twenty-six samples of neuroendocrine rumors of the 
lung, in which fresh frozen tissue was available, were 
retrieved from the surgical pathology file of the Univer- 
sity of Turin, Italy. All samples were reviewed applying 
currently accepted criteria of classification (4,40), and 
the neuroendocrine nature was confirmed by positive jm- 
munostaining for chromoerarrin A (CgA) (with or with- 
out antigen retrieval) or synapiophysin, and by positive 
.RT-PCR for CgA mRNA. According to die classi- 
fications described here, these included J 5 well- 
differentiated neuroendocrine tumors (typical carci- 
noids), 6 well-differentiated neinocndocrine carcinomas 
(atypical carcinoids), and 5 SCCs. 

A series of 20 non-small-cell lung carcinomas (10 
squamous, 9 adenocarcinomas, arid ] large- cell anaplas- 
tic) lacking neuroendocrine differential on, as demonstrated 
by De«:itive immunohistochemisiry and RT-PCR for CsA 
(]), served as a control group. Clirticopaihologic data and 
foliow- up information were obtained for al] paiienis. 



For hybridization analysis, total RNA was extracted 
using the guanidine thiocyanate-cesium chloride method 
(5). The concentration of RNA was estimated by spec- 
trophotometry, and RNA degradation was assessed by 
agarose gel electrophoresis, as previously reported {31). 

Reverse Transcriptase Polymerase Chain Reaction 
for sst2 and Chromogranin A 

Total RNA (2 pg) was first digested, with 10 units of 
RNase-free DNase (Boehringer, Mannheim, Germany) 
in a 10- pi solution containing 20 mmol/L MgCl 2 , to 
. avoid DNA contamination. The solution was kepi at 
room temperamre fortO'minutesrtben heated for 5 min- 
" utes at 70°C to inactivate the DNase molecules; 40 
pmol/L of oligodeoxy thymidine primers (oligo-dTl 6) 
were added and the solution was heated again at 70°C for 
10 minutes, then chilled on ice to allow the primer hy- 
bridization. The resulting solution was reverse tran- 
scribed using 100 units of reverse transcriptase (Gibco 
BRL, Gaitersburg, MD). Complementary DNA (cDNA) 
was generated in a 50-jjlL final reaction volume contain- 
ing 50 mmol/L Tris-HCl pH 83, 75 mraol/L KCL 3 
mmol/L MgCl 2 » 10 mmol/L dithiotbreitol, 1 irimol/L de- 
oxynucleotide triphosphates (dNTPs), and 20 units of 
RNasin (Proroega, Madison, WJ). The solution was 
heated at 37°C for 90 minutes. Finally, the enzymes were 
inactivated by heating to 70°C for 10 minutes. 

The efficiency of the reverse transcription was deter- 
mined by reforming a PGR reaction having the ^2- 
microgJobulin "housekeeping gene" as a target. PCR was 
carried out in a 10-ixL final reaction volume containing 
1 \iL of cDNA template, 10 pmol of sense and an ti sense 
oligonucleotide primers, 67 mmol/L Tris-HCl pH 8.8, 16 
mmol/L (NH 4 )2S0 4> 0.01% polysorbate 20, 2 mmol/L 
dNTPs, 1 mmol/L MgCl 2 , and 0.5 units of Taq polymer- 
ase. p 2 - Microglobulin, sst2, and CgA PCR reactions were 
performed using the same protocol at the following PCR 
conditions: 35 cycles, each cycle consisting of denaturation 
at 94 °C for 2 minutes, annealing at 55°C for 1 rniriuie for 
P 2 -rjiicroglobulin, at 6] °C for sstZ and at 68°C for C^A; 
extension was performed at 72°Cfor 3 minute. Tire primers 
used for RT-PCR (9,1 1,2337) are reported in Table 1. 

The amplified fragments were run in a 1% agarose gel, 
containing ethidium bromide. Strict precautions against 
contamination were undertaken (19) and negative con- 
note (a no- template contro) and a noHreverse trans criptase 
control and distilled water to replace the RNA) wer e in- 
cluded. The RNA extracted from an H7J 6 neuroendocrine 
colon carcinoma cell line, and from a neuroblastoma (37) 
served as positive controls for CgA and sst2 t respectively. 

Antibodies 

Two different antibodies specific tor ssi2 were em- 
ployed. The firsi one was a monoclonal ami body raised 



SOMATOSTATIN RECEPTOR TYPE 2 IN NEUROENDOCRINE LUNG TUMORS 
TABLE 1. Sequences of primers used for reverse transcriptase potyn^^ 
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Size of PCR 
product (bp) 



PosrDon 



Study 



^r^g^ormsenSKS'ACC CCC ACT ^ GAT SA_3; 
1 fe-microBtobulin anfeense: 5' ATC TTC AAA CCT CCA TGA TG 3 
3 CSTR2 sense- 5' CAG TCA TGA GCA TCG ACC GA 3 
. iH^ra a^e: 5' GCA AAG ACA GAT GAT GGT GA 3' 
X SaA sense- 5' GCT CCA AGA CCT CGC TCT CC 3' 

anteense: W 6AC CGA CTC TCG CCT TTC CG 3" 

PCR; potymerase chain reaction. 



in the Department of Pathology (Dnivexsiry of Turin) 
y.cific for an N-temii pa] s^"e^-?. f . jl^ st2 (shared- 



:;ised 



>- by both A and B receptor isoforms). The octapepride 
EPYYDLTS, corresponding to amino acids 35 to 42 of 
: the human receptor (and differing by one amino acid 
" ; " from the mouse sequence), was synthesized, having a 
-;' ly^in added to the N-terminaL This sequence was similar 
£ 10 that used by other groups to produce polyclonal anti- 

- : ' bodies (17,18,27). This sequence was rather short but 
made it possible to avoid extensive homology with sstl. 

: In addition, according to a genbairk search using FASTA 
\" (28), this protein sequence is unique to human sst2 and 
" has a partial homology only with Tat and human nuclear 
U. receptor retinoid orphan nuclear- receptor- beta (a protein 

- ■" having nuclear localization). Three Balb/c mice were im- 
; mumzed with the peptide conjugated to keyhole limpets 
■v hemocyanin (KLH) (Sigma, St Louis, MO) following 

- the standard procedure. After the first intraspJenic injec- 
tion (100 u-g of protein) at time 0, the mice were intra- 
peritoneally injected six times with the pepode-KLH 
conjugate (150 jxg) iD the presence of Freund adjuvant 
The inactivity of the sera from each animal was evalu- 
ated using an enzyme-linked immunosorbent assay, us- 
ing The peptide coated onto the plastic. The bybridomas 

y were produced by somatic fusion of immunized spleno- 
cytes with the mouse myeloma cell line Ag8.X63.653, 
^. following the standard technique (21). Tbe monoclonal 
antibodies of interest were selected on tbe basis of tbe 
reactivity with the target peptide and with appropriate 
tissue sections: The latter included foraalin lixed and 
paraffin-embedded sections of pituitary. gland and pan- 
creatic islets and were analyzed by means of immuno- 
peroAidase staining. ParaUel control expeiimenls were 
also performed by staining serial sections of these 1 is- 
sues, omitting tbe primary antibody 01 with the preim- 
rotine serum or with the antibody preads orbed with high 
concentrations (1 mg/mL) of the antigenic peptide. In 
addition, the selected monoclonal antibodies (coded 
,10C6and lOG^.boub of IgM isotype, were further char- 
acterized by Western blotting. Membianes were picpared 
horn stable uansfecred Chinese hamster ovary (CHO> 
KJ cells, individually expressing jecombinant human so- 
rnniostatii) receptors (sstl to sst5). West en) blotting was 
performed as previously described (36). The monoclonal 
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antibody was used as culture supernatant at 1:3 dilution 
_ Jnr 2 hours at room te mperature., in Tris-buffered saline 
(TBS), supplemented with 6.1% polysorbate 20. Blots 
were washed in TTBS and incubated with peroxid.ase- 
conjugated goat antimouse IgM, diluted 1:1J500 for 90 
minutes at room temperature. Tben, blots were washed in 
TTBS and immunocomplexes were visualized using 
ECJL following manufacturer's instructions (Amersbam, 

Bucks, UK). ' - . 

A second polyclonal aDtibody was pioduced that bad 
been characterized previously (35,36). This antibody 
(coded K230) was raised in sbeep and was specific for a 
sequence of tbe C-tenninal portion of the sst2A (KSKL- 
OTTIETQRT1J-KEDLQ, amino acids 347 to 366). 

. Jjmmnnobist ocbemistry 

Sections 4 or 5 u. thicX, adjacent to those used for 
conventional histopathologic examination and irnmuDO- 
staining for nemoendocrine markers, were collected onto 
poly-L-lysine-coated slides. The proliferative activity of 
the tumors was assessed by means of Ki67 imnmoosta- 
nining (clone MIBJ, lmmunotecb, Marseille, France), 
diluted 1:10 after microwave-based antigen retrieval in 
citrate buffer). The ascitic fluid of monoclonal antibody 
10G4 was used in this study and was applied to tissue 
sections with prior antigen retrieval (three 3- minute pas- 
sages in a microwave oven at 800 "W in citrate buffer pH 
6.0), at the dilution of 1:10,000 or 1:12,000 for 30 min- 
utes at room temperature, tbe antiserum coded K230- 
was applied overnight at a dilution of 1:300 with no prior 
antigen retrieval. The immune reactions were then re- 
vealed with the immunoperoxidase technique (13) using 
the streptavidin-peioxidase kit and diaminobenzidine as 
cbroroogen. A weak nuclear countersign or no counter- 
stain was used in parallel sections. Control stainings for, 
both antibodies included immunoperoxidase of serial 
sections using preimrnune serum 01 antibody pread- 
soibed with the antigen or buffer instead of the primary 
antibody. 

In Sim Hybridization 

Selected tumors (12 samples) were also analyzed for 
sst2 mRNA expression by means' of a nonradioactive, 
ryramide deposition- based JSHjechmqne. Tbe proce; 
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dure of amplification was modified from procedures re- 
ported by Kerstens et al (J 6), Spee] et al. (38), and' the 
GenPoiot (bioDnyMynamide) manufacturer (DaJco, GJos- 
trap, Denmark). Briefly, 5-^- thick paraffin sections 
were collected onto silane-coated slides and deparaf- 
frnized through xylene and graded alcohols to phosphate 
buffer saline (PBS). The slides were then incubated for 5 
minutes in a microwave oven at 800 W in citrate buffer 
pH 6.0. After washing in PBS, they weie digested with 
proteinase K (1 p-g/mL) for 10 minutes at 23°C Endog- 
enous peroxidase activity was bJodced with 3% hydro- 
gen peroxide and endogenous biothi was blocked using 
*^r3in"^)cK:king-Te^enrtoT--J5 minutes -foHow ed by 



washing in PBS and biodrhblocking reagent for 15 min- 
utes (3). Sections were then prehybridized for 1 hour at 
room temperature in a mixture composed of 4x SSG, 
50% formamide, Denhardt's lx, dextrane sulfate 5x,500 
p.g/mL salmon sperm DNA, and 250 jag/mL tRNA. Hy- 
bridization took place overnight at 42°C in a solution 
containing the specific probe at a concentration of 1 
pmol/mL The probe was a digoxigenin-labeled 4 8- base 
oligonucleotide (32), complementary to positions 93 to 
139 of the human sst2 gene (41). After hybridization,, 
excess hybridization buffer and coverslips were removed 
by a rapid wash in 4x SSC followed by stringent washing 
in OAx SSC for JO minutes at 42°C. The hybrids were 
leveaJed by the following incubation steps: perqxidase- 
labeled antidigoxigenin (diluted 1:J00 in PBS) for 30 
minutes a! room temperature, bioiinylated tyramide (di- 
luted 3:5 in PBS) for 35 minutes„at room temperature, 
and peroxidase-labeled streptavidin for 15 minutes at 
room temperature. Diaminobenzidine was used as chro- 
mogen. Controls for JSH included staining of serial sec- 
tions with sense probe, an unrelated probe (EBER-J of 
the Epsiein-Barr virus), and omission of the probe in the 
hybridization mixture, with all other experimental con- 
ditions identical to the procedure described here. 



RESULTS 

Keverse Transcriptase Polymerase Chain Reaction 

All neuroendocrine tumors, but no nonneuToendocTine 
lung carcinomas, were positive for CgA nrRNA (Fig. 1). 
Sst2 inRNA was amplified in 22 of 26 samples of neu- 
roendocrine tumor. Tbe signals bad variable intensities 
(Fig. 2) and were weak in moderately or poorly differ- 
entiated tumors (mostly in SCCs). No amplification was 
obtained in no-template or no-reverse transcriptase ex- 
periments. Control samples (nonneuroendocrine lung 
carcinomas proven by negative CgA RT-PCR) were 
weakly positive for sst2 in 5 of 20 samples only, (includ- 
ing 3 adenocarcinomas^ 3 " squamous, and the large-cell 
anaplastic carcinoma) (Fig. 3). These, differences were 
statistically significant (P < 0.03) by X 2 test 

Characterization of Monoclonal Antibodies to sst2. 

Several clones were identified having a positive bind- 
ing by enzyme-linked immunosorbent assay and a par- 
allel immun ore activity 1 on formalin- fixed paraffin- 
embedded human endocrine tissues (pituitary and pan- 
creatic islets). In Western blotting experiments, two 
clones (coded )0C6 and 10G4) specifically developed a 
band at approximately 70 kDl When the antibodies were 
. used against CHO-transfected cells expressing recombi- 
nant somatostatin receptors 3 through 5, a specific band 
corresponding to sst2 (at approximately 70 kD) was re- 
vealed by the monoclonal antibody J0G4. Monoclonal 
antibody 10C6 developed a strong band with ssl2 but 
displayed a weaker reactivity also with sstl, 3, and 5, al 
least in the present experimental conditions (Fig. 4 A,B). 
The same antibodies were also tested by means of im- 
munoperoxidase staining, on formalin-fixed, paraffin- 
embedded samples of normal human pituitary gland and 
pancreas. Monoclonal antibody 1QG4 gave good results 
in immunohistochemisrry and was used at increasing di- 
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FIG. 1. Reverse transcriptase polymerase chain reaction for chrornograniri A (CGA) mRIMA 
in 26 samples ot neuroendocrine tumor ot the lung. Numbers in each lane correspond to 
sample numbers in Table 2. CGA mRNA is amplified at 429 bp. C and C* stand tor positive 
(neuroendocrine colon carcinoma cell line, B716) and negative (distilled water) controls, 
respectively. The !as! column to the right represents the molecular weight marker. All samples 
are positive with a variable intensity ot the amplification band. . 
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FIG. 2. Reverse transcriptase polymerase chain reaction for sst2 mRNA in 26 samples of 
neuroendocrine tumor of the lung. Numbers in each Jane correspond to sample numbers in 
Table 2. sst2 mRNA is amplified at 284 bp. C and C* stand 1or positive, (a neuroblastoma) 
and negative (distilled water) controls, respectively. The last column to the right represents 
the molecular weight marker. Twenty-two of 26 samples are positive with a variable intensity 
of the amplification band. 
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boons (up to 1:15,000) with specific staining. Using thin 
sections (approximately 4 pun), a strong membrane- 
bound and peripheral cytoplasmic imrnunoreactivity was 
found in an adenohypopbyseal cell population (corre- 
sponding to growth hormone-secreting cells, as con- 
firmed by double immunohistochemical analyses) and in 
pancreatic islets (Fig. 4 C T D). In the latter, the staining 
was apparently not restricted to a specific hormone- 
producing cell type and had a peripheral cytoplasmic or 
membrane distribution. Exocrine pancreatic cells (both 
acinar and ductal) were only occasionally immuno : 
stained. Immunobistochem)SD7 performed on serial con- 
trol sections, either omitting the primary antibody or us- 
ing the preirrtmune serum or antibodies preabsorbed with 
the synthetic peptide, was negative in both tissues. 
.Monoclonal antibody 10C6 had a relatively higher back- 
ground staining at similar dilutions. 

Iflmnmohistocbemistry - 

The antibodies to sst2 (monoclonal antibody 10G4 and 
polyclonal K230) gave slightly different imrnunoreac- 
tions in 25 samples, and staining was not done in 1 
sample because of lack of residual paraffin blocks. The 
monoclonal antibody 10G4 stained 2) of 25 samples, the 



negative samples being 1 atypical carcinoid and 3 SCCs 
(Fig. 5). The rumors had 5% to 25% of the neoplastic 
cells immunoreacdve. The staining was at the periphery 
of the cytoplasm, and omitting the counterstain its mem- 
brane-bound distribution was better outlined in most 
samples (Fig. 6). One sample of atypical carcinoid <no. 
21) was focally immunore active for sst2, despite nega- 
tive RT-PCR findings. Conversely, sample no. 26 was 
immunohistochemistry negative and RT-PCR positive. 
The antiserum anti-sst2A (code K230) °ave positive sig- 
nal in 19 of 25 samples, in 5% to 60% of the neoplastic 
cell population (Fig. 7). The location of the staining was 
at the membrane level associated with a weak cytoplas- 
mic reactivity. The same partem was seen in positive 
controls, e.g., pancreatic islets (Fig. 7, inset). Two 
samples (nos. 19 and 26) were negative in spite of a 
positive RT-PCR signal. Two other tumors (nos. 21 and 
•22), apparently devoid of sst2 mRNA, showed a small 
percentage of immunoreactive cells. Incidentally, one of 
these latter samples (no. 21) was also immunoreactive 
with monoclonal antibody 10G4 (Table 2). 

The five control samples positive 'by RT-PCR were 
also reactive with the antibodies. The type of immuno- 
cytochemical location of s*st2 receptors was similar to 
thai described here, being a peripheral cytoplasmic stain- 




M 1 2 3 4 5 6 7 8 9 30 U 12 J3 14 15 )6 17 J8 19 20 C C* M 

FIG. 3. Reverse transcriptase polymerase chain reaction ioi ss\2 mRNA in 20 control 
samples cf n on neuroendocrine lung carcinoma. Five ol 20 samples show a weak band at 
204 bp corresponding to sst2 mRNA. Con! rot" columns {C and C*) are identical 1o those in 
Fio. 2. 
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F)G. 4. Western blot analysis of monoclonal antibody 
clones 10G4 and 10C6 against sst2 in Chinese hamster 
ovary cells transferred with recombinant sst 1 through 5 
(numbers of each column "correspond to receptor type). 
Monoclonal antibody 10G4 shows a specific band at ap- 
proximately 70 kD for sst2 only (A) as opposed lo mono- 
clonal antibody 10C6> which strongly reads with sst2 but 
also has some degrees of cross-reactivity with sst 1, 3, 
and 5 (B). The lower figures show control formalin-fixed 
paraffin-embedded pancreatic islets immunostained with 
monoclonal antibody 10G4 without (C) and with (D) 
preadsorption with the peptide antigen, respectively. The 
majority of endocrine cells show a membrane- bound in> 
muhoieactivity (C) (immunoperoxidase). Bar: 90 urn. 



ing present in 40% to 70% of neoplastic cells. A weak 
and focal staining was a)so observed in five of the re- 
maining KT-PCR-negauve samples/ when the antibody 
K230 was used (but not when the monoclonal was em- 
ployed). 

Several celts in periiumojal tissues were occasionally 
stained. Ciliated cells of bronchial mucosa had a periph- 
eral staining at the cilia border. Mucous glands were 
negative. Kaje chondrocytes had a membrane staining. 
The wall of perimmoraJ as well as of occasional distant 
vessels was siained at the endothelium level and in oc- 
casional smooth muscle celts. 

. The reactivity "of both antibodies was abolished in se- 
rial sections when the reagents were preabsorbed with 
the respective synthetic peptides, but not when an unre- 
lated peptide was used. The perirumoral bronchial mu- 
cosa had a focal staining of ciliated cells with both an- 
tibodies. This ic activity disappeared when the pic ab- 
sorbed ami hotly was applied. 



In Situ Hybridization 

Eight of 12 samples stained by ISH were positive for 
sst2 mRNA. The mRNA was present in a percentage of 
cells (ranging from 10% to 40%) and gave a weak signal 
(Fig. 8), despite the amplification provided by the tyra- 
nt! de-based procedure. The background level was mini-, 
mal using diluted biotinylated ryrarnide. Control sections 
stained with sense probe or an unrelated probe, or omit- 
ting the pjobe, were consistently negative. 



Clinical Data 



GinicopatholGgic data are summarized in Table 2. At 
follow-up, the majority of patients with typical carci- 
- noids are free from disease 1 to 1 1 years after surgery. 
Two patients are alive with stable metastatic disease. 
Patients affected by atypical carcinoids had disease pro- 
gression in one third of samples. Finally, patients with SCC 
had fatal outcomes* within 1 year from diagnosis (except the 
recent sample). Eight patients had preoperative octreotide 
scintigraphy performed at the time of diagnosis. All patients 
had positive octreoscan findings, *and, in these patients, 
also the tumor was positive by RT-PCR and immu- 




FIG. 5. sample no. 25 (small cell carcinoma). Absence of 
immunoreactivity lor sst2 with the monoclonal 10G4. This 
sample was also negative by reverse transcriptase poly- 
merase chain reaction and in situ hybridization. (Immuno- 
peroxidase in a formalin- fixed paraffin-embedded sample. 
Nuclei slightly counterstained with hernalum.) Bar: 45 prn. 
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no^istocbemistiy or 1SH. In addition, three of these pa- 
tients received octreotide therapy administered at the 
time of rumor recurrence or metastatic spread. Stable 
disease is recorded at follow-up more than 5 years after 
diagnosis. 

Correlations 

Overall, complete overlapping (i.e., RT-PCR, ISH, 
and immunobistochemisiry with two antibodies) be- 
tween sst2 geDe and protein expression was obtained in 
21 of 25 samples (84%) and between RT-PCR results 

the antibodies in 24 of 25 samples (96%). The monoclo- 
nal antibody 10G4 looked highly sensitive, being able to 
stain all but one sample (no. 26) (95%) positive for sst2 
.mRNA by RT-PCR. Sst2 expression, at mRNA as well 
as at protein levels, was reduced in high grade rumors, 
with SCCs being weakly positive in ODly two of five 
samples. Decreasing expression of sst2 appears to cor- 




FIG. 6. sample no. 16 (typical carcinoid). Immunohisto- 
chemical detection of sst2 by means ol monoclonal anti- 
body 10G4. The neoplastic cells have a peripheral cyto- 
plasmic staining and membrane positively in some .cells, 
whereas the peribronchial gland adjacent to Ihe tumor is 
unreactrve. (Immunoperaxidase in a foimalin-fixed paraf- 
tin-embedded sample. Nuclei slightly counterstained with 
hemalum.) Bar; 45 pm. The membrane- hound distribution 
of the immunostaining is better outlined in a parallel sec- 
lion stained lor monoclonal antibody 10G4 omitting' 
nuclear counterstain (inset). 




FIG. 7. Some sample as in Fig. 6. Immunohistocbemtcal 
detection ol sst2 by means of the polyclonal antibody 
K230\ The immunostaining is mora intense at the cell bor- 
der (arrows), as observed with the monoclonal antibody. 
In the inset, a pancreatic islet, used as positive control 
shws a predominant membrane-bound immunostaining of 
many neuroendocrine cells. (Immunoperoxldase in a for- 
malin-fixed paraffin-embedded sample. Nuclei slightly 
counlerstained with hemalum.) Ban 45 pm. 

relate with high tumor grade and elevated proliferative 
activity, but not with other parameters such sex, age, or 
tumor size. 



DISCUSSION 

Jn this study, the presence of sst2 mRNA has been 
demonstrated in a series of resected neuroendocrine ru- 
mors of the lung by.means of RT-PCR and confirmed by 
a sensitive nonradioactive tyramide-based JSH procedure 
and by immunohistochemistry with anti-sst2 antibodies. 
Samples of both carcinoid tumors and SCCs were sst2 
positive, although 2 reduced, or absent signal was ob- 
served in poorly differentiated (small^cll) carcinomas. 
This is the first study of sst2 expression in a relatively 
laroe series' of neuroendocrine tumors of the lung. Single 
samples of human carcinoids and SCCs (including cell 
lines of ihe latter) had previously been analyzed and 
found to express sst2 (7,15,30,32,33,39/12). Several 
methods have been used 10 detect these receptors and 
partially overlapping re.su I is- were obtained. 

Jn the present study, the expression of high amounts of 
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TABLE 2. Cfinicopathologic data and somatostatin receptor type 2 (sst2) expression in 26 cases of neuroendocrine 

- lung tumors 



Patient 






Size 


Follow-up 


CgA 


CgA 


SYP 


Ki67 


sst2 


SS12IHC 


ssJ2 IHC 


no. 


Diagnosis 


Sex/age 


(cm) 


(mo) 


tHC 


RT-PCR 


IHC 


IHC* 


RT-PCR 


Mab(10G4) 


(K230 Ab) 


1 


.WD NET 


F/35 


3J5 


NED 90 


+ 


44-4 


+ F 


1.5 


44-4 


+ 


, 4 


2 


WD NET 


F/29 


4 • 


NED 45 


-* 


44 


+ F 


3 


++ 


+ 


+ 


3 


WD NET 


F/41 


2.5 


NED 70 


+ 


4+ 


4- 


0.1 


+++ 


*+ 


+ F 


4 


WDNETt 


F25 


4 


NED 21 


+ 


4+ 


+ 


2.6 


+44 


+ F 


+ F 


5 


WD NET| 


M/58 


3.8 


NED 23 


4 


444 


4 


13 


44+ 


+ F 


4 


6 


WD NET 


M/52 


2.5 


NED 42 


4 


44 


+ 


. 4.5 


4++ 


+ F . 


+ F 


7 


WD NET 


F/69 


3.5 


NED 47 


4 


44 


+ 


1 


+ 


*4 


+ 


8 


WD NET 


M/29 


3 


NED 70 


4 


44+ 


+ , 


1.5 


44-4 


" + 


4 




WD NET 


M/27 . 


4 


NED 108 


4 • 


444 


4 


NT 


444 


+ 


+ 


10 


WD NE Cat 


M/66 


'8 


A WD 55 


+'* 


44 


4 F 


1.1 


4-f 


+ F 


+ 


11 


WD NET} 


F/29 


2 


A WD 56 


•f 


444 


+ 


2.5 


+4+ 


+ 




12 


wd uer 


F/32 


3 


NED.26 


4 


44 


+ 


4" " 


""+++• 


+ 


+ 


13 


WD NE Ca 


. WW 


3 


NED 133 


4 


44+ 


+ 


1 


+++ 


NT 


NT 


14 


WD NET 


W2B 


4 


NED 130 


4 


4+4 . 


+ 


1 


+++ 


+ 


+ 


15 


WD NET} 


M/41 


1;3 


A WD 53 


4 


44+ 


4 


1.5 


444- 


+ 


+ 


16 


WD NETf 


F/31 


1 


NED 13 


+ 


44 


4-4 


2.6 


4 


+ 


+ 


17 


WD NET 


F/53 


4 


NED 24 . 


4 


444 


+ F . 


4 


444 


+ F 


+ 


18 


WD NE Ca 


M/62 . 


3 


NED 6 




44 




13 


444 


+ 


+ F 


19 


WD NE CA 


F/73 


5 


DOD20 


4 


■ 44-4 


4 


3 


44 


+ 




20 


sec 


M/57 


6 


DOD12 


4 


4+ 


4 


45 


4 


+ F 


+ F 


21 


sec 


rW51 


4.5 


DOD5 


4 


4 


4 


35 




+ F' 


4 F 


22 


WD NE Ca 


M/60 


6 


NED 51 


4 


44 


4 


1.5 






4 


23 


sec 


F/56 


6 


DOD11 


4 


+ 


4 


50 








24 


WD NE Ca 


W77 


2.5 


NED 21 


4 


4 


+ 


24 


4 


+ F 


4 F 


25 


sec 


M/57 


5 


DOD10 


4 


44 


4+ 


80 








26 


sect 


M/68 


11 


recent case 


4 


4 


4+ 


71 


+ 







A WD, alive with disease; CgA, chromogranin A; DOD, died of disease; -t F, focal: positive in <5% of cells; IHC; Immunohistochemistry; 
Mab, monoclonal antibody; NECa, Neuroendocrine carcinoma; NED, no evidence of disease; NET, neuroendocrine tumor; NT. not tested; 
RT'PCR. reverse transcriptase polymerase chain reaction; SCC, Small-cell lung carcinoma; SYP, synaptopbysin; WD, welt differentiated. 
. ' K»67 IHC: values correspond to percentage of positive nuclei of neoplastic celts. 

"t Patients who had preoperative odreoscan performed. 

t Patients who had odreoscan performed and octreotide treatment. 



sst2 mRNA was confirmed in we)] to moderately differ- 
entiated neuroendocrine tumors, in agreement with the 
results obtained by Reubi et al (32) by means of radio- 
active 1SH. The presence of sst2 mRNA in SCC had 
never been reported in human specimens, except for two 
samples included in. Reubi et aJ.'s series (32). Although 
the data on cell lines support the observation that SCCs 
contain sst2 (42), slightly discrepant results were found 
in some of samples described here. Unfortunately, SCCs 
are rarely operated on, and therefore it is difficult to 
collect a large number of surgical specimens. The five 
samples studied in the current series by means of RT- 
PCR had a low amount (two samples) or absent (three 
samples) sst2 mRNA. This could be the result of the 
extensive necrosis commonly present in such rumor 
types. However, because care was taken to freeze frag- 
ments thai weremacioscopically devoid of necrotic ar- 
eas, a more likely hypothesis is that sst2 expression is 
reduced in poorly differentiated rumors. Recently, Reis- 
inger ct al. (29) showed that the uptake of somatostatin 
analogues in patients with SCC undergoing chemo- 
therapy is significantly lower, and therapeutic external 
factors" may a/fee I the receptor status of individual m- 
mors. In addition, the uptake of somatostatin analogues 



in metastatic deposits of SCC has been shown to be low 
or absent (29,2). The present findings suggest that the 
sst2 mRNA content is related to the degree of tumor 
differentiation. These data roust be confirmed in larger 
series of nonneurbendocrine tumors to ascertain whether 
the observed loss or decrease of sst2 expression in neu- 
roendocrine rumors is a common, event linked to neo- 
plastic dedifferentiation. In addition, further studies are 
needed to assess the functionality of such receptors, by . 
comparing the profile of sst2 expression in tumor tissues 
with binding assays employing labeled somatostatin and 
wiih the clinical response to diagnostic and therapeutic 
administration of somatostatin analogues. 

To this purpose, several investigators have demon- 
strated a correlation between clinical imaging or re- 
sponse to somatostatin analogue treatment, and ssl2 
mRNA content in single, samples of carcinoid tumors 
(15,22). Northern blotting and 1SH weje the techniques 
used for sst2 mRNA identification. This kind of corre- 
lation is useful for selecting patients for somatostatin 
analogue treatment, although the demonstration of recep L . 
tor mRNA in a cell does not imply per se that the recep- 
tor is fully functional. 

The present study telied on a highly sensitive tech- 
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-FIG.- .8. sample no.-14(lypical carcinoid). In -sttu- hybrid- 
ization (ISH) for sst2 rnRNA shows a weak cytoplasmatlc 
staining (A) in most tumor cells.. An ISH performed with an 
unrelated probe was negative in a serial section of the 
same tumor (B). This , sample was strongly positive by 
reverse transcriptase polymerase chain reaction for sst2 
mRNA and by tmmunohistochemistry. (Nonradioactive- 
ISH revealed by peroxidase and dlamtnobenzidine, as 
substrate. Nuclei counterstained with hemalum.) Bar: 
75 pm. - 



; nique, RT-PCR, to identify all samples bearing even 
sraaJl amounts of sst2 mRNA. Indeed, in a previous 
study, single samples exhibiting octreotide-binding sites 
* had no demonstrable sst2 mRNA by mearis of ISH, pos- 
sibly due to the low sensitivity of the ISH procedure (34). 
The RT-PCR bas shown sst2 mRNA transcripts in tbe 
majority of samples here studied. Only four samples 
were negative, alj belonging to poorly differentiated high 
grade tumors, which usually follow an aggressive course. 
A decrease of sst2 mRNA expression in association with 
neuroendocrine tumor dedifferentiation had also been re- 
ported in neuroblastomas (37). In tbe above report, as 
weJI as in the current study, samples having an unfavor- 
able piognosis were found to contain a relatively low . 
amount of sst2 mRNA, as compared with well- 
differentiated rumors. 

In the current sample series, eight samples were in- 
vestigated before surgery with radiolabeled octreotide. 
Despite the low figures, alj tbe samples positive at the 
diagnostic procedure had a strong RT-PCR signal for 
sst2 mRNA. Three of eight patients were ajso responsive 
to ocueootfe ueatment administered at the time of re- 
-'apse or metastatic spread. More extensive correlative 
chnjcopaihologic smdies on the sst status are needed to 
better define the lissue distribution of somatostatin bind- 
ing sites and tbeir potential clinical role in the Deatment 
of patiems. 

Sst2 evaluation. by means of JSH (14,32) or RT-PCR 
(26,37) is a highly sensitive and reliable procedure. Un- 
fortunately, these techniques have limitations because 
frozen tissue is needed for some of them, and radioactive — 
material oi costly and lime- consuming methods aie nec- 



essary for others. InramjohistocbcmicaJ analysis of sst2 
by means of specific antibodies represents an ideal, 
cheap, and rapid alternative, easily applicable to archival 
materia]. For (hese reasons, several investigators have 
raised polyclonal antibodies specific for sst (8,10,15,17, 
18). In the current study, tested tumor fragments adjacent 
to those snap frozen for RT-PCR analysis were tested 
with a polyclonal antibody against a C-termiDaJ portion 
of the sst2A splice variant (35,36). In addition, .a moDr> 
clonal antibody was produced in the Department of Pa- 
thology (University of Turin) against an N-tenrunal se- 
quence of the human sst2. This antibody was the first 
monoclo n al devel oped against sst2 and was.sbown to be 



highly specific for sst2 in Western blot and immunobis- 
tocbemical analysis. Both tbe monoclonal and tbe poly- 
clonal antibodies specifically reacted with all. samples 
also positive by RT-PCR (with minor discrepancies in 
two samples, likely due to rumor heterogeneity). Tbe 
observed correlation between RT-PCR and immunohis^ 
tocbemistry indicates that the latter may be a reliable 
diagnostic tool and may allow irjamunom^tocbernical in- 
vestigation for sst2 even in small biopsy samples. Tins in 
rum may enable a rapid screening of sst2-positive rumors 
for medical treatment with somatostatin analogues. 

Having corifirmed in a relatively large series that tbe 
vast majority of. neuroendocrine rumors of the lung con- 
tain variable amounts of sst2 mRNA, a final comment is 
deserved for sst2 expression- in nonneuroendocrine lung 
carcinomas. No data have been reported thus far in tbe 
literature concerning normal human lung, although in the 
present study some bronchial cells of peritumoral paren- 
chyma weie positive for sst2 when immunomstocberni- 
cal analysis was performed with either an ti body. Tbe 
staining was specific because it was abolished using pre- 
absorbed antibodies. Therefore, it is likely that normal 
. human lung tissue contains sst2/ This might be con- 
fumed by alternative techniques (e.g., Western blot, RT- 
PCR). However, in situ morphologic procedures, such as 
those employed here, have definite advantages. In fact, 
the lung is rich in vessels, and in several tissues (either iri- 
tumoral or in mflammatory-reactive conditions) the ves- 
sels were recently shown to contain sst (6). 

A low expression of sst2 was found in 25% of lung 
carcinomas of nonneuroendocrine type investigated in 
the present study by means of RT-PCR. Therefore, sst 
type 2, at least, does not appear to be extensively ex- 
pressed in nonneuroendocrine carcinomas of tbe lung. 
However, because two tumors in the control group (a 
squamous carcinoma and an adenocarcinoma, respec- 
tively) had positive octreotide scintigraphy, but no sst2 
mRNA, it is plausible that a heterogeneous disnibution 
of sst occurs in nonneuroendocrine lung rumors. Other 
receptor rypes may be expressed in these rumors and may 
be xesponsible for the positive icsults iii diagnostic test- 
ing. Because sso is also known to bind somatostatin 
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analogues, such as octreotide, with high affinity (24), the 
expression of this jeceptor type will be investigated in 
future studies. □ 
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Expression of somatostatin receptor types 1-5 iii 81 cases of 
gastrointestinal and pancreatic endocrine tumors. A correlative 
inimunohistochemical and reverse-transcriptase polymerase chain 
reaction analysis. 

Papotti M, Bongtovnnni M , Volante M , AlliaE, Laudolfi S , Hclboe L , Schindler M . 
ColeSL . Bussolati G . 

Department of Biomedical Sciences and Oncology, University of Turin, Via Santena 7, 
10126 Turin, Italy, mauro.papotti@unito.it 

Somatostatin receptors (SSTRs) have been extensively mapped in human rumors by 
means of autoradiography, reverse-transcriptase polymerase chain reaction (RT-PCR), in 
situ hybridization (ISH) and irnmunohistochemistry (IHC). We analyzed the SSTR type 
1 -5 expression by means of RT-PCR and/or IHC in a series, of 81 functioning and non- 
functioning gastroenteropancreatic (GEP) endocrine rumors and related normal tissues. 
Moreover, we compared the results with clinical, pathological and hormonal features. 
Forty-six cases (13 intestinal and 33 pancreatic) were studied for SSTR 1-5 expression 
using RT-PCR, IHC with antibodies to SSTR types 2, 3, 5 and ISH for SSTR2 rnRNA. 
The vast majority of rumors expressed SSTR types 1, 2, 3 and 5, while SSTR4 was 
detected in a small minority. Due to die good correlation between RT-PCR and IHC data 
on SSTR types 2, 3, and 5, thirty-five additional GEP endocrine tumors were studied widi 
IHC alone. Pancreatic insulinomas had an heterogeneous SSTR expression, while 100% 
of somatostatinomas expressed SSTR5 and 100% gastrinomas and glucagonomas 
expressed SSTR2. P re-operative biopsy material showed an overlapping 
immunoreactivity with that of surgical specimens, suggesting that the SSTR status can be 
detected in the diagnostic work -up. It is concluded that SSTRs 1-5 are heterogeneously 
expressed in GEP endocrine tumors and that IHC is a reliable tool to detect SSTR types 
2, 3 and 5 in surgical and biopsy specimens. 
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P-cadherin ovei expression is an indicator of clinical outcome in invasive 
breast carcinomas and is associated with CDH3 promoter 
hyponiethylation. 

Paredes J, Albergaria A, Oliveti a JT , Jcroninio C MiJanczi F . Sclimitt PC . 

Institute of Pathology and Molecular Immunology of Porto University (IPATIMUP), 
Braga, Portugal, jpaxedes@ipatimup.pt 

PURPOSE: P-cadherin overexpression has been reported in breast carcinomas, where it 
was associated with proliferative high-grade histological tumors. This study aimed to 
analyze P-cadherin expression in invasive breast cancer and to correlate it with rumor 
markers, pathologic features, and patient survival. Another purpose was to evaluate the P- 
cadherin promoter rnethylation pattern as the molecular mechanism underlying this gene 
regulation. EXPERIMENTAL DESIGN: Using.a series of invasive breast carcinomas, P- 
cadherin expression was evaluated and correlated with histologic grade, estrogen 
receptor, MIB-1, and p53 and c-erbB-2 expression. In order to assess whether P-cadherin 
expression was associated with changes in CDH3 promoter rnethylation, we studied the 
rnethylation status of a gene 5'-flanking region in these same carcinomas. This analysis 
was also done for normal tissue and for a breast cancer cell line treated with a 
demethylacing agent. RESULTS: P-cadherin expression showed a strong correlation with 
high histologic grade, increased proliferation, c-erbB-2 and p53 expression, lack of 
estrogen receptor, and poor patient survival. This overexpression can be regulated by 
gene promoter rnethylation because the 5-Aza-2'-deoxycytidine treatment of MCF-7/AZ 
cells increased P-cadherin mRNA and protein levels. Additionally, we found that 71% of 
P-cadherin-negative cases showed promoter rnethylation, whereas 65% of positive ones 
were unmethylated (P = 0,005). The normal P-cadherin-negative breast epithelial cells 
showed consistent CDH3 promoter rnethylation: CONCLUSIONS: P-cadherin expression 
was strongly associated with tumor aggressiveness, being a good indicator of clinical 
outcome. Moreover, the aberrant expression of P-cadherin in breast cancer might be 
regulated by gene promoter hypomethylation. 
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Mammary-derived growth inhibitor protciu and messenger ribonucleic 
acid concentrations in different physiological states of the gland. 

Politis h Gorcwtt RC , MuUer T. Grossc R . 

Department of Animal Science, Cornell University, [thaca 14853. 

Expression of mammary-derived growth inhibitor in tissue from lactating and involuting 
bovine mammary glands was investigated. Seventeen lactating, pregnant (220 to 272 d in 
gestation) cows were divided in two groups of 8 and 9 cows each. Cows of the first group 
were slaughtered while in lactation. Cows of the second group (9 involuting cows) were 
slaughtered at 13 to 52 d following sudden cessation of milking High concentrations of 
mammary-derived growth inhibitor (.63% of the total protein) were detected in mammary 
tissue of lactating cows. Mammary-derived growth inhibitor (less than .10% of the total 
protein) was dramatically reduced during most of the involution period (1 3 to 45 d 
following cessation of milking). Mammary-derived growth inhibitor was again detected 
(.28% of the total protein) during the last stage of the involution (46 to 53 d after 
cessation of milking), which coincided with colostrum formation. When steady state 
concentrations of mammary-derived growth inhibitor mRNA were examined, the results 
obtained mirrored those obtained at the protein concentration. These data suggest that 
regulation of mammary-derived growth inhibitor occurs via modulation of the steady 
state concentration of its mRNA. Furthermore, there is a strong correlation between 
mammary-derived growth inhibitor expression arid lactation in dairy cows. 
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T-cell receptor V beta-family usage in primary cutaneous and primary 
nodal T-cell non-Hodgkin's lymphomas. 

Preesman AH, Hu HZ, Tilanus MG , de Gcus B: Schuurman HJ ReitsmgR van 
Widien DF . vau Vloten WA. de Weger RA . " 

Department of Pathology, University Hospital Utrecht, The Netherlands. 

To evaluate whether the expression of T-cell receptor (TCR) V beta families in eight 
cases of malignant T-cell lymphomas took place in a preferential manner, we analyzed 
four cases of mycosis fungoides (MF), the most common form of primary cutaneous T- 
cell non-Hodgkin's lymphomas (NHL), and four cases of primary nodal T-cell NHL. The 
usage of V beta families in T-cell populations was investigated on mRNA that was 
transcribed to cDNA using a C beta primer and reverse transcriptase. Subsequently, the 
specific usage of the families was analyzed by polymerase chain reaction (PCR) using 
combinations of the selected C beta-oligonucleotide primer and one of the family-specific 
V beta primers. Peripheral blood lymphocytes from four healthy volunteers and 1 
"reactive" lymph node served as a control and expressed all 20 V beta families tested for. 
In T-cell lines, with restricted V beta expression, and in three patients with advanced MF, 
only one or two V beta families were expressed at the mRNA level. In an early MF lesion 
this monoclonal expression was absent several V beta families were expressed with a 
weak intensity This may indicate either a polyclonal origin of MF, or that too few 
monoclonal neoplastic cells were present in the tissue specimen. In the four nodal T-cell 
NHL, only one family could be clearly distinguished, whereas some of the other V beta 
families showed only a weak expression. These latter families represent the reactive T- 
cell component in the nodal T-cell NHL. Both in nodal T-cell NHL and in MF there was 
no preferential expression of a particular V beta family. There was a good correlation 
between PCR data and the expression of V beta-family protein products observed by 
lmmunohistochemistry on tissue sections of the T-cell lymphomas. All T-cell lines three 
cases of MF, and three cases of nodal T-cell NHL showed a rearrangement of the TCR 
beta chain on DNA level. 
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Matriliu-3 iu human articular cartilage: increased expression in 
osteoarthritis. 

P«»jg <X WeseJohG, Klatt AR , Wagener R , Swoboda B . 

Division of Orthopaedic Rheumatology, Department of Orthopaedics, University of 
Erlangen-Nuremberg, Rathsberger Str. 57, D-91054 Erlangen, Germany. 
oliver.pullig@med.uni-erIangen.de 

* OBJECTIVE: Matrilin-3 is a member of the recently described matrilin family of 
extracellular matrix proteins containing von Willebrand factor A- like domains. The 
matrilin-3 subunit can form homo-tetramers as well as hetero-oligomers together with 
subunits of matrilin- 1 (cartilage matrix protein). It has a restricted tissue distribution and 
is strongly expressed in growing skeletal tissues. Detailed information on expression and 
distribution of extracellular matrix proteins is important to understand cartilage function 
. in health and in disease like osteoarthritis (OA). METHODS: Normal and osteoarthritic 
cartilage were systematically analysed for matrilin-3 expression, using 
inununohistpchernistry, Western blot analysis, in situ hybridization, and quantitative 
PCR. RESULTS: Our results indicate that matrilin-3 is a mandatory component of 
mature articular cartilage with its expression being restricted to chondrocytes from the 
tangential zone and the upper middle cartilage zone. Osteoarthritic cartilage samples with 
only moderate morphological osteoarthritic degenerations have elevated levels of 
matrilin-3 mRNA. In parallel, we found an increased deposition of matrilin-3 protein in 
the cartilage matrix. Matrilin-3 staining was diffusely distributed in the cartilage matrix, 
with no cellular staining being detectable. In cartilage samples with minor osteoarthritic 
lesions, matrilin-3 deposition was restricted to the middle zone and to the upper deep 
zone. A strong correlation was found between enhanced matrilin-3 gene and protein 
expression and the extent of tissue damage. Sections with severe osteoarthritic 
degeneration showed the highest amount of matrilin-3 mRNA, strong signals in in situ 
hybridization, and prominent protein deposition in the middle and deep cartilage zone. 
CONCLUSION; We conclude that matrilin-3 is an integral component of human articular 
cartilage matrix and that the enhanced expression of matrilin-3 in OA may be a cellular 
response to the modified microenvironment in the disease. Copyright 2002 OsteoArthritis 
Research Society International. 
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Up-regulation of mitochondrial peripheral benzodiazepine receptor 
expression by tumor necrosis factor alpha in testicular leydig cells. 
Possible involvement in cell survival. 

Rey C , Mauduit C , Naureils O , Benahmed M, Louisot P , Gasnier F . 

hNSERM U. 189, Faculte de Medecine Lyon-Sud, BP12, 69921 cedex, Oullins, France. 

Porcine Leydig cells in primary cultures are resistant to tumor necrosis factor alpha 
(TNFalpha) cytotoxicity. Here we report that these cells can be rendered sensitive to 
TNFalpha killing by treatment with the translational inhibitor cycloheximide, suggesting 
the existence of proteins that can suppress the death stimulus induced by the cytokine. In 
search of these cytoprotective proteins, we focused on the constituents of the 
mitochondrial permeability transition pore (PT pore), whose opening has been shown to 
play a critical role in the TNFalpha-mediated death pathway. We found that TNFalpha 
up-regulated mRNA and protein expression of the mitochondrial peripheral 
benzodiazepine receptor (PBR), an outer membrane-derived constituent of the pore. A 
strong correlation was established between the resistance of the cells to TNFalpha killing 
and the density of PBR-binding sites. Concomitantly, TNFalpha down-regulated Bcl-2 
mRNA and protein expression. As Bcl-2 has been shown to be an endogenous inhibitor 
of the PT pore, we hypothesize that the TNFalpha-induced up-regulation of PBR 
expression may compensate for the decrease in Bcl-2 levels to prevent the opening of the 
PT pore. 
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GLUTI messenger RNA and protein induction relates to the malignant 
transformation of cervical cancer. 

Rndlovvski C , Becker A J , Schroder W , Rath VV , Buttner R , Moser M . 

Dept of Gynecology and Obstetrics, University Hospital Heidelberg, Vossstr 7-9, D- 
69 I 1 5 Heidelberg, Germany. 

We studied whether induction of glucose transporters (GLUTs) t to 4 correlates with 
human papillomavirus (HPV)-dependent malignant transformation of cervical epithelium. 
Tissue samples of cervical intraepithelial neoplasia (CIN; grades 1 to 3), invasive 
carcinomas, and lymph node metastasis were examined. HPV typing was performed. 
Tissue sections were irnmunostained with GLUTI to GLUT4 antibodies. Messenger 
RNA (mRNA) in situ hybridization confirmed GLUT1 protein expression. Weak 
expression of GLUT I was found in nondysplastic HPV-positive and HPV-negative 
epithelium; significant expression was observed in preneoplastic lesions, correlating with 
the degree of dysplasia. In CIN 3 high-risk HPV lesions, cervical cancer, and metastasis, 
GLUTI was expressed at highest levels with a strong correlation of GLUT1 mRNA and 
protein expression. Imrriunostains for GLUT2 to GLUT4 were negative. Cervical tumor 
cells respond to enhanced glucose utilization by up-regulation of GLUT1 . The strong 
induction of GLUT1 mRNA and protein in HPV-positive CIN 3 lesions suggests GLUTI 
overexpression as an early event in cervical neoplasia. GLUTI is potentially relevant as a 
diagnostic tool and glucose metabolism as a therapeutic target in cervical cancer. 
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Expression and distribution of lamimn alplial and alpha2 chains in 
embryonic and adult mouse tissues: an immunochemical approach, 

Sasaki T . Giltay R , TaksU, Tinipl R , Talts JF . 

Max-Planck- Institute for Biochemistry, Martinsried, D-82152, Germany- 
Protein levels, mRNA expression, and localization of laminm alphal and alpha2 chains in 
development and in adult mice were examined. Recombinant fragments were used to 
obtain high-titer-specific polyclonal antibodies for establishing quantitative 
radioimmuno-inhibition assays. This often demonstrated an abundance of alpha2 chain, 
but also distinct amounts of alphal chain for adult tissues. The highest amounts of alphal 
were found in placenta, kidney, testis, and liver and exceeded those of alpha2. All other 
tissue extracts showed a higher content of alpha2, which was particularly high in heart 

. and muscle when compared to alphal . Content of gammal chain, shared by most 
laminins, was also analyzed. This demonstrated gammal chain levels being equal to or 

. moderately exceeding the sum of alpha l and alpha2 chains, indicating that these isoforms 
representee major known laminin isoforms in most adult mouse tissues so far examined. 
Moreover, we found good correlation between radioimmuno-inhibition data and mRNA 
levels of adult tissues as measured by quantitative real-time reverse transcriptase-PCR. 
Embryonic tissues were also analyzed by radioimmuno-inhibition assays. This 
demonstrated for day 1 i embryos comparable amounts of alphal and gammal and a 
more than 25-fold lower content of alpha2. This content increased to about ,10% of 
alphal in day 13 embryos. The day 1 8 embryo showed in heart, kidney, and liver, but not 
yet in brain and lung, alphal/alpha2 chain ratios comparable to those in adult tissues. 
Immunostaining demonstrated alphal in Reichert's membrane (day 7.5), while alpha2 
could not be detected before day 11.5. These data were compared with 
immunohistochemical localization results on several more embryonic and adult tissue 
sections. Our results regarding localization are consistent with those of earlier work with 
some notable exceptions. This was in part due to epitope masking for monoclonal 
antibodies commonly used in previous studies in esophagus, intestine, stomach, liver, 
kidney, and spleen. 
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Discordant regulation of granzyme H and grauzymc B expression in 
human lymphocytes. 

Sedeiies KA, Sayers TJ, Edwards KM, Chen W, Pcllicci DG / Godfrey PI Trapani 
JA. 

Cancer Immunology Laboratory, Peter MacCalium Cancer Centre, Locked Bag 1 , 
A'Beckett Street, East Melbourne, 8006, Australia. . 

We analyzed the expression of granzyme H in human blood leukocytes, using a novel 
monoclonal antibody raised against recombinant granzyme H. 33-kDa granzyme H was 
easily detected in unfractionated peripheral blood mononuclear cells, due to its high 
constitutive expression in CD3(-)CD56(+) natural killer (NK) cells, whereas granzyme B 
was less abundant. The NK lymphoma cell lines, YT and Lopez, also expressed high 
granzyme H levels. Unstimulated CD4(+) and particularly CD8(+) T cells expressed far 
lower levels of granzyme H than NK cells, and various agents that classically induce T 
cell activation, proliferation, and enhanced granzyme B expression failed to induce 
granzyme H expression in T cells. Also, granzyme H was not detected in NK T cells, 
monocytes, or neutrophils. There was a good correlation between mRNA and protein 
expression in cells that synthesize both granzymes B and H, suggesting that gzmH gene 
transcription is regulated similarly to gzmB. Overall, our data indicate that although the 
gzmB and gzmH genes are tightly linked, expression of the proteins is quite discordant in 
T and NK cells. The rinding that granzyme H is frequently more abundant than granzyme 
B in NK cells is consistent with a role for granzyme H in complementing the pro- 
apoptotic function of granzyme B in human NK cells. 
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BCL2 protein expression parallels its mRNA level in normal and 
malignant B cells. 

Shen Y , Iqbal J, Huang JZ, Zhou G ; Chan WC . 

Department of Pathology and Microbiology, University of Nebraska Medical Center, 
Omaha, USA. 

The regulation of B-cell lymphoma 2 (BCL2) protein expression in germinal center (GC) 
B cells has been controversial. Previous reports have indicated posttranscriptional 
regulation plays a dominant role. However, a number of recent studies contradicted these 
reports. Using real-time polymerase chain reaction (PCR) and Standardized Reverse 
Transcripiase-PCR (SiaRT-PCR), we measured the level of mRNA expression in GC, 
mantle zone (MNZ), and marginal zone (MGZ) cells from laser capture microdissection. 
Both quantitative RT-PCR measurements of microdis'sected GC cells from tonsils 
showed that GC cells had low expression of BCL2 transcripts commensurate with the 
low protein expression level. These results are in agreement with micro array studies on 
fluorescence-activated cell sorter (FACS)-sorted cells and microdissected GC cells. We 
also examined BCL2 mRNA and protein expression on a series of 30 cases of diffuse 
large B-cell lymphoma (DLBCL) and found, in general, a good correlation. The results 
suggested that BCL2 protein expression is regulated at the transcriptional level in normal 
B cells and in the neoplastic cells in most B-cell lymphoproliferative disorders. 
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Quantitative determinations of the steady state transcript levels of 
hexokinase isozymes and glucose transporter isoforms in normal rat 
tissues and the malignant tumor cell line AH130. 

Shinohara Y, Yamamoro K . InooK, Yamazaki N . Tcrada H . 

Faculty of Pharmaceutical Sciences, University of Tokushima, Japan. 
yasuo@ph.tokushima-u.ac.jp 

The steady state transcript levels of the four hexokinase (HK) isozymes and four glucose 
transporter (GLUT) isoforms were determined quantitatively by Northern analysis of 
RN A samples from rat tissues using synthetic fragments o f (he RN As encoding the HK 
isozymes and GLUT isoforms. Results showed that the levels of HK isozyme transcripts 
were low in rat tissues, the level of that most highly expressed, the type I isozyme (HKl) 
in the brain being 0.025% of the total poly(A)+ RNA. A good correlation was found 
between the reported HK activities and the total amounts of transcripts encoding all HK 
isozymes in various tissues, showing that the HK activities in tissues can be estimated 
from the total amount of transcripts encoding HK isozymes. The proposed associated 
expressions of HK isozymes and GLUT isoforms in particular tissues were confirmed at 
their transcript levels. The steady state transcript levels of type II HK and the type I 
GLUT fsoform in the malignant tumor cell line AHI30 were also determined 
quantitatively. 
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UVA irradiation-induced activation of activator protein-1 is correlated 
with induced expression of AIM family members in the human 
keratinocyte cell line HaCaT. 

Silvers AL. Bowdcn GT . 

Department of Radiation Oncology, Arizona Cancer Center, The University of Arizona 
Tucson 85724, USA. . 

To determine whether the transcription factor activator protein- 1 ( AP- 1 ) could be 
modulated by ultraviolet A (UVA) exposure, we examined AP-1 DNA-binding activity 
and transactivation after exposure to UVA in the human immortalized keratinoevte cell 
line HaCaT. Maximal AP-I transactivation was observed with 250 kJ/m2 UVA between 
3 and 4 h after irradiation. DNA binding of AP-1 to the target 12-0- 
tetradecanoylphorbol-13-acetate response element sequence was maximally induced 1-3 
h after irradiation. Both de novo transcription and translation contributed to the UVA- 
mduced AP-I DNA binding. c-Fos was implicated as a primary component of the AP-1 
DNA-bindmg complex. Other components of the complex included Fra-2, c-Jun, JunB 
and JunD. UVA irradiation induced protein expression of c-Fos, c-Jun, Fra-1 and Fra-2. 
Phosphorylated forms of these induced proteins were determined at specific time points 
A strong correlation existed between UVA-induced AP-1 activity and accumulation of c- 
Fos, c-Jun and Fra-1 proteins. UVA irradiation also induced c-fos and c-jun mRNA 
expression and transcriptional activation of the c-fos gene promoter. These results 
demonstrate that UVA irradiation activates AP-1 and that c-fos induction may play a 
critical role in the response of these human keralinocytes to UVA irradiation. 
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Rat kidney glutamyl aminopeptidase (aminopeptidase A): molecular 
identity aud cellular localization. 

SongjL, Ye M Troyanovskaya M Wilk E , WUkS, Healy DP . 

Department of Pharmacology, Mount Sinai School of Medicine, City University of New 
York, New York 10029. 

Glutamyl aminopeptidase [aminopeptidase A (EAP), EC 3.4. 11 .7] is ail ectoenzyme that 
selectively hydrolyzes acidic amino acid residues from the amino terminus of 
oligopeptides- EAP activity is highest within the kidney and small intestine. The murine 
pre-B cell BP-1/6C3 and the human kidney glycoprotein gpl60 differentiation antigens 
have been reported to have biochemical properties indistinguishable from EAP. It is not 
known, however, if rat kidney EAP is a homologue of these antigens or molecularly 
distinct. Using the reverse transcription-polymerase chain reaction method with 
oligonucleotide primers based on the BP-1/6C3 nucleotide sequence, we isolated a 450- 
bp partial cDNA from rat kidney poly(A)+ RNA, The partial cDNA encoded a predicted 
protein diat was 92% and 86% identical to the murine BP-1/6C3 and human gp 160 
antigens, respectively; the amino acid sequence within the zinc-binding domain was 
completely conserved. Purification of EAP from rat kidney and rnicrosequence analysis 
of a cryptic digest peptide fragment ( 1 8-mer) indicated that the fragment was highly 
similar to a region within the BP-1/6C3 and gpl60 proteins. Northern blot hybridization 
and immunoblot analyses were also consistent with labeling of products the same size as 
reported for the BP-1/6C3 and gpl60 antigens. There was a good correlation between the 
cellular distribution of EAP rrtKNA and EAP immunoreactivity, with proximal tubules 
and glomerular mesangial cells having the highest densities. These results indicate that 
rat kidney EAP is a species homologue of the murine BP-1/6C3 and human gp!60 
antigens. Furthermore, on the basis of its cellular localization, rat kidney EAP is likely to 
be involved in degradation of oligopeptides within the glomerulus and the glomerular 
filtrate. Since cells that express EAP also express receptors for angiotensin It, an 
intrarenal vasoactive hormone that is a substrate for EAP, these results further suggest 
that EAP may play a role in modulating the activity of intrarenal angiotensin II. 
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Tumor necrosis factor-alpha uprcgulatcs (he prostaglandin E2 EP1 
receptor subtype and the cycIooxygcnasc-2 isoform in cultured amnion 
WISH cells. 

Spaziaai EP, Benoit RR , Tsibris JC Gould SF . O'Brien WF . 

University of South Florida Health Science Center, Department of Obstetrics 8c 
Gynecology, Taxnpa 33612, USA. espa2ian@c0mLmed.usf.edu 

Recent studies have demonstrated a strong correlation between infection and preterm 
labor. Preterm delivery is also associated with high levels of cytokines and prostaglandins 
in amniotic fluid. The purpose of this study was to investigate the effect of tumor necrosis 
factor-alpha (TNF-alpha) on the levels of cyclooxygenase, prostaglandin E2 production 
(PGE2), and expression of the PGE2 receptor subtype EP1 in amnion WISH cell culture. 
Amnion WISH cell cultures were incubated in increasing concentrations of TNF-alpha 
(0-50 ng/ml). Changes in cyclooxygenase and EPl receptor proteins were evaluated by 
Western blot analysis. Changes in EPl mRNA were evaluated by Northern blot, and 
culture fluid concentrations of PGE2 were estimated by enzyme immunoassay (ElA). 
-EPl protein (p<0.01), EPl mRNA (p<0.05), cyclooxygenase-2 (COX-2) protein 
(pO.OOl), and PGE2 concentrations (p<0.0l) all increased with increasing 
concentrations of TNF-alpha. Changes in COX-1 protein were not observed following 
TNF-alpha-incubation. The results suggest that TNF-alpha may play a role in infection- 
induced preterm labor by its pleiotropic ability to simultaneously stimulate COX-2 
activity, PGE2 concentrations, and PGE2 EPl receptor levels in human amnion. 
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Transcriptional activity of potent glucocorticoids: relevance of 
glucocorticoid receptor isoforms and drug metabolites. 

Sptka 1, Hammer S, Klcuscr B , Korting HC .' Schafer-Korting M . 

Institut for Pharmazie, Abteilung fur Pharmakologie und Toxikologie, Freie Universitat 
Berlin, Berlin, Germany. 

As compared to standard glucocorticoids (GC), prednicarbate (PC) is favorable in the 
treatment of eczema due to its high benefit/risk ratio. The remarkable anti-inflammatory 
effects of PC are in strong contrast to its reported low glucocorticoid receptor (GR) 
binding affinity. In transfected COS-7 cells we related the transcriptional potencies of 
PQ its metabolites and conventional GC to their receptor binding properties. Moreover, 
the expression pattern of the human GR isoform hGRalpha and its mutual dominant 
negative inhibitor hGRbeta in skin cells have been investigated as well as the influence of 
hGRbeta on receptor binding and transactivation. hGRalpha mRNA and protein was 
largely overexpressed in skin cells. hGRbeta showed no influence on hGRalpha binding 
and transactivation. Concentration response curves indicated the greater transact! vation 
potency of betamethasone 17-valerate followed by dexamethasone and prednisolone 17-. 
ethylcarbonate. Native PC appeared almost as potent as dexamethasone. With both a 
strong correlation was observed between transactivation and GR binding. Copyright 2003 
S. Karger AG, Basel 
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Specific inhibition of AQP1 water channels in isolated rat intrahepatic bile 
duct units by small interfering RNAs, 

Splinter PL , Masyuk AI , LaRusso NF . 

Center for Basic Research in Digestive Diseases, Division of Gastroenterology and 
Hematology, Mayo Medical School, Clinic, and Foundation, Rochester, Minnesota 55905, 
USA. 

Cholangiocytes express water channels (i.e. aquaporins (AQPs)), proteins that are 
increasingly recognized as important in water transport by biliary epithelia. However, 
direct functional studies demonstrating AQP-mediated water transport in cholangiocytes 
are limited, in part because of the lack of specific AQP inlubitors. To address this issue, 
we designed, synthesized, and utilized small interfering RNAs (siRNAs) selective for 
AQP1 and investigated their effectiveness in altering AQP I -mediated water transport in 
intrahepatic bile duct units (IBDUs) isolated from rat liver. Twenty-four hours after 
transfection of IBDUs with siRNAs targeting two different regions of the AQP 1 
transcript, both AQP1 mRNA and protein expression were inhibited by 76.6-92,0 and 
57.9-79.4%, respectively. siRNAs containing the same percent of base pairs as the 
AQP 1 -siRNAs but in random sequence (i.e. scrambled siRNAs) had no effect. 
Suppression of AQP 1 expression in cholangiocytes resulted in a decrease in water 
transport by IBDUs in response to both an inward osmotic gradient (200 mosm) or a 
secretory agonist (forskolin), the osmotic water permeability coefficient (P(f)) decreasing 
up to 58.8% and net water secretion (J(v)) decreasing up to 87%. A strong correlation 
between AQP1 protein expression and water transport in IBDUs transfected with AQP1- 
siRNAs was consistent with the decrease in water transport by IBDUs resulting from 
AQP1 gene silencing by AQP 1 -siRNAs. This study is the first to demonstrate the 
feasibility of utilizing siRNAs to specifically reduce the expression of AQPs in epithelial 
cells and provides direct evidence of the contribution of AQP 1 to water transport by 
biliary epithelia. 
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Type IV collagcnase (M(r) 72,000) expression in human prostate: benign and 
malignant tissue. 

Stearns ME, Wang M. 

Department of Pathology, Medical College of Pennsylvania, Philadelphia 19129. 

The expression of type IV collagenase (M(r) 72,000) has been examined in tissues from patients 
with benign prostatic hyperplasia (6 patients) and varying Gleason grades of malignant prostate 
cancer (18 patients), Immunoperoxidase labeling indicated that expression of the type IV 
collagenase was weak or nonexistent in benign tissue but consistently strong in the glandular and 
ductal epithelial cells of prostate tumors diagnosed at Gleason grades 1-8. In moderate to 
advanced cancer (i.e., Gleason grades 2 to 8), invasive tumor foci in the stromal tissue produced 
relatively modest amounts of type IV collagenase. The normal stromal tissue (i.e., fibroblasts) 
uniformly failed to produce detectable levels of type IV collagenase in the 24 patients examined. 
Northern and quantitative slot blot hybridization assays demonstrated that collagenase type IV 
rnRNA levels were low in benign tissue and high in malignant tumors. In contrast, the stromal 
cells did not express significant amounts of type IV collagenase mRNA. Enzyme- linked 
immunosorbent assays demonstrated that the amounts of type IV collagenase protein correlated 
directly with the mKNA levels in the tumor tissue. The studies suggest that type IV collagenase 
may be selectively overexpressed by malignant, preinvasive prostatic epithelial cells. 
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The decompensated detrusor III: impact of bladder outlet obstruction on 
sarcoplasmic endoplasmic reticulum protein and gene expression. 

Stein R , Gong C. Hutcheson JC , Canning DA, Zderic SA . 

Division of Urology, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, 
USA. 

PURPOSE: Regulation of calcium ion homeostasis has a significant role in smooth 
muscle contractility. The sarcoplasmic endoplasmic reticulum, calcium, magnesium, 
adenosine triphosphatase (SERCA) is a regulatory ion pump that may have a role in the 
functional outcome after outlet obstruction^ We investigate what correlation if any existed 
between SERCA protein and gene expression, and the contractile properties in the same 
bladder. MATERIALS AND METHODS: Standardized partial bladder outlet 
obstructions were created in adult New Zealand white rabbits, which were divided into, 
control, sham operated and obstructed groups. Muscle strip studies subcategorized the 
obstructed group into compensated (force greater than 50% of control) and 
decompensated (force less than 50% of control). Microsomal membrane and total RNA 
fractions were prepared from the same bladder tissue. Membrane proteins were used for 
Western blot analysis using a SERCA specific monoclonal antibody, and total RNA was . 
assessed with Northern blot analysis. RESULTS: The relative intensities of signals for 
the Western and Northern blots demonstrated a strong correlation between protein and 
gene expression. Furthermore there was a strong association between the loss of SERCA 
messenger RNA and protein expression and loss of bladder function. CONCLUSIONS: 
Bladder contractility after outlet obstruction is influenced in part by smooth muscle cell 
ability to maintain calcium homeostasis via SERCA. The loss of SERCA protein 
expression is mediated by down-regulation in gene expression in the same bladder. These 
data suggest that smooth muscle ion pump gene expression is in part mechanically 
(pressure work) regulated. 
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TNF-alpha and IL-8 at e upregulated in the epidermis of normal human 
skin after UVB exposure: correlation with neutrophil accumulation and E- 
selectiu expression. 

Strickland t , Rhodes LE , Flanagan BF, Fried mann PS . 

Department of Dermatology, University of Liverpool, United Kingdom. 

The in vivo response to ultraviolet B (UVB) radiation in skin is characterized by the 
accumulation of both mononuclear and polymorphonuclear cells within the dermis and an 
induction of vascular endothelial adhesion molecules. Epidermal production of cytokines 
(IL-8 and TNF-alpha) has been strongly implicated in the development of UVB-induced 
inflammation. In the current study, we examined the time course of IL-8 and TNF-alpha 
mRNA and protein expression in the epidermis over a 24-h period after in vivo UVB 
irradiation. Also, the induction of adhesion molecule expression and the accumulation of 
neutrophils within the dermis were followed; We found constitutive expression of both 
cytokines (mRNA and protein) in the epidermis of unirradiated skin. IL-8 was rapidly 
•upregulated after irradiation and mRNA and protein increased at 4 h, reaching a 
maximum between 8 and 24 h. TNF-alpha mRNA and protein was minimally increased 
by 8 h after UVB irradiation and reached a maximum by 24 h. No significant alteration in 
ICAM-1 or VCAIVt-l expression was observed. E-selectin expression, which was absent 
from control samples, was increased from 4 h onward and also reached a maximum at 24 
h, coinciding with peak neutrophil accumulation. A strong correlation (r = 0.96) was 
found between number of E-selectin-positive vessels and numbers of infiltrating 
neutrophils at this time. Moreover, because E-selectin expression was increased before 
any apparent increase in TNF-alpha protein (4 h), TNF-alpha does not appear to be 
involved in the early induction of the adhesion molecule, but cytokines such as TNF- 
alpha and IL-8 may act subsequently to augment the inflammatory response. 
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Basic fibroblast growth factor expression is increased in human renal 
fibrogenesis and may mediate autocrine fibroblast proliferation. 

Sjnitej^ Zeisberg M , 'Hemmerlein B, Sattler B. Hummel K , Becker V; Muller GA . 

Department of Nephrology and Rheumatology, Georg- August-University Gottingen, 
Germany, fstrutz@gwdg.de 

BACKGROUND: Interstitial fibroblasts play a critical role in renal fibrogenesis, and 
autocrine proliferation of these cells may account for continuous matrix synthesis Basic 
fibroblast growth factor (FGF-2) is mitogenic for most cells and exerts intracrine, 
autocrine, and paracrine effects on epithelial and mesenchymal cells. The aims of the 
present studies were to localize and quantitate the expression of FGF-2 in normal and 
pathologic human kidneys and to study the in vitro effects of FGF-2 on proliferation, 
differentiation, and matrix production of isolated cortical kidney fibroblasts. METHODS: 
FGF-2 protein expression was localized by immunofluoresence double labelings in 
normal and fibrotic human kidneys. Subsequently, interstitial FGF-2 labeling was 
determined semiquantitative^ in 8 normal kidneys and 39 kidneys with variable degrees 
of interstitial fibrosis and was correlated with the morphometrically determined 
interstitial cortical volume, hi addition, FGF-2 expression was quantitated by immunoblot 
analysis in three normal and six fibrotic kidneys. FGF-2 mRNA was localized by in situ 
hybridizations. Seven primary cortical fibroblast lines were established, and expression of 
FGF-2 and FGF receptor-1 (FGFR-I) were examined. The effects of FGF-2 on cell 
proliferation were determined by bromodeoxyuridine incorporation and cell counts those 
on differentiation into myofibroblasts by staining for alpha-smooth muscle actin, and 
those on matrix synthesis by enzyme-linked immunosorbent assay for collagen type I and 
fibronectin. Finally, proliferative activity in vivo was evaluated by expression of MIB- 1 
(Ki-67 antigen). RESULTS: In normal kidneys, FGF-2 expression was confined to 
glomerular, vascular, and a few tubular as well as interstitial fibroblast-like cells. The 
expression of FGF-2 protein was increased in human kidneys, with mbuJointerstitia! 
scarring correlating with the degree of interstitial fibrosis (r = 0.84, P < 0.01 ). 
Immunoblot analyses confirmed a significant increase in FGF-2 protein expression in 
kidneys with interstitial scarring. In situ hybridization studies demonstrated low-level 
detection of FGF-2 mRNA in normal kidneys. However, FGF-2 mRNA expression was 
robustly up-regulated in interstitial and tubular cells in end-stage kidneys indicating that 
these cells are the source of excess FGF-2 protein. Primary cortical fibroblasts express 
FoK2 and FGFR-] m vitro. FGF-2 induced a robust growth response m these cells that 
could be blocked specifically by a neutralizing FGF-2 antibody. Interestingly, the 
addition of the neutralizing antibody alone did reduce basal proli feration up to 31 5% In 
addition, FGF-2 induced expression of alpha-smooth muscle actin up to 1.6-fold but no 
significant effect was observed on the synthesis of collagen type 1 and fibronectin 
Finally, staining for MIB- 1 revealed a good correlation of interstitial FGF-2 positivity 



with interstitial and tubular proliferative activity (r= 0.71, P < 0.01 for interstitial 
proliferation, N = 30). CONCLUSIONS: Interstitial FGF-2 protein and mRNA 
expression correlate with interstitial scarring. FGF-2 is a strong mitogen for cortical 
kidney fibroblasts and may promote autocrine fibroblast growth. Expression of FGF-2 
correlates with interstitial and tubular proliferation in vivo. 
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Adiposity elevates plasma IYTCP-1 levels leading to the increased CD1 lb- 
positive monocytes in mice. 

TakahasliiK. MizuaraiS , Araki H , Mashiko S , Isliihara A . Kanatani A , Kadani H , 
Kotani H 

Banyu Tsukuba Research Institute in collaboration with Merck Research Laboratories, 
Tsukuba, Ibaraki 300-26 1 1, Japan. 

Obesity is currently considered as an epidemic in the western world, and it represents a 
major risk factor for life- threatening diseases such as heart attack, stroke, diabetes, and 
cancer. Taking advantage of DN A microarray technology, we tried to identify the 
molecules explaining the relationship between obesity and vascular disorders, comparing 
mRNA expression of about 12,000 genes in white adipose tissue between normal, high 
fat diet-induced obesity (DIO) and d-Trp34 neuropeptide Y-induced obesity in mice. 
Expression of monocyte chemoattractant protein-1 (MCP-1) mRNA displayed a 7.2-fold 
increase in obese mice as compared with normal mice, leading to substantially elevated 
MCP-1 protein levels in adipocytes. MCP-1 levels in plasma were also increased in DIO 
mice, and a strong correlation between plasma MCP-1 levels and body weight was 
identified. We also showed that elevated MCP-1 protein levels in plasma increased the 
CD 1 lb-positive monocyte/macrophage population in DIO mice. Furthermore, infusion of 
MCP-1 into lean mice increased the CD1 lb-positive monocyte population without 
inducing changes in body weight. Given the importance of MCP-1 in activation of 
monocytes and subsequent atherosclerotic development, these results suggest a novel role 
of adiposity in the development of vascular disorders. 
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Augmented expression of neuronal nitric oxide synthase in the atria 
parasyinpathctically decreases heart rate during acute myocardial 
infarction in rats, 

Takimoto Y , Aoyama T , Tanaka IC Keyaniura R , Yni Y , Sasayaina S . 

Department of Cardiovascular Medicine, Graduate School of Medicine, Kyoto 
University, Kyoto, Japan. 

BACKGROUND: Nitric oxide (NO) synthesized within sinoatrial ceils recently has been 
shown to participate in die autonomic control of heart rate. We hypothesized that NO in 
the neuronal cells in the heart was increased and parasympatheticaliy regulated heart rate 
after myocardial infarction (MI). METHODS AND RESULTS: We examined heart rate 
dynamics and neuronal NQ synthase (nNOS) expression and activities in the atria of rats . 
with MI 1, 3, 7, and 14 days after MI (n=7 to 22 for each group). Both the mRNA levels 
of nNOS in the atria determined by competitive reverse transcriptase-polymerase chain 
reaction and the protein levels determined by Western blotting were significantly 
increased compared with controls 1, 3, and 7 days after ML nNOS activity in the atria 1 
day after infarction was also increased in MI rats. nNOS immunoreactivity was observed 
in nerve fibers in the atria. After infusion of a specific inhibitor of nNOS and iNOS, l-(2- 
trifluoromethylphenyl) imidazole (TRIM) (50 nig/kg IV), heart rate was significantly 
(P<0.01) increased in MI rats compared with controls. I, 3, and 7 days after ML The 
iNOS-speciftc inhibitor, 1400W (10 mg/kg SC), did not significantly affect the heart rate 
in rats with ML The effect of TRIM was abolished by pretreatment with L-arginine (25 
mg/kg IV) or by parasympathetic blockade with atropine but not by propranolol. There 
was a strong correlation (r=0.837, P<0.0001) between the nNOS protein expression and 
heart rate change after TRIM infusion. CONCLUSIONS: These results indicate that 
increased nNOS parasympatheticaJIy-decreased heart rate via the production of NO in 
rats with acute ML 
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Differential upregulation of cellular adhesion molecules a t the sites of 
oxidative stress in experimental acute pancreatitis. 

Tclek G, Ducroc R , Scoazec JY, Pasquicr C , Feldmann G . Roze C . 

INSERM U 410, Universite Pans 7 Denis Diderot, 75870 Paris, France. 

BACKGROUND: Severe acute pancreatitis (AP)(2) is associated with exaggerated 
leukocyte adherence and activation. Endothelial cellular adhesion molecules (CAMs) can 
be induced by cytokines, but also directly by oxygen free radicals (OFRs), mediated by 
nuclear factor kappa-B (NF-kappa B). We investigated die behavior of inducible CAMs 
in relation to pancreatic oxidative stress. Our novel modification of cerium capture 
histochemistry (reaction of OFRs with cerium produces laser reflective Ce perhydroxide 
precipitates) combined with reflectance confocal laser scanning microscopy (CLSM) 
allows the histological codemonstration of in vivo OFR production and immunolabeled 
CAMs, or NF-kappa B. METHODS; Taurocholate AP was induced in rats; sham 
operated and normal animals served as controls. To achieve in situ, in vivo reaction of 
cerium with OFRs, animals were perfused with CeCl(3) solution at different time points 
(1, 2, 8, 24 h) and then sacrificed. E-selectin, P-selectin, ICAM-1, VCAM, and NF-kappa 
B p65 were labeled by immunofluorescence (IF) on frozen sections of cerium perfused 
pancreata. IF and Ce perhydroxide reflectance were simultaneously detected by CLSM. 
Pancreatic gene expression of the same CAMs was quantified by competitive RT-PCR 
(MIMIC internal control). RESULTS: Control pancreata showed negligible reflectance 
and minimal CAM expression. Early (I, 2 h) AP samples were characterized by intense, 
heterogeneous acinar OFR production, strong P-selectin, and increasing ICAM 
expression, with nuclear translocation of p65, histologically all colocalizing with the 
areas of acinar oxidative stress. Adherent polymorphonuclear leukocytes (PMNs) 
displayed weak OFR formation. Later (8, 24 h), a slowly declining P-seleetin, but 
persisting ICAM-1 expression, was paralleled by widespread adherence of PMNs 
producing surprisingly large amounts of OFRs. VCAM and E-seJectin showed a mild 
increase at 24 h. CAM gene activation was in good correlation with the protein 
expression. CONCLUSIONS: The early acinar oxidative stress is colocalized with NF- 
kappa B activation, preferential P-selectin, and ICAM upregulation in this AP model. 
Subsequently, adherent, activated PMNs become the major source of OFRs, thereby 
contributing to tissue damage. Copyright 2001 Academic Press. 
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Myotonic dystrophy: an unstable CTG repeat in a protein kinase gene. 
Timchenko L , Monckton DG , Caskcv CT . 

Department of Molecular arid Human Genetics, Baylor College of Medicine, Texas 
Medical Center, Houston 77030, USA. 

Myotonic dystrophy (DM) is caused by the amplification of CTG repeats in die 3 r 
untranslated region of a gene encoding a protein homologous to serine/threonine protein 
kinases. In DM patients the CTG repeats are extremely unstable, varying in length from 
patient to patient and generally increasing in length in successive generations. There is a 
strong correlation between the size of the repeats and the age of onset and severity of the 
disease. The molecular basis of the effect of the CTG expansion on the development of 
the DM phenotype continues to be investigated. The first working hypothesis of the 
molecular mechanism of DM was a reduction in steady-state myotorrin-protein kinase 
(Mt-PIC) mRNA and protein levels. However, although the consensus finding is that the 
Mt PK mRNA and protein levels are decreased in DM patients, it is still not clear if this 
reduction leads directly to the DM phenotype. In this short review we discuss the 
molecular aspects of CTG instability and the expression of the myotonin-protein kinase 
gene in normal and DM populations. 

Publication Types: 

• Review 

• Review, Tutorial 
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Induction of class 3 aldehyde dehydrogenase in the mouse hepatoma cell 
line Hcpa-1 by various chemicals. 

Torroncn R , Korkalamen M . Karcnlampi SO . 

Department of Physiology, University of Kuopio, Finland. 

The mouse hepatoma cell line Hepa-l was shown to express an aldehyde dehydrogenase 
(ALDH) isozyme which was inducible by TCDD and carcinogenic polycyclic aromatic 
hydrocarbons. The induced activity could be detected with benzaldehyde as substrate and 
NADP as cofactor (B/NADP ALDH). As compared with rat liver and hepatoma cell 
lines, the response was moderate (maximally 5-fold). There was an apparent correlation 
between this specific form of ALDH and aryl hydrocarbon hydroxylase (AHH) in the 
Hepa-1 wild-type cell line-in terms of inducibility by several chemicals. However, the 
magnitude of the response was clearly smaller for ALDH than for AHH. Southern blot 
analysis showed that a homologous gene (class 3 ALDH) was present in the rat and 
mouse genome. The gene was also expressed in Hepa-1 and there was a good correlation 
between the increase of class 3 ALDH-specific mRNA and B/NADP ALDH enzyme 
activity after exposure of the Hepa-1 cells to TCDD. ft is concluded that class 3 ALDH is 
; inducible by certain chemicals in the mouse hepatoma cell line, although the respective 
enzyme is not inducible in mouse liver in vivo. 

PMID: 1 505055 [PubMed - indexed for MEDLINE] 
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Relationship between cyclin Dl and p2I(Wafl/Cipl) during differentiation 
of human myeloid leukemia cell lines. 

Ullmannova V , Stockbaucr P , Hradcova M , Soucek J , Haskovec C . 

Department of Molecular Genetics, Institute of Hematology and Blood Transfusion, U 
Nemocnice I, 128 20 Prague 2, Czech Republic. u.Uman@uhkt.cz 

Expression of cell cycle-regulating genes was studied in human myeloid leukemia cell 
lines ML-1, ML-2 and ML-3 during induction of differentiation in vitro. 
Myelotnonocytic differentiation was induced by phorbol ester (12-o-Tetradecanoyl- 
phorbol- 13 -acetate, TPA), tumor necrosis factor alpha (TNFalpha) or interferon gamma 
(n^FgajTima), or their combination. Differentiation (with the exception of TNFalpha 
alone) was accompanied by inhibition of DNA synthesis and cell cycle arrest. Inhibition 
of proliferation was associated with a decrease in the expression of cdc25A and cdc25B, 
cdk6 and Ki-67 genes, and with increased p21(WafI/Cipl) gene expression, as measured 
by comparative RT-PCR. Expression of the following genes was not changed after 
induction of differentiation: cyclin AI, cyclin D3, cyclin El and p27(FCipl). Surprisingly, 
cyclin Dl expression was upregulated after induction by TPA, TNFalpha with 
IFNgamma or BA. Cyclin D2 was upregulated only after induction by BA. The results of 
the expression of the tested genes obtained by comparative RT-PCR were confirmed by 
quantitative real-time (RQ) RT-PCR and Western blotting. Quantitative RT-PCR showed 
as much as a 288-fold increase of cyclin Dl specific mRNA after a 24h induction by 
TPA. The upregulation of cyclin Dl in differentiating ceils seems to be compensated by 
the upregulation of p2 l(Wafl/Cipl ). These results, besides others, point to a strong 
correlation between the expression of cyclin Dl and p21(Wafl/Cipl) on the one hand 
and differentiation on the other hand in human myeloid leukemic cells and reflect a rather 
complicated network regulating proliferation and differentiation of leukemic cells. 
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Intestinal carbamoyl phosphate synthase I in human and rat Expression 
during development shows species differences and mosaic expression in 
duodenum of both species. 

Van Beers EH, Rin^s EH, Posthuma G . Dingemanse MA . Taminiau J A . Hermans 
HS, Einerhand AW . Duller HA . Dekker J . 

Pediatric Gastroenterology and Nutrition, Department Pediatrics, Emma Children's 
Hospital, Academic Medical Center, Amsterdam, The Netherlands. 

The clinical importance of carbamoyl phosphate synthase I (CPSI) relates to its capacity 
to metabolize ammonia, because CPSI deficiencies cause lethal serum ammonia levels. 
Although some metabojic parameters concerning liver and intestinal CPSI have been 
reported, the extent to which enterocytes contribute to ammonia conversion remains 
unclear without a detailed description of its developmental and spatial expression 
patterns. Therefore, we determined the patterns of enterocytic CPSI mRNA and protein 
expression in human and rat intestine during embryonic and postnatal development, using 
in situ hybridization and immunohistochemistry. CPSI protein appeared during human 
embryogenesis in liver at 31-35 e. d. (embryonic days) before intestine (59 e.d,), whereas 
in rat CPS! detection in intestine (at 16 e.d.) preceded liver (20 e.d.). During all stages of 
development there was a good correlation between the expression of CPSI protein and 
mRNA in the intestinal epithelium. Strikingly, duodenal enterocytes in both species 
exhibited mosaic CPSI protein expression despite uniform CPSI mRNA expression in the 
epithelium and the presence of functional mitochondria in all epithelial cells. Unlike rat, 
CPSI in human embryos was expressed in liver before intestine. Although CPSI was 
primarily regulated at the transcriptional level, CPSI protein appeared mosaic in the 
duodenum of both species, possibly due to post-transcriptional regulation. 
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Expression of deoxycytidine kinase in leukaemic cells compared with solid 
tumour cell lines, liver metastases and normal liver. 

van der Wilt CL , Kroep JR , Loves WJ , Rots MG , Van Groeningen CJ , Kaspers GJ , 
reters GJ . 

Department of Medical Oncology, VU University Medical Center, Amsterdam, The 
Netherlands. 

Deoxycytidine kinase (dCK) is required for the phosphorylation of several 
deoxyribonucleoside analogues that are widely employed as chemotherapeutic agents. 
Examples include cytosine arabinoside (Ara-C) and 2-chlorodeoxyadenosine (CdA) in 
the treatment of acute myeloid leukaemia (AML) and gemcitabine to treat solid tumours. 
In this study, expression of dCK mRNA was measured by a competitive template reverse 
transcriptase polymerase chain reaction (CT RT-PCR) in seven cell lines of different 
histological origin, 16 childhood and adult AML samples, 10 human liver samples and 1 1 
human liver metastases of colorectal cancer origin. The enzyme activity and protein 
expression levels of dCK in the cell lines were closely related to the mRNA expression 
. levels (r=0.75, P=O'.026 and r=0.86, P=0.007). In AML samples, dCK mRNA expression 
ranged from 1.16 to 35.25 (xlO(-3)xdCK7beta-actin). In the cell line panel, the range was 
2.97-56.9 (xl0(-3)xdCK/beta-actin) of dCK mRNA expression. The enzyme activity in 
liver metastases was correlated to dCK mRNA expression (r=0.497, P=0.05). In the liver 
samples, these were not correlated. dCK mRNA expression showed only a 36-fold range 
in liver while a 150-fold range was observed in the liver metastases. In addition, dCK 
activity and mean mRNA levels were 2.5-fold higher in the metastases than in the liver 
samples. Since dCK is associated with the sensitivity to deoxynucleoside analogues and 
because of the good correlation between the different dCK measurements in malignant 
cells and tumours, the CT-RT PGR assay will be useful in the selection.of patients that 
can be treated with deoxycytidine analogues. 
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Expression of cytokines and growth factors in human 
glomeruloneph ritides, 

Waldherr R , Noronha IL , Niernir Z , Kruger C Stcia H , Stunun G . 
Department of Pathology, University of Heidelberg, Germany. 

Numerous experimental studies point to the potential role of cytokines and growth factors 
in the pathogenesis of renal disease. However, from the various autocrine and paracrine 
mediators identified in vitro and in animal models, so far only a few have been 
demonstrated in selected human glomerulopathies. We examined two types of 
glomerulonephritis (GN): extracapillary GN with anti-neutrophil cytoplasmic 
autoantibodies (ANCA), an example of an acute form of GN, and mesangial IgA GN, 
usually a chronic form of GN, with immunocytochemistry, in situ hybridization and the 
polymerase chain reaction. Normal renal tissue from tumour nephrectomies served as a 
control. In ANCA-positive GN with active renal lesions (crescents, glomerular and 
vascular necrosis), infiltrating mononuclear cells in glomeruli and in the interstitium 
expressed interleukin (IL)-1 beta, tumour necrosis factor (TNF)-alpha, IL-2, interferon 
(FFN)-gamma, platelet-derived growth factor (PDGF) and transforming growth factor 
(TGF)-beta. Cytokine expression was also observed in activated resident cells, including . 
endothelial cells, capsular .epithelial cells, smooth muscle cells of vessel walls, fibroblasts 
and some tubular epithelial cells. In addition, we noted an increase in the cytokine and 
growth factor receptors TNF-R, IL-IR type II, IL-2R, IFN-gamma R and PDGF beta-R! 
In contrast, in mesangial IgA-GN, IL-1 beta, TNF-alpha, IFN-gamma and IL-2 were 
usually absent in glomeruli. Mesangial expansion in this disorder was accompanied by an 
increased expression of PDGF, PDGF beta-R, TGF-beta and IL-6 in mesangial areas. In 
both conditions a good correlation was observed between cytokine expression at the 
mRNA (in situ hybridization) and protein level (immunocytochemistry).( ABSTRACT 
TRUNCATED AT 250 WORDS) 

Publication Types: 

• Review 

* Review, Tutorial 
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Malignant transformation of the human endometrium is associated with 
overexpressiou of lactoferrin messenger RNA and protein. 

Walmer PIC Padui CJ , Wrona IMA . Healy BE, Bentley RC Tsao MS , Koliler MF . 
McLacltlan JA , Gray KB . 

Department of Obstetrics and Gynecology, Duke University Medical Center, Durham, 
North Carolina 27710. 

In the mouse uterus, lactoferrin is a major estrogen-inducible uterine secretory protein, 
and its expression correlates directly with the period of peak epithelial cell proliferation. 
In this study, we examine the expression of lactoferrin mRNA and protein in human 
endometrium, endometrial hyperplasias, and adenocarcinomas using 
irnmunohistochemistry, Western unmunoblotting, and Northern and in situ RNA 
hybridization techniques. Our results reveal that lactoferrin is expressed in normal 
cycling endometrium by a restricted number of glandular epithelial cells located deep in 
the zona basalts. Two thirds (8 of 12) of the endometrial adenocarcinomas examined 
overexpress lactoferrin. This tumor-associated increase in lactoferrin expression includes 
an elevation in the mRNA and protein of individual cells and an increase in the number 
of cells expressing the protein. In comparison, only I of the 10 endometrial hyperplasia 
specimens examined demonstrates an increase in lactoferrin. We also observe distinct 
cytoplasmic and nuclear immunostaining patterns under different fixation conditions in 
both normal and malignant epithelial cells, similar to those previously reported in the 
mouse reproductive tract. Serial sections of malignant specimens show a good correlation 
between the localization of lactoferrin mRNA and protein in individual epithelial cells by 
in situ RNA hybridization and immunohistochernistry. Although the degree of lactoferrin 
expression in the adenocarcinomas did not correlate with the tumor stage, grade, or depth 
of invasion in these 12 patients, there was a striking inverse correlation between the 
presence of progesterone receptors and lactoferrin in all 8 iactoferrin-positive 
adenocarcinomas. In summary, lactoferrin is expressed in a region of normal 
endometrium known as the zona basalis winch is not shed with menstruation and is 
frequently overexpressed by progesterone receptor-negative cells in endometrial 
adenocarcinomas. 

PMID: 7867003 [PubMcd - indexed for MEDLINE] 
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Malignant Transformation of the Human Endometrium Is Associated with 
Overexpression of Lactoferrin Messenger RNA and Protein 
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ABSTRACT 

In the mouse uterus, lactoferrin is a major estrogep-inducible uterine 
secretory protein, and its expression correlates directly with tbe period of 
peak epithelial tell proliferation. In this study, we examine tbe expression 
of lactoferrin mRNA and protein to human endometrium, endometrial 
hyperplasias, and adenocarcinomas using immunohisiocbtmistrj, West- 
ern imrounobMting, and NoHhero and in situ KNA hybridisation Jech- 
niqnes. Our results reveal that lacloferrin is expressed in normal cycling 
endometrium by a restricted number of glandular epithelial cells located 
deep in the zona basalis. Two thirds (8 of 12) of the endometrial adeno- 
carcinomas examined overexpress lactoferrin. This rumor- associated in- 
crease in lactoferrin expression includes an elevation in the mRNA and 
.protein of individual cells and an increase In' the number of cells express- 
ing the protein. In comparison, only 1 of the 10 endometrial hyperplasia 
specimens examined demonstrales an increase in lactoferrin. We also 
observe distinct cytoplasmic and nuclear immunostammg patterns under 
different fixation conditions in both normal and malignant epithelial cells, 
similar to those previously repojtrd in tbe mouse reproductive tracL 
Serial sections of malignant specimens show a good correlation between 
the localization of lactoferrin mKNA and protein in individual epiiheliaJ 
cells by in situ KNA hybridization and immujiohisiocbemistry. .Although 
the degree of Jactoterrin cxpiession in tbe adenocarcinomas did not 
correlate with tbe tumor stage, grade, or depth of invasion in these 12 
patients, lhere was s striking inverse correlation between the presence of 
progesterone receptors and lactoferrin in all 8 Jac l of rrrin- positive ode»o~ 
carcinomas. In summary, lactoferrin is expressed in a region of Dormal 
endometrium known as the zona basalis which Is not shed with menstru- 
ation and is frequently over expressed by progesterone retrptor-mgativ* 
cells 3n endometrial adenocarcinomas. 



INTRODUCTION 

Tbe uterus is a sex stejoid- responsive organ that plays a major role 
in women's htajlh. Hysterectomies wcie tbe most frequently per- 
formed majoi surgical pjocedwes in a 20-year study interval 
(1965-1984; RcJ. 1). Fifty-eight to 80% of ibese I2j5 million 
procedures were peifoimed foi estrogen-related disorders of prolifer- 
ation. Chronic unopposed estrogen exposure, most commonly asso- 
ciated with type II ovulatory disorders, eventually leads to the devel- 
opment of complex endometrial hyperplasia and adenocarcinoma. 
Since tbe sot steroids, esijo«cn and progesterone, act on rheii target 
tissues by regulating the expression of a wide variety of signaling 
rbolecules. identifying these regulatory factois will provide critical 
information towards under standing normal reproduction and repro- 
ductive tjact pathology. Out cunent Ia)ow)edge of estrogen and pro- 
gesterone action on the reproductive tract is based to a great extent on 
information collected from loclcnts (2). Although differences exist 
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between the., reproductive physiology of rodents and bnman^ the 
mouse has been a useful model for studying steroid hormone action in 
the human female reproductive tract (3, 4). One potential regulatory 
molecule shown to be regulated by estrogen in the mouse leproductJvc 
tract is lactoferriD. Lactoferrin is a basic glycoprotein with an extraor- 
dinarily high aifinjty foi iron that was origiMlly discovered in mflfc 
This protein is expressed in a wide variety of tissues, roost notably in 
polymorphonuclear leukocytes and roost mammalian exocrine glan- 
dular secretions. In the mammary gland (5) and tbe female reproduc- 
tive tract of the mouse (6-8), laclofenin is regulated by endocjine 
hormones; Prolactin stimulates lactoferrin synthesis in tbe breast; 
whereas in tbe uterus and vagina, the ovarian sex steroid, 170- 
estradiol, is tbe inducer. (6, 7, 9). To date, lactoferrin is one of the lew 
genes that have beefi identified in the rodent that are directly regulated 
by estradiol. The Jacioferrirj geDe contains an ERE 2 that is important 
for. regulating its expression m vivo in the mouse reproductive tract 
Being linked to Estradiol, the expression of lactoferrin by the uterine 
epithelium parallels the onset of DNA synthesis. Although sequencing 
information suggests that the human lactoferrin gene also contains a 
functional imperfect ERE in the 5'-flanlring promoter regiou (10, 11), 
theie is very little data regarding lactoferrin expression in tbe human 
female reproductive tract. 

The purpose of ouj study was to examine the expression of lacto- 
ferriD in the human endometrium under normal and pathological 
conditions by immuDohistochemistry, immunoblotting, and Northern 
and in siiu RNA hybridization techniques. In addition, we looked for 
conejations between bcioferrin expression and several parameters, 
such as tbe stage of the ineDStrual cycle, the distribution of estrogen 
and progesterone receptors, HER-2/ne« expression, markers of cell 
proliferation, and the histopatbological grade and extent of royome- 
irial invasion in ibe adeoocarcinonoas. Our data demonstrates that 
lactoferrin is expressed in a very restricted number of glands Lu the 
basal region of normal human endometrium and is markedly oveiex- 
pTessed in a significant Durnbej of the uterine adenocarcmornas by 
PR-negative cells. 



MATERIALS AN1) METHODS 

Tissue fVrpa ration ond iJistolo^ical Eva) ua lion. Sujgical pathology 
specimens weit obtained born DuJte University Medical Ccdicj (Duiham, NQ 
and ihc DcpailmcDl of Pathology at MODbea) General Hospila) (Quebec. 
Mojiirtot, Canada). Cycling endometrium w as obiained from 27 women (a^e,* 
31-49) : aj)d auophic tudo/nttriuni was obtained bom 7 postmenopausal 
women (ages 64-77). Hysiejeoomics from cycJmg women weic performed 
for subfttrosal ttioroyomas {n ~ 6). pelvic itlaxanon (n = S), pelvic pain 
(;i = 4), peritoneal tndomeuiosis (n = 2), 3Dd canctr Ot eitliti tbe eaoctrvu 
In = 1) or the ovaiy (n ~ J). In addition, 12 adcnocaicinojDBS, 3 ii)'p»cJ 
complex bypriplasias, 3 ccunplcj bypcrjilaiias without 3typia, and 4 ijmpk 
hypoplasia w f .i C z.nn)y2fd. P.zch human uicnis was bivalvtd il>o;iJy atitt 
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hysterectomy, and endometrium v/as removed from the fun da] region. A full 
tbic)a»css biopsy was placed into either 10% neutral-buffered formalin or 
BowVs solution overnight at roam temperature before dehydration, paraffin 
embedding, and sectioning at 4 fim on si)ani2ed slides. HistologicaJ cvalua- 
nons of hematoxylin and eosin-stained slides were performed blindly by one 
board-certified pathologist Normal endometrial samples were dated by the 
criteria of Hoyes el ol (12). Endometrial hyperplasias and carcinomas were 
classified according to the current recommendations, of the Internationa) 
Society of Gynecological Pathologists under the auspices of WHO (13). 
Histological grading of tumors was performed according to Federation Inter- 
nationale des Gynccologistes el Obstetrisies criteria (H). Each specimen was 
jcad a minimum of three times, and only specimens that were read consistently 
the same way were included in the study. Unstained sections of the same 
tissues wcie used for the cyiochemical analysis of protein and mRNA expres- 
sion using specific reagents. A few endometrial samples were frozen for 
subsequent protein and RNA extraction, which were evaluated by Western and 
1 Northern blotting, respectively. All human tissues were handled with the 
precautions and the guidelines requijed by Dulce University and National 
Institute of Environmental Health Sciences, 
/ ImmnnolocoKiation. Slides chosen for study were depa/affinized and 
( rinsed in 20% glacial acenc acid a) 4'Cfor 15 s to inhibit endogenous alkaline 

(phosphatase. All subsequent incubations and washes were at room tempera- 
ture. Sections were next equilibrated in PBS for 20 min and incubated for 20 

I. ruin with }J>% normal goal serum diluted in PBS to block nonspecific binding. 
Deiection of lactoftrrin was performed primarily with a rabbit anti-human 
lactoferrin polyclonal anlisera generated in our laboiaiory and affinity purified. 
. ! Similar results weje also seeD with a nun affinity -purified commercial antisera 
( . (Biogcntx, San Ramon,. Ca). Following incubation at iooid tempera lure with 
i" primary antisera foj 60 min, the sections were washed in PBS twice for 30 min 
l each, and lactoferrin was located using an alkaline phosphatasc-biotin- 
/ sneplavidin detection system (Vecmtain ABC-AP kit: Vtctoi Laboratories, 
j Burlingame, CA; oi the Super Sensitive Detection System: Biogeneo, San 
j -. Ramon, CA). To identify nonspecific staining, pre immune rabbit serum was 

(used in place of the primary antibody. The immunor taction was quantitatcd by 
determining the percentage of glands and ihe perceniage of cells sraining for 
lactofenin in the 20ns basal is and iht 2ona Junctionals, with b minimum of 
300 cells counted in each region. PR antibody was provided by Geoffrey 
GrecDe (KD66), and a commercial source was also used (Biogenex, San 
Ramon, CA). Identical siaining patterns were confirmed with both prepara- 
tions. Other commercially obtained an lis era include PCNA (Bjogcncx, San 
Ramon, CA), ER (ERJD5; AMAC, WesrbrooJL Ma), M1B-3 (AMAC), and 
HER-2/nrw (Biogcnex, San Ramon, CA). The primary antisera incubations 
were 2 b for the PR, j b for PCNA. M1B-J. and ER, and 30 min for 
HER-2/ncu. Antisera dilutions were 1:100 for MIB-j and 1:20 for HER-2/nen. 
Antigen retrieval (Biogenex, San Ramon, CA) was performed before adding 
the progestejone primary antisera. 

Western Blot Analysis. Proteins were exn acted from endometrial biopsies 
by homogenizatioh on ice in J% Tiiibn-X and 20 m.M Tris-HQ (pfl 7.4) with 
pioicast inhibitors (10 rig/ml leupeptin. 200 KU/ml aprotinin, and 20 pg/ml 
phenylroerbylsuJJonyl fluoride) and clarified by cenirifugarion ai 45,000 rpm 
for 30 min in a Beckman 70.1 Ti rotor: then the supernatant was analysed for 
protein concentration by tbe BCA protein assay (Pierce, Roc Word, IL). Ali- 
quots of 200 ^ig wcit separated by electrophoresis on an S3% SDS polyac- 
tylamide gel, blotted onto nitroccDulost nttmbiancs, incubated with poly- 
dona) rabbi) antihuman Ucioferrin Bnusera, and locawzed with an " J J-latxlcd 
ionkcy an ti- rabbit immunoglobulin, i* described previously (7). 

}n Sitv Hybridisation. All slides were preiicrjted with 0J2 N HQ fot 50 
min at room temperature, digested with 1 /ie/ml proteinase-K f Sigma Cbem- 
icai Co., St. Louis. MO) in 0.1 M Tris-HCl (pK 7.4V 0.05 m £DTA for J5 sun 
at 37"C, and then treated with 0.1 M iritthanOlaniiue-OJ^f % aceiic anhydride 
for s min ai.rooro temperature and 0.] m Tris- glycine (pH 7.4) Jor 30 min 3t 
loom irmpeiaime. The scciions were iuhscquently dehydrated with "jaded 
nhznol, air dried, and prthybridi2cd at 50*C for J Ij in 2 > SSC. 10 him DTI", 
- X Dtnhardt'i joJuiion, 100 pp/ml ol botb salmon ipcriD DNA and ye ail 
'fvNA, and 50% fprmarnide 05). The slides were tben hybrioi2«d ovcrntglii ai 
MTC in ibe same medium vtiib JO'S? doiian sulfate and 2*10* cpm-'/*- ol tbe 
ipffific RNA protx. The Iserolcrrio oligonucleotide probe was amplified by 
1*CR njing priinris thai ipmnrd nucleotides 718-J654 ()0) and cloned into 
pOfcM-42. "S-lr*hf ltd ;.enic and ami:* use RNA probe* >*t i r mude wjih ij>e 
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Promega Riboprobe kit (Prom eg a, Madison, Wi), washed twice in 1 X SSC foi 
10 min ai room temperature, digested with RNase [2.8 >g/m? RHasc-A, 03 
ixg/ml RNase-TJ, ]0 mM Tris-HQ (pH 7.4), and 15 dim Nad], and washed 
again with 1 X SSC twice, for 20 mm each time at 50*C, twice for 20 in 
0J X SSC at 55 W C, and once for 20 min. at 6XTC Tbe sections wtrt then 
dehydrated and dipped- in KodaJt autoradiographic emulsion (MTB-2) for 
detection of specific mRNA expression. The slides were allowed to develop for 
2 weeks. After this period, the slides were developed nsing Kodak DJ9 
developer and Kodak Rapid Fixej- 

Nonbero RNA Analysis. Total ctlluJai RNA was purified from frozen 
tissue by the guanidinc isomiocyanale-ccsiuiD chloride method, and poly(A*)* 
RNA was isolated by oligo<dT)-ctllulost chromatography using methods 
described previously fl5> For Northern blot analysis, po!y(A*>RNA "was 
lesorved by electrophoresis on 1J% formaldehyde agarose gels, stained witb 
ethidinm bromide, and transferred to a nylon membianc The membrane was 
probed with a ^P-labelcd lactoferrin cDNA derived from human uterus (nu- 
cleotides 718-1654; accession no. S52659) using PCR techniques, followed 
by cloning into pGEM-42 (Promega, Madison, WI). In order to insure that the 
quality arid quantity of RNA analyzed by Northern blotting was equivalent 
between control and treated groups, the Wot was probed simultaneously for 
glycejaldehyde-3- phosphate dehydrogenase. 

. Statistical Analysis. Values are presented as means ± SC. Differences 
between the zona basalis and functionalis were tested by the two-tailed 
Student's r tesL * 

J«ESULTS 

Immunohistocbemical Analysis of Lnctoferrin Protein 

Normal Cycling Endometrium. JmrauDohistocbenrical studies of 
normal cycling human endometrium localize lactoferrin protein pre- 
dominantly lo ibe glandular epithelium deep in the zona basalis and 
not to the functionalis (Fig. 1A> The association of lactoferrin protein 
expression with the zona basalis is statistically significant. (P < 0.001; 
Fig. 2). Two to 56% of the glands express lactoferrin at any given time 
during the mcnsiruaj cycle. U ; ithin positive glands, lactofenin protein 
immunolocajizaiioD is heterogeneous in that positively staining epi- 
thelial cells are interspersed with cells negative for lactoferrin expres- 
sion. No appaicDl differences in morphology, PCNA, ER, or PR 
expression ate seen lo account for the heteiogeneous pattern of intra- 
and inietgland lactoferrin expression in normal endometrium. Similar 
to our previous findings in mouse uterine epithelial cells, the positive- 
staining glandular cells of the human endometrium deir>onsuaie two 
distinct iromunostaining patterns foi Jactofenin, cytoplasmic and nu- 
clear {Fig. }B% seen 'with both formalin and Bouin's fixation. In 
evaluating the tempotal expression of lactoferrin, there is a uend 
towards more glands expressing lactoferrin during the scoetojy 
phase. Because ol the large variance, tbe trend is not statistically 
significant. As expected, tbe polymorphonuclear leukocytes in the 
endometrium also stain intensely fox Jaclofenin protein, which is 
stored in their secondary granules (Fig. 3). These results demonstrate, 
foi the first lime, ibai lactoferrin protein is expressed in tbe human 
eitdomtuiuro predominately by polymorphonucleai leukocytes and 
epithelial cells of glands located deep in tbe 2ona basalis. 

Proliferative Endometrial Disorders: Hyperplasias and Adeno- 
carcinomas- lrnniunohistochcmi5iry reveals that the expicssion of 
jarrr-frrris /vr-Jrin i' incrftasrd in 66.6% (S n/ 32) f>f Jhr. tDdnwetriaJ 
ad enocnici nomas examined. In one- half of these cases (4 ol 12), 
bctofexriD is intensely expressed by malignant epithelial cells 
throughout the entire tumois (Fig. 4). The oihcr four adenocajcinomas 
demonstrate inCKasrd staining for lactofenin in concentrated regions 
ol the luinois. In all tigbl cases where lactofenin expression is 
elevated, the cells expressing feciofcirin have one similarity ^"itb 
norma) positive gjands in that ihcy demonstrate heterogeneous stain- 
ing ol inteispersed positive and negative cells. However, the expres- 
sion ol iactoieiiin bv tbe malignant cells cl:a:!y dilirrs fiom normal 
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Fit 3- Localization oi Jaoofcrrin protein id normal cycling cndorotlnum by immo- 
' nohisiocbemisrjy using a specific pciyclorwl antibody. Our analysis rtvepls th»l laciof- 
cmn protein b prtstnlin » limited numbcJ of glands locsrtd in the zona baselii c-I the 
endometrium MJe X JOl Also, note that hcicfcrnD is beierofrencouj vittin j>ositjve 
jlandi, cells staining foi iactoftnin mc inloxpciscd wiifa negative- staining cpiibcliBJ 
cells throughout the s^nd. TVo in>monohij:ocbcmical staining p»tltros arc noted tor 
I&ciofcrrin in normal uterine epithelium. In ont pattern, Uciofcrrin protein is imrounoJo- 
caiized primarily ovct the cytopUsra (JffJ), and in tbe oibtr, ibe sttining is seen over the 
nucleus (B2); X 40. 



positive glands in that the lactoferrin is not limited lo the basal regions 
of the tumors, many more ceDs are positive, and tbe relative intensity 
of the staiDing ovei individuals cells is increased. Although increased 
lactoferrin expression is associated with malignant transformation, we 
do not find a correlation between laclofenin protein presence and tbe 
stage, nuclear, grade, Federation Internationale des Gynecologist.es et 
Obstctristes grade, oi the depth of myomerrial invasion in ihe 12 
tumors studied (Table J). In sharp contrast to the common dys regu- 
lation of lacloferrin expression found in the malignant endometrium, 
only 1 of 10 endometrial hyperplasia specimens evaluated contained 
an increased number of cells staining Jot lactoferrin. The hyperplastic 
specimen overexposing lactoferrin was read as complex without 
atypia. 

In Siru and Northern Analyses of Lactoferrin mRNA 
Expression in Normal and Malignant Endornr trium 

To further our ur}de; standing of !hc location of lacioierrin protein 
synthesis in the human endometrium, we examined laclofenin mRNA 
expression by in situ and Northern hybridi2arion using specific 33 S- 
labeled probes lot human lactoferrin. No detectable RNA hybridiza- 
tion is observed in the normal endometrium by in shu hybridization, 
even in the presence of iroiminode I enable piotein, usins equivalent 
hybridization conditions and development limes as used for tbe ade- 
nocarcinomas. Consisieni with ihc Jesuits obtained by in siiu bybrid- 
i?.;iiion. long csposuic lirnei- were, reqirijed to dtmonfirair laciofenin 
mRNA in normal endometrium by Northern blot u.ring poly(A*)- 
mRNA. This indicates that laclofenin mRNA is prcsrni in normal 
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tissue but in very low levels, consistent with ibe very limited pattern 
of piotein expression in normal endometrium. BquivaJenl RNA load- 
ing and quality for each specimen was demonstrated by etbidiwrj 
bromide staining of the RNA gels (data not shown) and by probing for 
the housekeeping gene glyceraldebydeo-phosphate dehydrogenase 
that does not fluctuate significantly with tbe metabolic state of the 
tissue. A representative Northern blot 5s shown in Fig. 5. In situ 
hybridization with several adenocarcinomas reveals that there is a 
direct correlation between the Ionization of lactoferrin mRNA and 
the immunos tain ing of expressed Jaaoferrin protein (Fig, 4). Lactof- 
errin mRNA is not associated with polymorpbonucleaj leukocytes by 
in sim hybridization in either normal or malignant tissue. - 

Western Blot Analysis 

To confirm the specificity of tbe antisera that wc used for imrm> 
nomsiocneraisrry, we perfonneJ Western 6/or aoa/ysfc on proteins 
extracted from both normal and malignant endometrium which were 
separated by 8.5% SDS-polyacrylamide gel electrophoresis (Fig, 6), 
Iromunobloning identified a single broad protein band with a molec- 
ular weight between 70,000- 80,000 in both normal and neoplastic 
endometrial tissue bomogenates, consistent wilh tbe reported molec- 
ular weight of human lactoferrin. Supporting tbe imjnunocytod>ern> 
cal analysis, a representative iramunoblot clearly demonstrates that 
tbe proportion of protein that is lactoferrin is markedly increased in 
tbe aderwc^cinomas'in comparison to the normal endometrium. Tbe 
molecular weight of lacioferrin in the adenocarcinomas appears to 
have a slightly higher molecular weight that the predominant form in 
normal tissue. 
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Ft£. 3. Po)vTDDiphonucIcir leukocytes (orr^o) art scattered ihiougbooi ihc cndomc- 
rnuro wxJ yain iniejjsery tor fcrtolc rrin. JLaciottxrin is » knoM-n component of ibc 
irconduy gianuJu in pol/rDorphonucloi leukocytes. Tbt presence at » segmented 
milieus and lacioftmo p»oit»i » tn excellent method for identifying ibis group of 
iufismmBtorj cells. ^ X 20. B, X 60. 



Correlation of Lactoferrin Expression with the Expression of 
PCNA, Ki-67, HEJt-2/»ru, ER, and PR 

In an attempt 10 characterize the phenotype of endometrial cells 
which express lactoferrin, wt performed imrounohistocbemistry on 
serial sea ions for (he Ki-67 antigen, PCNA, HER-2/nru, Jacioferrin, 
ER, and PR. In normal tissue, Ki-67 and PCNA expression are cell 
cyde- specific markers of cell proliferation (16, 17). Upon analysis of 
normal cycling endometrium, do relationship between lactoferrin pro- 
tein expression and ER, PR, or Ki-67 expression was observed. 
Similarly, in most of the adenocarcinomas evaluated, no relationship 
^as Doled between lactoferrin and VCl^A pjoiein expression. How- 
tver, in one adenocarcinoma (Fig. 7), there was a clear inverse 
i e)a i ions hip seen between laciofenin and PCNA JocaJization, which 
w as present throughout the entire tomor. Most dramatic, however, was 
a striking inverse correlation seen between Jacioferrin and PR expres- 
sion in 8 of 8 PR- positive uterine adenocarcinomas (rig. 7). Two 
tumors negative for PR aJso did not express lacioferrin. Although an 
inverse correlation was also suggested wjth HER-Z'nru and PR, (he 
inverse correlation was more precise with lacioferrin in these turnois. 

DISCUSSION 

In the mouse uterus, lactoferrin is an est) op en* induced uterine 
secretory protein that is presort throughout the epithelium (7), and it 



is expressed concomitantly with epithelial cell proliferation. In con- 
trast to lactofeirin's ubiquitous expression in the estrogenized mouse 
uterine epitbelhtjn, Jacioferrin protein is limited to glandular epithelial 
cells in the basal regions of normal bum an endometrium and usually 
to glands that were directly adjacent to the myometrium (i^ r the 
deepest glands of the zona basalis). This regional localization of 
lactoferrin . expression is not surprising in that other biochemical 
parameters have been reported to show site srpecifirity in primate 
endometrium. These parameters include ibe proliferative index and 
the expression of the secretory component of IgA (18). Similar to our 
observations in the endometrium, lacioferrin is also expressed region- 
ally in the mammary gland, b bovine breast tissue, lactoferrin is 
localized primarily to the basal alveolar cells (39); whereas in human 
breast tissue, the ductal epithelium appears to be the primary source of 
secreted lacioferrin during Jactation (20). 

Examining the endometrium on different days of the menstrual 
cycle demonstrates a trend towards increased lacioferrin expression 
during the luteal phase. Although this data is not statistically signif- 
icant, the cyclic variation may be biologically relevant Kim ct oL (21) 
recently reported that the basal endometrial epithelial cells are unique 
because they proliferate during the postovojatory luteal phase. Inter- 
estingly, in the mouse uterus, theie is a diiect correlation between 
lacioferrin expression and epithelial cell proliferation (6). There/ore, 
lactoferrin may have a similar role in the human and mouse endome- 
trium, if lactoferrin expression is- cyclic, the ERE in the 5 '-flanking 
promotej region of the human lactoferrin gene may be activated 
during the luteal phase (30, 31), 

Another similarity between human and mouse uterine lactoferrin 
expression is the observation of two imrounohislccbemicaJ staining 
patterns. In one pattern, the antiseia binds primarily over the cyto- 
plasm, and in the other, the nucleus is the primary site of localization. 
Although this could represent a fixation artifact, we have now ob- 
served this pattern in two species and under different fixation condi- 
tions. It has been demonstrated that signaling peptides, i.e.. platelet- 
derived growth factor (22), lnl-2 (23), and probasin (24), can be 
selectively directed into the nucleus, cytoplasm, or secreted. It is 
believed that this differential processing may allow proteins lo have 
inlracrine, autocrine, and paracrine roles, depending on the physio- 
logical stale of the cell. The two locaTrzarion patterns observed suggest 
thai laciofetrin might have signaling sequences that direct the final 
destination of the mature peptide. 

In normal endometrium, lactoferrin mRNA is present but in very 
low levels. Prolonged exposure times are needed lo visualize the 
mRNA band with Northern analysis. Although with in siiu hybrid-. 
i23tion the lactoferrin mRNA signal is easily seen in adenocarcino- 
mas, we failed to localize the lactoferrin mRNA in normal endome- 
trium using an equivalent exposure time. These low levels of mRNA 
in normal endometrium suggest thai the synthesis and degradation of 
lacioferrin rnRNA is more tightly regulated in normal tissue than in 
the adenocarcinomas. Because of the low levels of message in nor mal 
tissue, we are unable at this time to definitively condvdt that lacto- 
ferrin mRNA is synthesized by die same epithelial cells which express 
the pjotcin. 

In endometrial auenoc.arcinomas, malignant transformation of the 
endometrium is associated with the up-jegulaiion of lactolerrin 
mRNA and piotein biosynthesis. The up-regulation at the RNA level 
is demonstrated by sn increase in sleady-siate RNA levels using both 
in siiu hybridization and Northern analysis techniques. In ihcif. can- 
cers, we also observe an increase in the number of lactoferrin- positive 
cells, which express both the protein and mRNA. In this study, 8 of 
the 12 adenocarcinomas evaluated ovcrcxpress lacioferrin, compared 
with only ] of 10 hyperplastic specimens. The form of lactoferrin 
protein extracted from endometrial adenocarcinomas appears to have 
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Kg. 4. CoJocaliuiion of Hcloftnin pro* tin (Ufi panth) *nd 
niRNA (right poneh) in a normal proliferative endometrium ind 
endometrial idtuocardooinas by performing unmunoMstocbcjDb- 
try and m iitu RNA hybridization on serial sections. Dual analysis 
Df protein sod mRNA expression reveals thai glands in nonn&J 
' endometrium do not have detectable roWlfA, *s measured by m jtw 
hybridization (w/r pojitU\ X 40% whereas analysis of tbe adeno- 
carcinomas clteily demonstrates a direct condition bctwteD pro- 
It in and RNA expression loi laciofenin (middle and botiom pontfr, 
X-JO). Note ihei laaofmin pjoieio and roRNA Is distributed in > 
hettrogcBeoro palters in ihc epithelial cells of (be adenocaicinomas. 
As is shown in Bg, 1, » beieiogeneojis staining pattern for Iaclol- 
crrin protein is iko seen fjequcDlly fa normal endometrium. 
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a slightly higher molecular weigh! than ihc protein present in normal 
tissue. This could be due to alterations in the processing of the 
lacioferrin ntRNA, protein, or glycosylation by. Ibe malignant cells. 
Alternatively, tbeje could be minor differences between lactoferrin 
protein which is present in neutrophils and the form synthesized by 
uterine epithelial cells. We suggest two hypotheses to explain lactof- 
errin ove j expression in endometrial adenocarcinomas. Ih the first 
hypothesis,, laciofenin biosynthesis is deregulated by the same pro- 
cesses thai lead to the malignant transformation of endometrium. If 
this hypothesis, is true then laciofejrin may be a useful mark er for 
endometrial adenocarcinoma investigation, and fuitber re search is 
needed to determine whether laciofenin plays a contributing role in 
tbe malignant transformation. A second hypothesis is that lactoferrin- 
positive human endometrial adenocarcinomas evolve from tbe clonal 
expansion of cells lesiding in ibe legener alive zone (zona basalis) of 
normal endometrium. It is interesting to speculate that lactoferrin 
expression in endometrial cancer may be linked to estrogen action in 



some way, since proliferative disorders of human endometrium are 
Jinked to cbj on »c estrogen exposure over several years and sequencing 
data suggests that the promoter for the human lactoferrin gene does 
contain an ERE. 

Although the function of laciofenin is unknown, a variety of 
biological roles have beeD proposed for lactoferrin which could link 
this protein to a role jd cancer, including the regulation of DNA 
synthesis (25-29), modulation of the immune response (25, 30, 3)), 
and iion transport (32). Some forms of laciofenin are jeportcd to have 
RNase activity (33, 34). Sea e ted RNases are involved in develop- 
ment, reproductive function, neoplasia, angiogenesis, and unmiiBC 
suppression. (35, 36) If angiogeDesis and immunosuppression are 
components of laciofenin RNase activity, these properties could pro- 
mole tumor growth. 

In the endometrial adenocarcinomas, we observed a heterogeneous 
expression pa Hern for laciofenin, PCNA, Hei-2//ieu, ER, and PR- 
With regard to prognosis, patient survival is reponedly worse when 
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antiscia KJD6S. and laclo/crnn wis dritcied by a specific polyclonal antisua. 
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LACTOFTRRIN DYSREGULATIOH IN ENDOMETMAt CANOE* 
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Tigr 5. Northern analysts of laciofrfnn tnRTM rjrpressioB con/i/ms that Mtdomrtnal 
adenocaiciDoniDS (jUmj J and s>£nj'6cmriy over expi ess the 2.5-kilobase uanscnpi of 
human Jacioferrin is comparison to nnrroa] endometrium (limn J and 2). The Northern 
hvbikJi2*tion dan supports iht is iim RNA results and confirm s thai )»ctoicnin RNA 
expression is dysrejylatrd in uterine ademicaicinornai. Norma) uterine tissues appe~w to 
contain, low Modj-sUU RNA levels oi iBCtojtrrin, reflecting a controlled patirni of 
protein cypreisioo. Equivalent RNA loading and quality loj eacb specimen was demon- 
strated by ethidlum bromide staining oi the RNA gels (dai* not shown) and by probing /or 
t housekeeping gene (gjycrraldchyde- 3- phosphate ccbycrogenasc) that dots not fluctuate 
significantly with tbc metabolic state of the tissue. 



endometrial adenocarcinomas Jose sex steroid receptors (37, 38)» have 
a higher proliferative index (39), and demonstrate DNA aneupjoidy. 
During the tumor progression of endometrial adenocarcinomas, it 
appears that the loss of steroid hormone receptors occurs earlier than 
either the increase in proliferation rale or the development of DNA 
aneuploidy (40). In our study, we note a striking inverse correlation 
between the expression of lactoferrin and 'PR in the endometrial 
adenocarcinomas. An inverse relationship also bas been described for 
}lER~2Jneu and PR in endometrial adenocarcinomas that correlates 
with patient prognosis. Furthermore, in cancers of the human endo- 
metrium, ovary and breast Bcr-2//iew expression bas been associated 
with advanced disease and poor survival (41-43). HER-2/new is an 
oncogene that shares sequence homology with the epidermal growth 
factor receptor and is speculated to contribute to aberrant growth. Of 
note is that lactoferrin biosynthesis in the mouse uterus is associated 
with the expression of the epidermal growth factor. Like HER-2/neu, 
the epidermal growth factor receptor is also frequently over ex pressed 
in PR-negative cells of endometrial adenocarcinomas (44). The am- 
plification of growth factor receptor expression in PR-negative endo- 
metrial adenocarcinomas may be associated with the acquisition of 
growth autonomy and hormone independence, which may contribute 
to the poorer prognosis of PR-negative endometrial carcinomas (45). 
Some endometrial adenocarcinomas, including recurrent rumojs, can 
be treated successfully with progesterone therapy (46, 47). Although 
the significance of the inverse relationship between lactoferrin and PR 
expression is not fatown, we speculate that the PR-negative cells do 
not undergo ihe normal gjowth inhibition and secretory differentiation 
normally associated with progesterone action. 

A survey of human tissues reveals that lactoferrin is expressed by 
most normal mammalian exocrine glands and rosy be a prognostic 
marker. in tumors (20). Lactoferrin is found in normal ductule breast 
epithelium and in primary breast carcinomas- In breast luroojs, there 
is an inverse correlation between laclofenin and ERs (20). Notably, 
lactoferrin expression in breast cancer may fall into the same category 



A. Immunoblot 



B. Coomassie Blue 
Protein Staining 



Tig. 6. Wcsitrn blot anaJysis using an antiserum 
ipeciBe fot human lactoferrin w^a performed on 
proteins can acted horn normal" and mtJignani cd- 
douietrium and separated by SDS-polyacrylamidc 
fcl electrophoresis. A bioad protein baod »^ib a 
tnolccwjai vcifht between 70,000 and 80.000 is 
delected in both norma) and neoplastic endometrial 
tissue, hornofrnatei, con; >< tent with the it pon td 
moitculai weight oi human lactoferrin (A\ Tbc 
most sign ifi cant observation from the immunoblot- 
tinj studies is that laclolerrio ptoleir. is maifcedly 
rkvai^ rf i n iht adenocarcinomas, in comparison to 
normal endometrium, which suppotts tbc immvno- 
fytochctnicai anafysij thst deraopst:aUs t »iesui 
Dumber oi ctJJi positive lot terioJenio piotrin in 
the uterine tumors. B. ihe relative 2 mount of protein 
loaded in car h Iaj>e by Coomassie blue siainin^ 
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F>g- 7. Coinkticm of laaof min protein t*prt> 
simi with the PCNA, a marker of pToIHeraiion, and 
PR »s measured by iinJTmnohretocheinis&j per- 
formed on serial section* X 2D. In most cases, no 
relationship bttwtCD lacioicrrin apd PCKA "pjcv 
sioa is lound in either normal or maJijnarrl tftdo- 
mtlmrm (dtfa not shown). Howevtf, in 1 of 12 
»daiocarcmoma$, in Invent conthpan a seen 
bcrweat lactofemn (A) and PCNA (B) localiza- 
tion. This suiting w « consistent through- 
out the entire rumor, suggesting tbt possibairy of 
c=H cycle regulation of iKtofenic cxpressjon r» 
tfiis aoerrocardnoroa, Tbe boncm panels exhibit as 
inverse conthlioo between betofenin (C) and PR 
expression (DX ~>>k* w » ^ in aI1 ^ Cnd °" 
; menial adetrocarcinornw which t*prfssed PR. 
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as tbe otbei makers for ER-negative rumors, such as amplification of 
EGF receptor, HER-2/nr", and transforming growth facloi a expres- 
sion, which are associated with pool piognosis. I» gastric carcinomas, 
JactoJcniD expression is associated with transformation of specific 
cell types including intestinal-type carcinomas, adenomas, and incom- 
plete intestinal metaplasias (28). Although this is complete specula- 
tion at this Time, pejhaps iactofenin oveiexpression in the various 
malignancies may complement the actions of the growth factoi path- 
way molecules and contribute to tbe autonomous growth of these 
minors. 

In conclusion, oni studies icveal that Iactofenin is associated 
witb a unique population of epithelial cells in ibt zona basalis and 
that Iactofenin ovcreapiession may be associated with malignant 
information of the human endometrium. Further studies are 
needed to elucidate the jolt of lactoferrin in normal and patholog- 
ical endometrial physiology. 
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Cell proliferation in human soft tissue tumors correlates with platelet- 
derived growth factor B chain expression: an immuuohistochemical and in 
situ hybridization study. 

Wang J, Coltrera MP , Gown AM . 

Department of Pathology, University of Washington, Seattle 98 195. 

mn^ h ° r f tCSted the h yP ofhesis the B chain of the platelet-derived "growth factor 
(FDGF), a known connective tissue mitogen and growth factor, could be expressed by 
human soft tissue tumors/and that its expression could play a role in the control of cell 
proliferation in these tumors. Using a set of 56 soft tissue tumors, including benign 
tumors and all three grades of sarcomas, PDGF-B chain protein was localized using 
immunoh.stochemistry and PDGF-B mRNA was localized using in situ hybridization 
The hypothesis that PDGF-B expression was related to cell proliferation was tested by 
simultaneously demonstrating the expression of the proliferating cell nuclear antigen in 
SS?! ^ sec | ions of the same tumors/Sixty and 82% of tumors had demonstrable 
PDGF-B mRNA and protein, respectively, with a strong correlation between their 
degrees : of expression (P = 0.0001). Among the sarcomas, a strong correlation between 
pr£r » eXpreSS,0n 31,(1 ^casing malignant tumor grade (P = 0.006), and between 
r LKjb-tt expression and increasing proliferating cell nuclear antigen index (P = 0 01) 
was tound. All tumors were also demonstrated to express the beta receptor of PDGF via 
unmunohistochemistry. These studies suggest that PDGF-B expression may be an 
uimqrtant mediator of cell proliferation control, via an autocrine mechanism, in human 
sort tissue tumors and may correlate with clinical outcome in the sarcomas. 

PMID: 790391 1 [PubMed - indexed for MEDLINE] 
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Expression of cadherins and catenins in paired tumor and non neoplastic 
primary prostate cultures and corresponding prostatectomy specimens. 

DeFlavia P Dhir R. Becich MJ . _ ..omasA , 

Department of Patliology, University of Pittsburgh Medical Center, PA, USA. 

Cadherins are a family of transmembrane proteins that play a crucial role in cell : 
d.fferentiat.on cell migration, and intercellular adhesion. Cadherins are associated- with 
catenins through their highly conserved cytoplasmic domain. Down-regulation of E- 
cadhenn protein has been shown in various human cancers. This study examined the 
express..™ of cadhenns and associated catenins at the mRNA level. Paired tumor and 
nonneoplastic primary prostate cultures were obtained from surgical specimens 

£^ WP J" "T™ r f VerSe ^^e-polymerase chain reaction 
(QMF RT-PCR) and quant.tat.ve analysis were performed and correlated with 
immunostain results. Six of seven cases of neoplastic cultures showed moderately-to- 
markedly decreased levels of E-cadherin and P-eadherin mRNA. Similar losses of alpha- 
catemn and beta-caten.n mRNA were also observed. The results of QMF RT PCR 
showed good correlation with the results of immunohistochemical studies based on 
corresponding formalin-fixed sections. In conclusion, this paper presents a coordinated 
dow„-regulat,on m the expression of E-cadheri„ and associated catenins at S£25a 
and protem level m most of the cases studied. This down-regulation may play an 
important role in the pathogenesis of prostate cancer. 
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Abstract Cadherins are a family of transmembrane 
proteins that play a crucial role in cell differentiation 
cell migration, and intercellular adhesion. Cadherins arc 
associated with catenins through their highly conserved 
cytoplasmic domain. Down-regulation of E-cadherin 
protein has been shown in various human cancers This 
study examined the expression of cadherins and associ- 
ated catenins at the mRNA level. Paired tumor and non- 
neoplastic primary prostate cultures were obtained from 
surgical specimens. Quantitative multiplex fluorescence 

D^Ao? nSCriptaSC p ° lymerasc chain taction (QMF 
K I -rCR) and quantitative analysis were performed and 
correlated witli imraunostain results. Six of seven cases 
of neoplastic cultures showed moderately-to-markedly 
decreased levels of E-cadherin and P-cadherin mRNA 
Similar losses of a-catenin and ^-catenin mRNA were 
also observed. The results of QMF RT-PCR showed 
good correlation with the results of immunohistochem- 
icaJ studies based on corresponding formalin-fixed sec- 
tions. In conclusion, this paper presents a coordinated 
down-regulation in the expression of E-cadherin and 
associated catenins at the mRNA and protein level in 
most of the cases studied. This down-regulation may 
play an important role in the pathogenesis of prostate 
cancer. 
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Prostate cancer is the most common malignant tumor 
and the second leading cause of cancer death in men 
The - clinical course of patients with prostate cancer 
varies widely, and different factors contribute to this 
marked clinical variability, including genetic back- 
ground, hormonal environment, and the invasive 
potential of the tumor. Invasion and metastasis are the 
hallmarks of malignancy and have been closely Jinked to 
alterations in ce)Mo-celt adhesion, cell migration and 
interactions with extracellular matrix components {22). ' 

Cadhenns are a family of transmembrane glycopro- 
teins responsible for maintaining the integrity of tissue 
and are involved in cell differentiation, cell migration 
and intercellular adhesion through a calcium-dependent 
mechanism characterized by homotypic adhesion [35- 
37). Their highly conserved cytoplasmic domains asso- 
ciate with catenins, a group of intracellular proteins that 
mediate contact between the cadherins and the micro- 
filaments of the cytoskeleton. Each cadherin subclass 
shows a unique tissue distribution: E-cadherin is pre- 
dominantly expressed in epithelial cells and P-cadherin is 
restricted to decidua tissue and the basal or lower layers 
of stratified epithelium (30). 

The accumulating evidence suggests a decrease or loss 
of function in E-cadherin and P-cadherin in several 
human carcinomas (3, 9, 23, 32). Loss of heterozygosity 
(LOH) at chromosome 16 in the location of tbe E-cad- 
herin gene is present in a high percentage of prostate 
cancers [6, 2 1, 26, 28J. Decreased expression of E-cadherin 
is seen m various human malignant tumor cell lines, and 
the level of decrease correlates with the invasive potential 
of the tumor cell lines [1, 9, 12, 24, 40, 42). In addition, 
many, but not all, immunohistochemical studies using 
formalin-fixed, paraffin-embedded tissue have shown that 
the E-cadherin prolein is decreased in prostate cancer and 
the decrease is correlated with tumor grade (4, 8, 13, 27, 
38). Other studies have shown decrcased-to-absent 
P-cadherm levels, but variable E-cadherin levels [33]. 
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a- and ^-catenins bind the cytoplasmic domain of 
E-cadherin and link it to the cytoskelelon [16, 3 ]]. Down- 
regulation of expression and deletion of a-calenin genes 
were identified in several human cancer cell lines [19, 24, 
42]. Immunohistochemical studies showed decreased 
ctiatenin staining/which correlated well with the loss of 
E : cadherin staining and patient survival [25, 29, 39]. In 
addition, decreased 0-catenin protein expression appears 
to be associated with malignant transformation of epi- 
thelial tissue [34], These results suggest that cadherins 
and catenins may function as tumor invasion- suppressor 
genes. However, most of these results were obtained in 
studies using transformed tumor cell lines and formalin- 
fixed, paraffin-embedded tissue. To our knowledge, 
no studies to date have examined the co-expression of 
cadherins and catenins at the mKNA level using material 
derived from surgical specimens. 

Recently, we have developed the methodology to 
cultivate primary epithelial cells under defined condi- 
tions from surgical prostatectomy specimens [20]. Areas 
of both carcinoma and noh- neoplastic tissue are identi- 
fied grossly, verified histologically, and then cultured 
separately, resulting in paired primary cultures of both 
non-neoplastic and neoplastic epithelium from the same 
patient. The non- neoplastic tissue cultures serve as an 
important control of any person-to-person variability in 
the expression of the genes of interest. 

In this study, we investigated the co-expression 
of cadherin and calenin mRNA. from multiple paired 
primary prostate cultures derived from surgical prosta- 
tectomy specimens using quantitative multiplex fluores- 
cence reverse transcriptase-polymerase chain reaction 
(QMF RT-PCR) [41, 44). Immunohislochemistry stud- 
ies of cadherins and catenins were performed on the 
corresponding paraffin-embedded prostate tissue. 



Material* and methods 

Tissue specimen and primary prostate cultures 

We sludied patients with adenocarcinoma of the prostate who 
underwent prostatectomy at the University of Pittsburgh Medical 
Center during the lime interval January J996-January 1998. No 
patient had received treatment prior lo surgery. Fresh prostatec- 
tomy specimens were sectioned and grossly examined, representa- 
tive samples of neoplastic and non-neoplastic tissue were collected, 
and the diagnoses were confirmed by examination of hematoxylin 
and cosin (H&E) -stained sections. Epithelial cells from lumor and 



non-neoplastic areas of the specimen were isolated and then cul- 
tured with a chemically defined medium (CDM) without addmon 
of growth factors as previously described. Tissue fragments were 
cut into small pieces and underwent a xrks or collagenous diges- 
tions. Following each digestion, the cells were pelleted by centri- 
fugalion at 1,000 rpm for 4 M1N [20). To selectively promote 
epithelial cell growth, the pellets were resuspendcd.and mamUnned 
in serum-free CDM supplemented with epithelial growth factor 
(EGF) [20). The culture's morphology was examined dairy, and 
epithelial cells were allowed to grow until confluence was reached, 
between days 7 and 10. 

Isolation of total RNA and synthesis of cDNA 

Total RNA was extracted from the cultured primary prostate 
epithelial cells at first passage according lo the Trixol somtion 
(Gibco BRL, Rockville, MD) modified method of Chomczynslti 
and Sacchi [7). The RNA was then quamiutcd spectrophotomel- 
rically. Two micrograms of total RNA were used for first strand 
cDNA synthesis using oligo-dT primers and MMLV reverse tran- 
scriptase (Gibco BRL> Rockville. MD). 

PCR primers and quantitative multiplex fluorescence PCR 

PCR primers for hepalocyte growth factor (HGF) and c-myc 
were synthesized as previously described |I7). PCR primers for 
E-cadherin, P-cadherin, a-calenin, /1-calenin, and p-aclin cDNA 
were designed according to cDNA sequences provided by the 
GeneBank (WVyH^ ncbi.nlm.gov/gcnebank/query). The P™*** 
were selected to amplify 150-250 bp target genes and the PCR 
products from each target gene were designed lo have a different 
size (Table 1). The reverse primers were synthesized with fluores- 
cein molecules covalently attached lo the 5' end (BRL, RockvDle, 
MD). Twenty-five microliters of PCR reactions for QMF-PCR 
. contained primers (20 uM each), cDNA corresponding to 50 ng of 
total RNA, dNTPs, and reaction buffer. The reactions were am- 
plified for 21 cycles at 94 °C for I min, 57 "C for 2 min; and 72 'C 
Tor 2 min. Five microliters of QMF-PCR reactions were routed wrib 
an equal volume of sequencing gel loading buffer, denatured, and 
aliquots were elecirophorcsed on an AB1 373A automated sequencer 
(Applied Biosystcms, Foster City. CA) using a matrix specific for 
fluorescein (Fig. I A). 

Automated sequencer gels were run for 6 h at 30 W using 
Genescan software (ABI. Foster City, CA). Lane assignments and 
areas of the peaks corresponding lo fluorescent peaks were assigned 
and quanutaled by the Genescan software using Pholomumpber 
tube (PMT) voltages (Fig. IB). All experiments were done m 
triplicate and the results presented as means and standard devia- 
tions (SD). 

Antibodies and immunohislochemistry 

. Five-micron sections were obtained from formalin-fixed, P*™ m *~ 
embedded tissue blocks. They were deparanWztd and ^atcd 
with graduated elhanols. Slides were then microwaved in MM 



T»We 1 Summary of PCR 
primer sequences and PCR 
products 



Primer Sequence 



Target gene 



Size of PCR product (bp) 



8 
9 
10 



Forward cccacactglgcccatctacg 
Reverse gcttctccttaalgtcacgc 
Forward caaagtgggcacagatggtgig 
Reverse cigcltggaltccagaaacgg 
Forward gcaagagccagctctgittagc 
Reverse acligagctgaitcagtctclgg 
Forward gatggacaaclalgagccagg 
Reverse tataccaggcgggaagcaicg 
Forward ttctgglgccaclaccacagc 
Reverse igcaigccctcatclaaigtc 



/t-Aclin 

E-cadherin 

P-cadherin 

a-Catenin 

0-Catenin 



170 
190 
220 
182 
218 



3J0 



220 ty 



E-cxJbtnn 
IWbp 1 



F»g- 1 A Genescan image of 
QMF RT-PCR or five paired 
non- neoplastic and neoplastic 
primary prostate cultures. 
1 cDNA corresponding to 50 ng 
of total RNA was subjected to 
21 cycles of PGR, and all 
reactions were done in tripli- 
cate. RNA of both E-cadhcrin 
and P-cadherin showed moder- 
ate-to-marked reduction in pri- 
mary neoplastic cultures in four 
or five cases (Cases 1, 2, 4, and 
5). B Representative automated 
sequencer traces (ABI 373A) 

from 21 cycles of QMF 
RT-PCR using cDNA from a b 

paired non-ncoplasttc (lop 

pane!) and neoplastic (bottom • 

pantt) primary prostate cultures 

(Case 2\ and copy number of DOO-ocoplasbc 

PCR products (at-actin, 

E-cadherin; and P-cadherin) 

was calculated using peak areas. 

Neoplastic culture showed 

markedly decreased expression 

of both E-cadherm and 

P-cadherin 



<> du'J a a y ' * « '.!*' , :j , u : i;/; 1| . i , . 



bcta-Actin 
<170bp) 



E-cadhcrin 
(l»bp) 



P-odhcxin 

(220bp) 



VJ 



neoplastic 



clnl buff " fP H r 60 > retrieval. The avidin-biotin 

ffir^-^'T^ 11 ^ wa$ "'*«d from 
pS >yJ^ Atones (Burlmgame, Calif, USA). The E-cadherin 
H^fi . n ^ alCDm ' 3nd ^ Cattnin antib °dies ( all mouse mono- 

Evaluation of immunosiaining 

The percentage of positive cells, intensity of the staining, and eel- 
Mar location of the staining WCf e examined by two independent 
observers usmg normal prostate epithelium from the same speci- 
men a S an internal control. Thr intensity of the signal was graded 
* s rong, moderate, weak, and negative. The staining pattern of 
ihe tumor was compared with that of normal epithelium from the 
same specimens. 



Results 

Expression of E-cadherin and P-cadherin 

Since many prostate cancers are known to be histolog- 
ically heterogeneous, adjacent H&E-stained sections 
ol the tissue fragments sent to culture were reviewed 



lo assess tissue homogeneity and to rule out the presence 
of other diseases. The paired primary cultures for this 
study were selected based on the following histologic 
features: (I) non-neoplastic tissue showed no cancerous 
foci or high-grade prostatic intraepithelial neoplasia 
(PJN); (2) neoplastic sections contained less than 5% 
non-cancerous epithelium. Seven out of 38 pairs of 
cultures met the selection criteria and were included in 
this study. Histologically, all seven cases were moder- 
ately differentiated adenocarcinomas, with Gieason 
scores ranging from 5 to 7 (median = 6). In six of these 
cases, adjacent sections of tumor contained no benign 
prostate epithelium, and in one case (specimen 4), there 
was less than 5% non-cancerous epithelium. Prostate 
stromal cells express HGF, whereas the epithelial cells 
express c-mel, the receptor for HGF [17, 20). The cases 
included in this study showed no detectable HGF 
expression after 21 cycles of QMF RT-PCR (data not 
shown) . This indicates there was do significant stromal 
cell contamination in the current epithelial cultures. 

QMF-PCR is an accurate method of measuring the 
relative levels of mRNA in small tissue samples [41]. Jn 
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: this study, we used this method to quantitate the mRNA 
levels of E-cadherin and P-cadherin, relative lo /t-actin. 
All the RNA samples contained no DNA contamina- 
tion, as shown by the absence of automated signals when 
RT was omitted from the RT-PCR reactions. We ob- 
served the expected linear increases in /t-actin, E-cadh- 
erin, and P-cadherin signal intensities between cycle 
numbers 18 and 24, with a cDNA input corresponding 
to 50 ng of total RNA. The ratios of /T-actin to 
E-cadherin and P-cadherin were constant, as was the 
E-cadherin/P-cadherin ratio. These results indicated that 
the target genes were consistently amplified in the reac- 
tions. Twenty-one cycles of PCR were chosen for all 
subsequent experiments. In addition, the raw sequencer 
quantitation of peak areas for /?-actin using the same 
cDNA input was similar among the paired primary 
prostate cultures (data not shown). 

When compared with the paired non-neoplastic pri- 
mary cultures and normalized with the /J-actin internal 
controls, four of the neoplastic cultures showed marked 
(>85%) reduction of E-cadherin mRNA levels, with 
one case showing no detectable E-cadherin mRNA. The 
other three neoplastic cultures showed mild- to-moderate 
reductions (Table 2). 

Six of seven neoplastic cultmes showed moderately 
lo markedly decreased P-cadherin mRNA levels when 
compared with non-neoplastic cultures. Interestingly, 



the four cases showing marked reduction in levels of 
E-cadherin mRNA, and also demonstrated significant 
losses of P-cadherin mRNA. The case with no detectable 
E-cadherin mRNA also demonstrated near total loss 
of P-cadherin mRNA (Case I). Another case (Case 3), 
with only mjld reduction in E-cadherirr mRNA, showed 
no significant change in Prcadherin mRNA. Genetic 
variations among the patients were evident in the 
marked differences in the baseline levels of E-cadherin 
" and P-cadherin expression seen in the non-neoplastic 
primary prostate cultures. 



Expression of oc-catenin and /f-cateniri 

The highly conserved intracytoplasmic domain of 
the cadherins interacts with a- and 0-catenins, with the 
catenins serving as a link between the cadherins and the 
microfilaments of the cytoskeleton. Six cases of neo- 
plastic cultures showed mild to-moderate reductions in 
a-catenin mRNA levels, ranging from 26 to 62%. These 
cases also demonstrated more severe reductions of 
/?-catenin mRNA levels and generally correlated with 
the changes of cadherins in the same specimen (Table 3). 
Interestingly, the case with no significant change of 
P-cadherin mRNA and only a mild loss of E-cadherin 
mRNA also showed no change in mRNA levels of both 



Table 2 E-cadhcrin and P-cadherin expression in 
lectomy specimens 


paired n on- neoplastic and neoplastic primary prostate cultures derived from prosU- 


Case 


E-cadherin' 




Perceniage 
loss in tumor b 


P-cadherin" 




Percentage 
loss in tumor b 




Non-neoplastic 


Tumor 


Non-neoplastic 


Tumor 


1 

2 
3 
4 
5 
6 
7 


71.0 ± 2.2 

27.4 ± 6.4 
18.0 db 0.2 

93.5 ± 12.0 
41.5 i 3.0 

56.7 ± I t 

53.8 ± 6.1 


ND e 

3.7 ± 0.3 
12.6 ± 1.9 

5.4 ± 0.9 
- 24.4 ± 4.0 

8.1 ± 1.0 
22.2 ± 0.7 


100 
86 
30 
94 
41 
86 
59 


65.7 ± 2.0 

53.4 ± 6.5 

88.8 ± 6.5 
87.8 ± 7.0 

30.5 ± 1.4 
71.8 ± 2.9 
62.7 ± 3.5 


0.8 ± 0.1 
17.8 ± 1.3 

86.4 ± 9.5 

25.5 ± 2.5 
18.1 ± 2.4 
17.1 ±2.2 
19.4 ± 2.2 


99 
67 
3 
71 
41 
76 
69 


* Level of E-cadherin and P-cadherin are normalized with /i-actin from the 


same sample: (E-cadherin or P-cadherm)/actin x 100 



h Percentage loss in tumor primary culture; [(N - T)/N) x 100% 
c ND, not delectable 



T & ble3 a-Catenin and /J-calenin expression in paired non-ncoplaslic and neoplastic primary prostate cultures derived from prosta- 
lectomy specimens , ' 



Case 



Non-ncoplaslic 



45.1 ± 7.1 

49.2 ± 1.6 

48.0 ± 5.9 

61.1 ± 2.5 

58.2 i 1.8 
46.7 ± 4.7 
43.2 ± 2.6 



Percentage loss 
in tumor*" 



p*-catemn* 



Tumor 



Non-neoplastic 



Tumor 



17.3 
36.2 
47.8 
40.8 
27.8 



± 
i 
± 

± 

33.6 ± 4.7 
26.4 i 1.7 



0.5 
4.0 
9.0 
2.6 
16 



62 
26 
< I 
33 
52 
28 
39 



19.7 ± 

21.7 ± 
40.1 ± 

36.8 ± 
29.8 ± 
31.5 ± 
22.3 ± 



3.6 
1.5 
5.3 
18 
2.0 
4.2 
1.5 



ND C 
3.9 i 0.2 
38.9 ± 6.2 

22.1 i 19 

15.2 ± I.I 

13.5 ± 1.6 

11.6 A 14 



a Level of a-catenin and 0-calenin aie normali2td with ^-actin from the same sample: (catenin/aclm) x 100 
b Percentage loss in tumor primary culture: |(N - T)/N) x 100% 
1 ND, not detectable 



Percentage 
loss in lumor 



100 

82 
3 
40 
49 
57 
48 
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a- and /?-catenins. Moderate interspecimen variation was 
observed in the baseline expression of a-catenin and 
0-catenin mRNA levels in the ndn-neoplastic cultures. 



Jmmunohistochemical studies of cadherins and catenins 

In benign prostate tissue, E-cadherin was, in all cases, 
uniformly localized to the membranes of luminal glan- 
dular epithelial cells, predominantly at cell-cell junctions 
(Fjg. 3A). One case of prostate cancer showed complete 
negative staining for E-cadherin (Fig. 3B), and the re- 
maining six cases demonstrated reduced immunostaining 
for E-cadherin, with 25-75% of cancer cells positive 
(Fig- 3C). The cancerous glands generally showed re- 
duced signal intensity and an altered heterogeneous 
stammg pattern, which included focal cytoplasmic stain- 
mg and I reduced membranous staining (Fig. 3C, Table 4). 

Benign prostate tissue/showed uniform basal cell 
immunoreactivily for P-cadherin, with principally cyto- 
plasmic and focal membranous pattern staining. The 

Table 4 Immunohistocheraical expression of cadherins and cate- 
cuhure Pr ° M3,CCl0my SpCcimcnS ^responding lo primary prostate 



benign, glandular non-basal epithelial cells and stromal 
cells were negative for P-cadherin (Fig. 3D). P-cadherm 
immunoreactivily was completely absent in two cases of 
prostate cancer (Fig. 3E), and the remaining five cases 
showed variable focal positivity, which was predomi- 
nantly cytoplasmic (Fig. 3F). This focal P-cadherin 
immunostaining positivity was confirmed by staining 
multiple sections and by using different monoclonal 
antibodies: In some cases, serial sections also appeared 
to show immunostaining for both E- and P-cadherins 
with the same neoplastic cells. 

In all cases of benign prostate tissue, a- and /^calenin 
protein expression showed strong homogeneous staining 
of the luminal glandular epithelium and the basal cells. 
In a pattern similar to that of normal Ercadherin, the a- 
and ^catenins were localized predominately at luminal 
epithelial cell-cell, borders (Fig. 2A, Q. In all cases of 
prostate cancer, there was a mild-to-moderate reduction 
in staining for both catenins with 50-75% of cells 
positive, and the staining tended to be heterogeneous 
(Fig. 2B, D; Table 4). 



Discussion 



Case 


E-cadherin 


P-cadherin 


a-caicDin 


A-calenin 


1 






+ + 


+ + 


2 


+ + 


+ ■ 


+ + 


+ + 


3 


+ + + ■ 


+ + 


+ + + 


+ '+ + 


4 






+ + 


+ + + 


5 


t + + 


+ 


+ + + 


+ + + 


6 


+ -f 


+ 


+ + + 


+ -f + 


7 


+ + 




+ + 


+ + + 



7^i a,ivC; . + ' < 25% P osi,iw ; 4 + . 25-50% posilive; + + -f 
>V-75V, positive; + + + + ( > 75% positive 



Fig. 2A-D Immunohtstocherm- 
pi staining of a- and 0-caienin 
in non-neopbstk prostate and 
in prostate adenocarcinoma. 
Original magnification x!15. 
^» C Normal membranous 
expression or a-catenin and 
^-catenin in non neoplastic 
prostatic epithelium (Case I). 
B Decreased immunostaining 
for a-catenin in prostate . 
adenocarcinoma (Case I). 
O Decreased and heterogenous 
immunostaining for /f-catenin 
in prostate adenocarcinoma 
(Case I) 



In this study, we observed a coordinated down-regula- 
tion of the expression in the genes involved in the 
cadhenn and catenin mediated cell-cell pathway at the 
mRNA level. The protein levels, as demonstrated by 
the immunohistochemical studies on the corresponding 
tissue sections, were generally consistent with the 
mRNA data as well as with that reported in the litera- 
ture [18, 33, 34, 38, 39). 

E-cadherin showed the most consistent loss of ex- 
pression at both the mRNA level and the protein level. 
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F>g. 3 A Imrminohrslochemical 
staining of E- and P-cadherin in 
non-neoplastic prostate and in 
prostate adenocarcinoma. 
Original magnification x II 5: 
A Normal membranous ex- 
pression of E-cadherin in non- 
neoplastic prostatic epithelium 
(Case I). B, C Complete 
negative (Case I) and focal 
heterogeneous (Case 3) immu- 
nostaihing for E-cadherin in 
prostate adenocarcinoma. 
D Normal continuous basal 
layer expression of P-cadherin 
in a portion of non-neoplastic 
prostatic epithelium (Case 1): 
Ey F Complete negative 
(Case I) and mild focal hetero- 
geneous (Case 3) immunoslain- 
ing for P-cadherin in prostate 
adenocarcinoma 



mm 




LOH at chromosome I6q, where both E-cadherin and 
P-cadherin are located, occurs in up to 30% of prostate 
cancers. Four of our cases showed a greater than 50% 
reduction in mRNA, ranging from 87 to 100%, sug- 
gesting that mechanisms in addition to LOH may play a 
role in the reduction of E-cadherin mRNA levels below 
that of the 50% predicted by LOH alone. 

Previous studies have suggested that P-cadherin 
could serve as a specific marker for basal cell differen- 
tiation and was not expressed in prostate cancer, 
although a recent study has shown focal P-cadherin 
expression in some prostate tumors [18]. In this study, 
we demonstrated P-cadherin mRNA in six of seven 
neoplastic cultures, although it was significantly reduced 
in all six cases. It is, impossible to rule but the possibility 
that some of the P-cadherin mRNA expression may 
have resulted from potential contamination by small 
numbers of non-neoplastic basal cells admixed with the 
neoplastic cells within the culture material. However, we 
favor Ihe interpretation that low levels of P-cadherins 
are expressed in cultured tumor cells, as well as weakly 
in some tumors in vivo owing to disruption of normal 
gene regulation. This interpretation is favored by the 
presence of focal immunostaining for P-cadherin protein 
in histologically neoplastic cells m five of seven cases. 
The histologic selection criteria (requiring minimal to no 
benign prostate glands) should also have minimized 
major contamination. Despite the presence of mRNA in 
primary tumor cultures and focal positive immuno- 



staining, P-cadherin immunostaining may still serve as a 
useful basal cell marker because the staining pattern was - 
distinctly abnormal in the cases where it was focaljy 
present. 

In this study, the levels of cr- and 0-catenin mRNA 
were also lower than norma! in six of seven cases, 
though the reductions were relatively Jess than those of 
cadherins, especially for the a-catenins. Catenin protein 
expression was generally moderate to weak by immu- 
nohistochemistry and showed a heterogeneous cyto- 
plasmic and weak membranous staining pattern. The 
immunopositivity was generally similar in most cases, 
with 25-75% cells showing positivity: The results of the 
immunohistochernical studies were generally consistent 
with the mRNA data; although not in all cases (e.g.. 
Case I); perhaps due to tumor heterogeneity. 

In this study, we also observed a coordinated down- 
regulation of E-cadherin and the catenins in most cases; 
this was most observable at the mRNA level. The cad- 
herins are tightly regulated during embryogenesis and 
appear to serve the need for precise spatio-temporal 
regulation. The promoter sequences of both E-cadherin 
and P-cadherin have been cloned and functionally ana- 
lyzed |2, 5, 10, 15]. Both promoters have similar regu- 
latory elements, such as GC-rich regions and CCAAT 
boxes. Although the two promoters share similar se- 
quences, in vitro binding studies suggest that the two 
promoters are regulated by different transcriptional 
factors [II]. The tissue specificity of these promoters 
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appears to be derived by different combinations of a 

relatively few factors common to many types of tissues 

and does not appear to be derived from transcription 

factors specfic for each type of tissue. In our study; the 

Tx, a . f ° f ex P ression ^ «- and /l-caienins at the 

T v ■ WaS correlate d with reductions in the levels 

111' C ?^ C " n ex P reSSi ° n - A,,hou 8 h ,he mecha- 
nism of this coordmated down-regulation is not known 

W. COOrd ' na,c ? P aMern ™PPO^ the hypothesis that' 
loss or alteration of some regulatory factors occurs 

mcVulT* 2 ' 6 — °' r grCSSi0n - PoSsibJe "danisms 
delude transcriptional factor alterations or hyper- 
meUiylation of the promoter region (14, 43]. Limited 
.nformauon ,s available o„ the transcrip'tional re^fa 
Uon of the catenms, but it is possible that similar 
mechanisms may play a role. 

Jn summary, this is the first comparative study of the 

ated cell^elladhesion pathway at the mRNA level using 

Sd"ZL aS ' ,C no ™°P ,a ** Primary culture! 

wdjeate tba, ()) , herc ls , mar|[ed ( . 

vana ,on m the normal levels of .he cadherins and 
ea enms; (2) m R NA Ievdj rf J '^and 

catenms are s.gn.ficantly reduced in some prostate can- 
cer primary cultures, and the reduction tends to be to the 

rTuhHrT 6 'I CaCh tUm ° r ' su ^ s ' in 6 a derect in a 
regulating mechanism common to all of these genes; (3) 

fcv?. a „H n , n K aPPe f rS '.° * pr£Sen ' at ^ 'he mRNA 
r™ J . * r° ,e ' n kVtl in S °"*P'°"^ cancers. This 
coord ln a,ed down-regulation of E-cadherin and catenin- 
mediated adhesion pathways may play a crucial role in 
tumor pathogenesis and metastasis. 
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Down-regulation of prostate-specific antigen expression by finasteride 
through inhibition of complex formation between androgen receptor and 
steroid receptor-binding consensus in the promoter of the PSA gene in 
LNCaP cells. 

Wang LG , LjuXM, KreisjY, Budman DR . 

Department of Medicine, New York University, Manhasset 11030, USA. 

As a specific competitive inhibitor of Salpha-reductase, an intracellular enzyme that 
converts testosterone to dihydrotestosterone, finasteride is being extensively used for the 
treatment of benign prostatic hyperplasia and in experimental settings for prostate cancer. 
In this study, we showed that finasteride markedly inhibited prostate-specific antigen 
(PSA) secretion and expression. The promoter of the PSA gene contains several well- 
known cis-regulatory elements. Among them, steroid receptor-binding consensus (SRBC) 
has been identified as a functional androgen-responsive element Our previous study 
showed that PSA was not only present in conditioned medium of the PSA-positive 
LNCaP cells but was also detectable in 'small- amounts in PSA-negative cell lines PC-3 
and DU-I45 (L. G. Wang et aL, Oncol. Rep., 3:911-917, 1996). A strong correlation 
between binding of nuclear factors to SRBC and the level of PSA present in the 
conditioned medium and cell extracts was found in these three cell lines, whereas no such 
correlation with binding was obtained using Spl oligonucleotide as a probe. Binding of 
LNCaP cell nuclear proteins to SRBC was drniinished when the cells were exposed to 25 
mjcroM finasteride, at which concentration 50% of both PSA mRNA and protein were 
inhibited. As a major component of DNA-protein complexes, the level of androgen 
receptor was dramatically decreased in the cells treated with finasteride. Our data indicate 
that inhibition of complex formation between SRBC and nuclear proteins due to the 
remarkable decrease in the level of androgen receptor plays a key role in the down- 
regulation of PSA gene expression by finasteride in LNCaP cells. 
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Expression of calcyclin in human melanocyte lesions. 
Weterman MA. van Muijen GN . Bloemers HP , Ruicer DJ . 

Department of Biochemistry, University of Nijmegen, The Netherlands. 

When comparing two subsequent stages of melanocyte tumor progression we identified 
calcyclin as a new potential progression marker, the expression of which was correlated 
with metastatic behavior of various human melanoma cell lines in nude mice. In this 
study, we describe a good correlation between RNA and protein levels in the xenografts 
of these cell lines and extended these experiments to a panel of 120 routinely processed 
human melanocytic cutaneous lesions: Northern blot analysis demonstrated that calcyclin 
RNA expression was elevated in melanoma metastases as compared to several types of 
nevocellular nevi. Calcyclin staining using a specific polyclonal antiserum showed a 
more complex pattern. A stronger staining in a higher percentage of positive cells was 
observed in thick primary melanoma (> or = 1 .5 mm) as compared to thin primary 
melanoma (< 1 .5 mm). Calcyclin expression was also present in a higher percentage of 
cells showing a stronger staining in melanomas with higher Clark levels (> II) 
corresponding to the vertical growth phase of primary melanomas. Protein expression in 
nevocellular nevi was confined to the dermal part and was highest in the lower parts of 
the dermis. Remarkably, dysplastic nevi (atypical moles), potential precursors of 
melanoma, did not show any expression at all, either in junctional or dermal parts. 
Confinement of the expression to the dermal part of nondysplastic nevi and primary 
melanomas may reflect interactions with the microenvironment of the reticular dermis 
that occurs with vertical growth! 

PMID: 8261423 [PubMed - indexed for MEDLINE] . * 
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Estrogen regulation of the cytochrome P450 3 A subfamily in humans. 

Williams ET, Leyk M, Wrighton SA , Pavics PJ , Loose PS , Shipley GL, Strobe! HW . 

Department of Biochemistry, Medical School, University of Texas Health Science Center 
at Houston, 643 1 Fannin, MSB 6.200, Houston, TX 77030, USA. 

This study examines the possible role of estrogen in regulating the expression of the 
human CYP3A subfamily: CYP3A4, CYP3A5, CYP3A7, and CYP3A43. To accomplish 
this goal, mRNA was quantified from human livers and endometrial samples, and total 
CYP3A protein levels were evaluated by Western immunoblot analysis of the liver 
samples. The human endometrial samples were from premenopausal and postmenopausal 
women. The premenopausal endometrium was either in the proliferative or secretory 
: phase, whereas for the postmenopausal endometrium samples, the women had been 
treated with either a placebo or estropipate, an estrogen substitute. After analyses, 
CYP3A4 mRNA was shown to have lower hepatic expression in females than in males. 
In the endometrium, CYP3A4 and CYP3 A43 are down-regulated by estrogen, whereas 
CYP3A5 is expressed at higher levels during the secretory phase. CYP3A7 was not 
detected in the endometrium. In addition, the CYP3 A subfamily showed increased 
mRNA expression in the liver as age increased. The expression levels of total CYP3A 
protein and total CYP3A mRNA showed good correlation. Despite apparent regulation of 
CYP3A4 mRNA expression by estrogen, the effects of estrogen may be overshadowed 
by additional regulators of gene expression. 
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Severely decreased MARCKS expression correlates with ras reversion but 
not with mitogenic responsiveness. 

Wojta szek PA , Stumpo DJ , Blackshenr PJ . Macara IG . 

Department of Pathology, University of Vermont College ofMedicine", Burlington 
05405. > o 

Phorbol ester-inducible phosphorylation of MARCKS, the 'gO-kDa' substrate of protein 
kinase C, was undetectable in several phenotypically dominant, non-transformed 
revertants independently derived from the ras-transformed cell line NIH3T3 DT-ras 
Extremely low expression of MARCKS protein accounted for this apparent lack of 
phosphorylation. MARCKS-encoding mRNA levels were correspondingly decreased 
relative to normal and ras-transformed cells in all four ras revertant cell lines studied: C- 
11 and F-2, derived by 5-azacytidine treatment and selection with ouabain; CHP 9CJ 
derived by ethylmethane sulfonate mutagenesis and selection with cis-hydroxy-L-pro'line" 
^i 2 nS^ enVed by transfection with *e human Krev-1 gene. However, re-expression' 
ol MARCKS after transfection of a cloned MARCKS cDNA into the C-l 1 ras revertant 
cells was not sufficient to induce retransformation. In fact, no significant difference in 
sensitivity to mitogenic stimulation by phorbol esters was observed among several cell 
lines expressing widely varying levels of MARCKS. This evidence argues against a 
direct role for MARCKS in mitogenic signaling. However, the strong correlation between 
attenuation of MARCIS expression and phenotypically dominant ras reversion suggests 
that a common negative regulatory mechanism might be responsible for both effects 
presenting a potentially useful strategy for identifying factors involved in transducing the 
ras signal. 
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Abstract 



Background 

Prostate stem cell antigen (PSCA) is a recently defined homologue of the 
Thy .i/Ly-6 family of glycosylphosphatidylinositol (GPI)-anchored cell 
surface antigens. The purpose of the present study was to examine the 
expression status of PSCA protein and mRNA in clinical specimens of human 
prostate cancer (Pea) and to validate it as a potential molecular target for 
diagnosis and treatment of Pea. 



Materials and Methods 

Immunohistochemical (IHC) and in situ hybridization (ISM) analyses of PSCA 
expression were simultaneously performed on paraffin-embedded sections 
from 20 benign prostatic hyperplasia (BPH), 20 prostatic intraepithelial 
neoplasm (PJN) and 48 prostate cancer (Pea) tissues, including 9 androgen- 
indepe-ndent prostate cancers. The level of PSCA expression was 
serniquantitatively scored by assessing both the percentage and intensity of 
PSCA- positive staining cells in the specimens. Then compared PSCA 
expression between BPH, PIN and Pea tissues and analysed the correlations 
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of PSCA expression level with pathological grade, clinical stage and 
progression to androgen-independence in Pea. 



Results 



.i-r.iion 



Rei,;, e ri.;, ?s 



In BPH and low grade PIN, PSCA protein and mRNA staining were weak or 
negative and less intense and uniform than that seen in HGP1N and Pea 
There were moderate to strong PSCA protein and mRNA expression in 8 of 
11 (72.7%) HGP1N and in 40 of 48 (83.4%) Pea specimens examined by 
JHC and ISH analyses, with statistical significance compared with BPH 
(20%) and low grade PIN (22.2%) samples (p < 0.05, respectively) The 
expression level of PSCA increased with high Gleason grade, advanced stage 
and progresston to androgen-independence (p < 0.05, respectively) In 
add.tion, 1HC and ISH staining showed a high degree of correlation between 
PSCA protein and mRNA overexpression. 

Conclusions 

Our data demonstrate that PSCA as a new cell surface marker is 
overexpressed by a majority of human Pea. PSCA expression correlates 
posrt.vely with adverse tumor characteristics, such as increasing 
pathological grade (poor cell differentiation), worsening clinical stage and 
androgen-independence, and speculatively with prostate carcinogenesis 
PSCA protein overexpression results from upregulated transcription of PSCA 
mRNA. PSCA may have prognostic utility and may be a promising molecular 
target for diagnosis and treatment of Pea. 



Outline Introduction 



Prostate cancer (Pea) is the second leading cause of cancer- related death in 
American men and is becoming a common cancer increasing in China 
Despite recently great progress in the diagnosis and management of 
ocahzed disease, there continues to be a need for new diagnostic markers 
that can accurately discriminate between indolent and aggressive variants of 
Pea. There also continues to be a need for the identification and 
characterization of potential new therapeutic targets on Pea cells Current 
diagnostic and therapeutic modadties for recurrent and metastatic Pea have 
been limited by a lack of specific target antigens of Pea. 

Although a number of prostate-specific genes have been identified (i e • 
prostate specific antigen, prostatic acid phosphatase, glandular kallikrein 2) 
the majority of these are secreted proteins not ideally suited for many 
immunological strategies. So, the identification of new cell surface antigens 
is critical to the development of new diagnostic and therapeutic approaches 
to the management ol Pea. ' ' 

Rei.er RE et al | J ] repo.ted the identification of prostate stem cell antigen 
(PSCA), a cell surface antigen that is predominantly prostate specific The 
PSCA gene encodes a J 23 amino acd glycoprotein, with 30% homology to 
tern eel antigen 2 (Sea 2). Like Sea-2, PSCA also belongs to a member of 
he Thy- J/ Ly 6 family s „d is anchored by a glycosylphosphaliclylinositol 
(GPJ) linkage. mRNA i„ situ hybridization (ISH) localized PSCA exore- 
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normal prostate to the basal cell epithelium, the putative stem cell 
compartment of prostatic epithelium, suggesting that PSCA may be a 
marker of prostate stem/progenitor cells. 

In order to examine the status of PSCA protein and mRNA expression in 
human Pea and validate it as a potential diagnostic and therapeutic target • 
for Pea, we used immunobistochernistry (IHC) and in situ hybridization 
(ISH) simultaneously, and conducted PSCA protein and mRNA expression 
analyses in paraffin- embedded tissue specimens of benign prostatic 
hyperplasia (BPH, n = 20), prostate intraepithelial neoplasm (PIN, n = 20) 
and prostate cancer (Pea, n = 48). Furthermore, we evaluated the possible 
correlation of PSCA expression level with Pea tumorigenesis, grade, stage 
and progression to androgen-independence. 
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Abstract 
Introduction 
Materials and methods 
Results 
Discussion 
Competing interests 
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Tables 



Table 

Correlation of PSCA 
expression with Gleason 
score 



TahU: 2 

Correlation of PSCA 
expression with clinical 

stage 



Tissue samples 

Ad of the clinical tissue specimens studied herein were obtained from 80 
patients of 57-84 years old by prostatectomy, transurethral resection of 
prostate (TURP) or biopsies. The patients were classified as 20 cases of BPH, 
20 cases of PIN, 40 cases of primary Pea, including 9 patients with recurrent 
Pea and a history of androgen ablation therapy (orchiectomy and/or 
hormonal therapy), who were referred to as androgen- independent prostate 
cancers. Eight specimens were harvested from these androgen-independent 
Pea patients prior to androgen ablation treatment. Each tissue sample was 
cut into two parts, one was fixed in .10% formalin for IHC and the other 
treated with 4% paraformaldehyde/0. 3 M PBS PH 7.4 in 0.1% DEPC for 1 h 
for JSH analysis, and then embedded in paraffin. All paraffin blocks 
examined were then cut into 5 urn sections and mounted on the glass slides 
specific for IHC and ISH respectively in the usual fashion. H&E-stained 
section of each Pea was evaluated and assigned a Gleason score by the 
experienced urological pathologist at our institution based on the criteria of 
Gleason score [2]. The Gleason sums are summarized in Table i . Clinical 
staging was performed according to Jewett-whitmore-prout staging system, 
as shown in Table 2. In the category of PIN, we graded the specimens into 
two groups, i.e. low grade PIN (grade 1 - II) and high grade PIN (HGP1N, 
grade 111) on the basis of literatures [3,4]. 

Immunohistochemical (IHC) analysis 

Briefly, tissue sections were deparaffinized, dehydrated, and subjected to 
microwaving in JOmmol/L citrate buffer, PH 6.0 (Boshide, Wuhan, China) in 
a 900 W oven for 5 rnin to induce epitope retrieval. Slides were allowed to 
cool at room temperature for 30 rnin. A primary mouse antibody specific to 
human PSCA (Boshide, Wuhan, China) with a 3:300 dilution was applied to 
incubate with the slides at room temperature for 2 h. Labeling was detected 
by sequentially adding biotinylated secondary antibodies and strepovidin- 
peroxidase, and localized using 3,3'-diaminobenzidine reaction. Sections 
were then counterstained with hematoxylin. Substitution of the primary 
antibody with phosphate- buffered-saline (PBS) served as a negative- 
stain mu control. 
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mRNA in situ hybridization (ISH) 

Five-fim-thick tissue sections were deparaffinized and dehydrated, then 
digested in pepsin solution {4 mg/ml in 3% citric acid) for 20 min at 37 5°C 
and further processed for ISH. Dtgoxigenin-labeled sense and antisense 
human PSCA RNA probes (obtained from Boshide, Wuhan, China) were 
hybridized to the sections at 48°C overnight. The posthybridization wash 
with a high stringency was performed sequentially at 37°C in 2 x standard 

saline citrate (SSC). for 10 min, in 0.5 x SSC for 15 min and in 0.2 x SSC 
for 30 mm. The slides were then incubated to biotinylated mouse anti- 
d.goxigenin antibody at 37.5°C for 1 h followed by washing in 1 x PBS for 
20 mm at room temperature, and then to strepavidin-peroxidase at 37 5°C 
for 20 min followed by washing in 1 x PBS for 15 min at room temperature 
Subsequently, the slides were developed with diaminobenzidine and then 
counterstained with hematoxylin to localize the hybridization signals 
Sections hybridized with the sense control probes routinely did not show any 
specific hybridization signal above background, All slides were hybridized 
with PBS to substitute for the probes as a negative control. 

Scoring methods 

; To determine the correlation between the results of PSCA immunostaining 
and mRNA in situ hybridization, the same scoring manners are taken in the 
present study for PSCA protein staining by ]HC and PSCA rnRNA staining by 

Each sl,de was read an d scored by two independently experienced 
urological pathologists using Olympus BX-41 light microscopes The 
evaluation was done in a blinded fashion. For each section, five areas of 
similar grade were analyzed semiquantitative^ for the fraction of ceils 
staining. Fifty percent of specimens were randomly chosen and rescored to 
determ.ne the degree of interobserver and iritraobserver concordance There 
was greater than 95% intra- and interobserver agreement. 

The intensity of PSCA expression evaluated microscopically was graded on o 
scale of 0 to 3+ with 3 being the highest expression observed (0 no 
staining; 1 + , mildly intense; 2+, moderately intense; 3+, severely intense) 
The staining density was quantified as the percentage of cells staining 
positive for PSCA with the primary antibody or hybridization probe as 
follows: 0 - no staining; 1 = positive staining in <25% of the sample" 2 = 
positive staining in 25%-50% of the sample; 3 = positive staining in >50% 
of the sample. Intensity score (0 to 3 + ) was multiplied by the density score 
(0-3) to give an overall score of 0-9 f j,*]. m this way, we were able to 
differentiate specimens that may have had focal areas of increased staining 
horn those that had diffuse areas of increased staining | 6 ]. The overall score 
for each specimen was then categorically assigned to one of the following- 
groups: 0 score, negative expression; 1-2 scores, weak expression- 3-6 
scores, moderate expression; 9 score, strong expression. 

Statistical analysis 

Intensity and density of PSCA piote.n and mRNA expression in BPH PIN and 
Pea tissues were compared using the Cbi-squa.e and Student's r-test 
Univariate associations between PSCA expression and Gleason score climrol 
stage and progression to andiogen-imlepenrience were calculated using 
F.sher's Exact lest. For all analyses, p < 0.05 was considered statically 
significant. 1 
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Figures 



PSCA expression in BPH 

In general, PSCA protein and mRNA were expressed weakly in individual 
samples of BPH. Some areas of prostate expressed weak levels (composite 
score 1-2), whereas other areas were completely negative (composite score 
0). Four cases (20%) of BPH had moderate expression of PSCA protein and 
mRNA (composite score 4-6) by IHC and ISH. In 2/20 (10%) BPH 
specimens, PSCA mRNA expression was moderate (composite score 3-6), 
but PSCA protein expression was weak (composite score 2) in one and 
negative (composite score 0) in the other. PSCA expression was localized to 
the basal and secretory epithelial cells, and prostatic stroma was almost 
negative staining for PSCA protein and mRNA in all cases examined. 



PSCA expression in PIN 



Figure 1 

Representatives of PSCA 
IHC and ISH staining in Pea 
(A. IHC staining, B. ISH 
staining, x200 
magnification) 



Jn this study, we detected weak or negative expression of PSCA protein and 
mRNA (^2 scores) in 7 of 9 (77.8%) low grade PIN and in 2 of 31 (3 8.2%) 
HGPIN, and moderate expression (3-6 scores) in the rest 2 low grade PIN 
and 5 of 11 (45.5%) HGPIN. One HGPIN with moderate PSCA mRNA 
expression (6 score) was found weak staining for PSCA protein (2 score) by 
IHC. Strong PSCA protein and mRNA expression (9 score) were detected in 
the remaining 3 of 13 (27.3%) HGPIN. There was a statistically significant 
difference of PSCA protein and mRNA expression levels observed between 
HGPIN and BPH (p < 0.05), but no statistical difference reached between 
low. grade PIN and BPH (p > 0.05). 



PSCA expression in Pea 



In order to determine if PSCA protein and mRNA can be detected in prostate 
cancers and if PSCA expression levels are increased in malignant compared 
with benign glands, Forty-eight paraffin-embedded Pea specimens. were 
analysed by IHC and ISH. It was shown that 19 of 48 (39.6%) Pea samples 
stained very strongly for PSCA protein and mRNA with a score of 9 and 
another 21 (43.8%) specimens displayed moderate staining with scores of 
4-6 (Figure 1). In addition, 4 specimens with moderate to strong PSCA 
mRNA expression (scores of 4-9) had weak protein staining (a score of 2) 
by IHC analyses. Overall, Pea expressed a significantly higher level of PSCA 
protein and mRNA than any other specimen category in this study (p < 
0.05, compared with BPH and PIN respectively). The result demonstrates 
that PSCA protein and mRNA are overexpressed by a majority of human 
Pea. 



Correlation of PSCA expression with Gleason score in Pea 

Using the semi-quantitative scoring method as described in Materials and 
Methods, we compared the expression level of PSCA protein and mRNA with 
Gleason grade of Pea, as shown in Table .1 . Prostate adenocarcinomas were 
graded by Gleason score as 2-4 scores = well-differentiation, 5-7 scores = 
moderate-differentiation and 8-10 scores = poor-diflerentiation 
Seventy-two percent of Gleason scores 8-10 prostate cancers had very 
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strong staining of PSCA compared to 21% with Gleason scores 5-7 and 17% 
with 2-4 respectively, demonstrating that poorly differentiated Pea had 
significantly stronger expression of PSCA protein and mRNA than 
moderately and well differentiated tumors (p < 0.05). As depicted in Figure 
1, IHC and ISH analyses showed that PSCA protein and mRNA expression in 
several cases of poorly differentiated Pea were particularly prominent, with 
more intense and uniform staining. The results indicate that PSCA 
expression increases significantly with higher tumor grade in human Pea. 

Correlation of PSCA expression with clinical stage in Pea 

With regards to PSCA expression in every stage of Pea, we showed the 
results in Table 2. Seventy-five percent of locally advanced and node 
positive cancers (i.e. C-D stages) expressed statistically high levels of PSCA 
versus 32.5% that were organ confined (i.e. A-B stages) (p < 0.05). The 
data demonstrate that PSCA expression increases significantly with . 
advanced tumor stage in human Pea. 

Correlation of PSCA expression with androgen- 
independent progression of Pea 

All 9 specimens of androgen-independent prostate cancers stained positive 
for PSCA protein and mRNA. Eight specimens were obtained from patients 
managed prior to androgen ablation therapy. Seven of eight (87.5%) of 
these androgen-independent prostate cancers were in the strongest staining 
category (score = 9), compared with three out of eight (37.5%) of patients 
with androgen-dependent cancers (p < 0.05). The results demonstrate that 
PSCA expression increases significantly with progression to androgen- 
independence of human Pea. 

It is evident from the results above that within a majority of human prostate 
cancers the level of PSCA protein and mRNA expression correlates 
significantly with increasing grade, worsening stage and progression to 
androgen- independence. 

Correlation of PSCA immunostaining and mRNA in situ 
hybridization 

]n all 88 specimens surveyed herein, we compared the results of PSCA IHC 
staining with mRNA ISH analysis. Positive staining areas and its intensity 
and density scores evaluated by IHC were identical to those seen by JSH in 
79 of 88 (89.8%) specimens (18/20 BPH, 19/20 PIN and 42/48 Pea 
respectively). Importantly, 27/27 samples with PSCA mRNA composite 
scores of 0-2, 32/36 samples with scores of 3-6 and 22/24 samples with a 
score of 9 also had PSCA protein expression scores of 0-2, 3-6 and 9 
respectively. However, in 5 samples with PSCA mRNA overall scores of 3-6 
and in 2 with scores of 9 there were less or negative PSCA protein 
expression {i.e. scores of 0-4), suggesting that this may i effect 
post transcriptional modification of PSCA or that the epitopes recognized by 
PSCA mAb may be obscured in some cancers. The data demonstrate that 
.the results of PSCA immunostaining were consistent with those of mRNA 
JSH analysis, showing a high degree of correlation between PSCA protein 
and rnRNA expression. 
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PSCA is homologous to a group of celt surface proteins that mark the 
earliest phase of hematopoietic development. PSCA mRNA expression is 
prostate-specific in normal male tissues and is highly up-regulated in both 
androgen-dependent and-independent Pea xenografts (LAPC-4 tumors). We 
hypothesize that PSCA may play a role in Pea tumorigenesis and 
progression, and may serve as a target for Pea diagnosis and treatment. In 
this study, 1HC and ISH showed that in general there were weak or absent 
PSCA protein and mRNA expression in BPH and low grade PIN tissues. 
However, PSCA protein and mRNA are widely expressed in HGPIN, the 
putative precursor of invasive Pea, suggesting that up-regulation of PSCA is 
an early event in prostate carcinogenesis. Recently, Reiter RE et al [1], 
using ISH analysis, reported that 97 of 118 (82%) HGP1N specimens stained 
strongly positive for PSCA mRNA. A very similar finding was seen on mouse 
PSCA (mPSCA) expression in mouse HGPIN tissues by Tran C. P et al [3]. 
These data suggest that PSCA may be a new marker associated with 
transformation of prostate cells and tumorigenesis. 

. We observed that PSCA protein and mRNA are highly expressed in a large 
percentage of human prostate cancers, including advanced, poorly 
differentiated, androgen-independent and metastatic cases. Fluorescence- 
activated cell sorting and confocal/ immunofluorescent studies 
demonstrated cell surface expression of PSCA protein in Pea cells [9]. Our 
IHC expression analysis of PSCA shows not only cell surface but also 
apparent cytoplasmic staining of PSCA protein in Pea specimens (Figure J. ). 
One possible explanation for this is that anti-PSCA antibody can recognize 
PSCA peptide precursors that reside in the cytoplasm. Also, it is possible 
that the positive staining that appears in the cytoplasm is actually from the 
overlying cell membrane [5]. These data seem to indicate that PSCA is a 
novel cell surface marker for human Pea. 

Our results show that elevated level of PSCA expression correlates with high 
grade (i.e. poor differentiation), increased tumor stage and progression to 
androgen-independence of Pea. These findings support the original IHC 
analyses by Gu 2 et al [9], who reported that PSCA protein expressed in 
94% of primary Pea and the intensity of PSCA protein expression increased 
with tumor grade, stage and progression to androgen- independence. Our 
results also collaborate the recent work of Han KR et al [2 0], in which the 
significant association between high PSCA expression and adverse 
prognostic features such as high Gleason score, seminal vesicle invasion and 
capsular involvement in Pea was found. Jt is suggested that PSCA 
overexpression may be an adverse predictor for recurrence, clinical 
progression, or survival of Pea. Hora H et al [II] used RT-PCR detection of 
PSA, PSMA and PSCA in ) ml of peripheral blood to evaluate Pea patients 
. with poor prognosis. The results showed that among 58 PCa patients, each . 
PCR indicated the prognostic value in the hierarchy of PSCA> PSA> PSMA RT- 
PCR, and extraprostatic cases with positive PSCA PCR indicated lower 
disease-progression-free survival than those with negative PSCA PCR, 
demonstrating that PSCA can be used as a prognostic factor. Dubey P et al 
[].?] repotted that elevated numbers of PSCA -r cells correlate positively 
with the onset and development of prostate carcinoma over a long tune 
span in the prostates of the TRAMP and PTE N t /- models compared with its 
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norma* prostates. Taken together with our present findings, in which PSCA 
is overexpressed from HGPJN to almost frank carcinoma, it is reasonable 
and possible to use increased PSCA expression level or increased numbers 
of PSCA-positive cells in the prostate samples as a prognostic. marker to 
predict the potential onset of this cancer. These data raise the possibility 
that PSCA may have diagnostic utility or clinical prognostic value in human 
Pea. 

The cause of PSCA overexpression in Pea is not known. One possible 
mechanism is that it may result from PSCA gene amplification. In humans, 
PSCA is located on chromosome 8q24.2 [1], which is often amplified in 
metastatic and recurrent Pea and considered to indicate a poor prognosis 
[13-1-5]. Interestingly, PSCA is in close proximity to the c-myc oncogene, 
which is amplified in >20% of recurrent and metastatic prostate cancers' 
[ 1 6, 1 7]. Reiter RE et al [ 1 8] reported that PSCA and MYC gene copy 
numbers were co-amplified in 25% of tumors (five out of twenty), 
demonstrating that PSCA overexpression is associated with PSCA and MYC 
coamplification in Pea. Gu 2 et al [9] recently reporteted that in 102 
specimens available to compare the results of PSCA immunostaining with 
their previous mRNA ISH analysis, 92 (90.2%) had identically positive areas 
• of PSCA protein and mRNA expression. Taken together with our findings, in 
which we detected moderate to strong expression of PSCA protein and 
mRIMA in 34 of 40 (85%) Pea specimens examined simultaneously by IHC 
and ISH analyses, it is demonstrated that PSCA protein and mRNA 
overexpressed in human Pea, and that the increased protein level of PSCA 
was resulted from the.upregulated transcription of its mRNA. 

At present, the regulation mechanisms of human PSCA expression and its 
biological function are yet to be elucidated. PSCA expression may be 
regulated by multiple factors [13]. Watabe T et al [3*] reported that • 
transcriptional control is a major component regulating PSCA expression 
levels. In addition, induction of PSCA expression may be regulated or 
mediated through cell-cell contact and protein kinase C (PKC) [20]. * 
Homologues of PSCA have diverse activities, and have themselves been 
involved in carcinogenesis. Signalling through SCA-2 has been demonstrated 
to prevent apoptosis in immature thymocytes [2J]. Thy-3 is involved in T 
cell activation and transducts signals through src-like tyrosine kinases [22]. 
Ly-6 genes have been implicated both in turnorigenesis and in cell-cell 
adhesion [2>2S]. Cell-cell or cell-matrix interaction is critical for local tumor 
growth and spread to distal sites. From its restricted expression in basal 
cells of normal prostate and its homology to SCA-2, PSCA may play a role in 
stem/progenitor cell function, such as self-renewal {i.e. anti-apoptosis) 
and/or proliferation [1]. Taken together with the results in the present 
study, we speculate that PSCA may play a role in turnorigenesis and clinical 
progression of Pea through affecting cell transformation and proliferation. 
From out results, it is also suggested that PSCA as a new cell surface 
antigen may have a number of potential uses in the diagnosis, therapy and 
clinical prognosis of human Pea. PSCA overexpression in prostate biopsies 
could be used to identify patients at high risk to develop recurrent or 
metastatic disease, and to discriminate cancers from normal glands in 
prostatectomy samples. Sirnilaify, the detection of PSCA- over expressing 
cells in bone marrow or peripheral blood may identify and predict metastatic 
progression better than current assays, which identify only PSA-positive or 
PSMA- positive prostate cells. 
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In summary, we have shown in this study that PSCA protein and mRNA are 
maintained in expression from HGPIN through all stages of Pea in a majority 
of cases, which may be associated with prostate carcinogenesis and 
correlate positively with high tumor grade (poor cell differentiation), 
advanced stage and androgen- independent progression. PSCA protein 
overexpression is due to the upregufation of its mRNA transcription. The 
results suggest that PSCA may be a promising molecular marker for the 
clinical prognosis of human Pea and a valuable target for diagnosis and 
therapy of this tumor. 
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[Expression of humau telomerase reverse transcriptase in cervix cancer 
and its significance) 

[Article in Chinese] 

XTL, Zhu T . WuP, Xu Q, Huang L , LiKZ, Lu YP , Ma D , 

Department of Obstetrics and Gynecology, Tongji Hospital, Tongji Medical College, 
Huazhong University of Science and Technology, Wuhan 430030, China. 

OBJECTIVE: To investigate the expression of human telomerase reverse transcriptase 
(hTERT) mRNA and protein in cervix cancer, cervical intraepithelial neoplasia (CIN) 
and normal cervix. METHODS: Expression of hTERT mRNA and the other two subunits 
. of telomerase, human telomerase RNA component (hTR), human telornerase-associated 
protein (hTPI) was determined by RT-PCR in 3 cervix cancer ceil lines, 2 diploid cell 
lines, 38 cases of cervix cancer, 16 cases of CIN and 20 cases of normal cervix. 
Telomerase activity was also examined by telomeric repeat amplification protocol 
enzyme-linked immunosorbent assay (TRAP-ELISA). Expression of hTERT protein was 
detected in all die cell lines and 10 1 cases of parafflnized cervix tissue sections. 
RESULTS: hTERT mRNA expression was detected in all of the three cervix cancer cell 
lines, 81.6% of cervix cancer, 37.5% of CIN, 5.0% of normal cervix, while in neither of 
the two diploid cell lines. The other two subunits of telomerase were prevalently 
expressed in all of the cell tines and most cervix tissues. There was a strong correlation 
between hTERT mRNA expression and telomerase activity. Imrnunostaining also 
revealed that hTERT protein was expressed in all three cervix cancer cell lines, 65.5% of. 
cervix cancer, 28.0% of CIN and 4.8% of normal cervix. CONCLUSION: Up-regulation 
of hTERT may play an important role in the development of CIN and cervix cancer, 
hTERT could be used as an early diagnostic biomarker for cervix cancer. 
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Gene expression signatures and biomarkers of noninvasive and invasive 
breast cancer cells: comprehensive profiles by representational difference 
analysis, microarrays and proteomics 

GM Nagaraja 1 , M Othman 2 , BP Fox 1 , R Alsaber 1 , CM Pellegrino 3 , Y Zeng 2 , R Khanna 2 , 
P Tamburini 3 , A Swaroop 2 and RP Kandpal 1 

1 Department of Biological Sciences, Fordham University, Bronx, NY, USA; department of Ophthalmology and 
Visual Sciences, University of Michigan, Ann Arbor, Ml, USA and J Bayer Corporation, West Haven, CT, USA 



We have characterized comprehensive transcript and 
proteomic profiles of cell lines corresponding to normal 
breast (MCF10A), noninvasive breast cancer (MCF7) and 
invasive breast cancer (MDA-MB-231). The transcript 
profiles were first analysed by a modified protocol for 
representational difference analysis (RDA) of cDNAs 
between MCF7 and MDA-MB-231 cells. The majority of 
genes identified by RDA showed nearly complete con- 
cordance with microarray results, and also led to the 
identification of some differentially expressed genes such 
as lysyl oxidase, copper transporter ATP7A, EphB6, 
RUNX2 and a variant of RUNX2. The altered transcripts 
identified by microarray analysis were involved in cell-cell 
or cell-matrix interaction, Rho signaling, calcium home- 
ostasis and copper-binding/sensitive activities. A set of 
nine genes that included GPCR11, cadherin 11, annexin 
Al, vimentin, lactate dehydrogenase B (upregulated in 
MDA-MB-231) and GREB1, S100A8, amyloid /J pre- 
cursor protein, claudin 3 and cadherin 1 (downregulated in 
MDA-MB-231) were sufficient to distinguish MDA-MB- 
231 from MCF7 cells. The downregulation of a set of 
transcripts for proteins involved in cell-cell interaction 
indicated these transcripts as potential markers for 
invasiveness that can be detected by methylation-speciftc 
PCR. The proteomic profiles indicated altered abundance 
of fewer proteins as compared to transcript profiles. 
Antisense knockdown of selected transcripts led to 
inhibition of cell proliferation that was accompanied by 
altered proteomic profiles. The proteomic profiles of 
antisense transfectants suggest the involvement of pepti- 
dyl-prolyl isomerase, Raf kinase inhibitor and 80kDa 
protein kinase C substrate in mediating the inhibition of 
cell proliferation. 

Oncogene (2006) 25, 2328-2338. doi: 10. 1 03 8/sj. one. 1209265; 
published online 28 November 2005 , 
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Introduction 

The transformation of a normal cell into a cancer cell 
has been correlated to altered expression of a variety of 
genes (Perou et al, 2000; Becker et al, 2005). The 
expression of some of these genes is a direct result of 
sequence mutation, whereas other changes occur due to 
alterations in gene products that participate in specific 
pathways. The changes in gene expression have been 
routinely characterized by classical subtraction hybridi- 
zation and differential display approaches (Cerosaletti 
et al, 1995; Alpan et al, 1996). With the availability 
of the human genome sequence and sequences for a 
number of other model organisms, traditional methods 
have largely been replaced by gene microarrays 
(Khan et al, 2001). These analyses have been used to 
characterize the molecular basis of a variety of diseases 
including cancer. A comprehensive analysis of a large 
number of cancer cell lines allowed clustering of genes 
into groups based on their expression patterns in 
phenotypically related cell lines (Khan et al, 2001; 
Dan et al, 2002; Rosenwald et al, 2002; van't Veer 
et al, 2002). The results of profiling experiments 
indicated expression of specific gene clusters in cell lines 
that have the same origin or have arisen from the same 
organ (Ross et al, 2000). A complementary approach 
that has been used in limited ways is proteomics. 
Proteomics scores for changes in different proteins 
and peptides in cells with characteristic pheno- 
typic differences. However, a comparative analysis of 
transcripts and proteins to establish a relationship 
between transcript changes and protein levels has not 
yet become routine. 

Although expression profiling of tumor tissue and its' 
comparison with normal tissue, in principle, is most 
appropriate to obtain the genetic signatures of a turner 
type, such comparisons have not been free of attendant 
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complications. These complications arise due to hetero- 
geneity of tumor specimens wherein any cell type- 
specific changes are likely to be masked by other cell 
types that constitute the tumor specimen. For this 
reason, well-characterized cell lines established 
from tumor tissue may prove more informative and 
have been considered useful by cancer researchers. 
Comparing gene profiles between cell lines has the 
potential to reveal genes that could be causative for 
the phenotype and other genes that can serve as tumor 
biomarkers. 

Our investigations are aimed at designating a subset 
of transcripts that could distinguish a normal breast cell 
from a breast cancer cell and help to predict tumori genie 
or metastatic potential of a transformed cell. We 
describe here transcript and proteomic profiles of a 
normal breast cell line, a tumorigenic but noninvasive 
breast carcinoma cell line and an invasive breast 
carcinoma cell line, and summarize them as a set of 
candidate biomarkers or targets for therapeutic inter- 
vention. The comparison of transcript profiles with 
proteomic profiles demonstrated that altered proteins 
were not always represented in the microarray desig- 
nated profiles and vice versa. Furthermore, we have 
targeted five transcripts that were upregulated in MCF7 
cells for investigating their role in cell proliferation 
pathways. The proteomic profiles have revealed that 
inhibition of cell proliferation by antisense knockdown 
was mediated by a specific set of proteins. 



Results 

Representational difference analysis 
As described in the Materials and methods section, 
RDA was performed by using cDNAs from MCF7 and 
MDA-MB-231 as tester/driver or driver/tester combina- 
tion. The difference product in the first case represents 
the genes that are either upregulated in or specific to 
MCF7. On the other hand, the difference product of 
MDA-MB-231 (tester) and MCF7 (driver) hybridiza- 
tion resulted in the isolation of cDNAs that are either 
upregulated in or specific to MDA-MB-231. The initial 
linkers used in this protocol had internal BgHl sites. 
One strand of the linker was used to amplify both the 
tester and driver cDNAs after linkers had been ligated 
to cDNAs. After removal of linkers from amplified 
cDNAs by digestion with Bglll, a dephosphorylated 
BgUl adaptor was ligated to tester DNA. The BglH 
adaptor had an internal EcoKl site. The difference 
product was digested with EcoKl and cloned in pBlue- 
Script vector. The cloning efficiency of the difference 
product was very low (5 x 10" c.f.u./^g of DNA). The 
low efficiency of cloning is attributed to a substantial 
fraction of amplified DNA product that is refractory to 
restriction digestion. The sequencing of a set of 100 
clones each from the difference libraries revealed 50 
different kinds of clones. The majority of these 
sequences were short fragments and represented either 
3' regions or internal fragments of transcripts. A 
summary of these clones is presented in Table 1. The 
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involvement of the majority of these cDNAs is well 
characterized either in tumorigenesis or in metastasis. 
The phenotypic characteristics of MCF-7 and MDA- 
MB-231 ideally match with the biological significance of 
these genes. The alterations in transcripts for Rho 
signaling proteins, Ca 2+ binding/requiring proteins, 
tight junctions/anchoring junctions/gap junctions, cop- 
per binding or sensitive proteins, and RUNX2 are 
particularly noteworthy. 

The differential expression of a representative number 
of RDA clones was validated by semiquantitative PCR. 
As shown in Figure 1, these transcripts were either 
specific to or upregulated in the ceil line that was used as 
a tester. Such analyses demonstrated that more than 
90% of the clones were differentially expressed. The 
abundance of transcripts and the results of RT-PCR 
were also confirmed by Northern blotting (Figure 2). 
The pattern of hybridization clearly indicates that all 
these transcripts showed differential expression in 
MCF7 and MDA-MB-231 cells that were used as 
driver/tester combinations for the RDA. 



Gene microarrays 

After obtaining preliminary molecular signatures of 
MCF7 and MDA-MB-231 cells by RDA, we used 
Affymetrix gene micoarrays to expand the above 
analysis to identify a comprehensive set of transcripts 
that is deregulated in invasive breast carcinoma cells. 
The comparisons of cell lines on the basis of transcripts 
that are either present or absent as shown in Figure 3 
revealed that a set of 123 genes distinguishes MDA-MB- 
231 cells from MCF7 and MCF10A. These genes can be 
classified by their involvement in functional classes such 
as transcription, signal transduction, cell adhesion, cell 
cycle, metabolism, transport, response genes and devel- 
opment (Figure 4). The majority of these genes 
participated in the process of signal transduction 
followed by transcription, cell adhesion and metabo- 
lism, respectively. A few transcripts in these classes were 
tested by real-time RT-PCR to confirm their altered 
abundance. The selected transcripts showed changes 
ranging between two- and 10-fold, 11- and 20-fold and 
greater than 20-fold, and were in close agreement with 
the results of microarray analysis. The qualitative 
pattern of change observed in microarrays analysis 
was readily reproduced by real-time or semiquantitative 
RT-PCR for all transcripts tested. 

The number of altered transcripts was over 1000 
based on a change of twofold or greater, and a majority 
of these genes, show changes varying between two- and 
fourfold (Figure 5), Interestingly, with all comparisons 
combined, there were 21 genes downregulated more 
than 50-fold and 55 genes that were upregulated more 
than 18-fold when specific cell line pairs were compared 
(Figure 5). The transcripts that represent the extremes of 
upregulated and downregulated scale can allow distinc- 
tion between MCF7 and MDA-MB-231 cells. These 
transcripts include GPCR 1 1 , cadherin 1 1 , annexin 
Ai, vimentin, lactate dehydrogenase B (upregulated 
x in " MDA-MB-231) and GREB1, S100A8, amyloid 
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Table 1 Differentially expressed transcripts identified by RDA 



Upregulated in MDA-MB-231 



Downregulated in MDA-MB-231 



Extracellular matrix/matrix-crosslinking proteins 
LOX 

Laminin fi 1 
Collagen Vl-a 1 

Calcium-binding proteins 
Reticulocalbin 1 
S100A8 
Cullin 5 

Transcription factors! promoter-binding proteins 
RUNX2 
c-Jun 
Fra-1 

Cell-cell adhesionj cell-surface receptor proteins: 

Cdhll 
CYR61 

MHC class II antigen y chain 
Protease-activated receptor- 1 
Protease-activated receptor-4 



ATPase/GTPase and signal transduction proteins 
ATP7a 
Caveolin 2 

AXL receptor tyrosine kinase 
Rho GEF 3 
Rho/Rac GEF 18 
P21-Rac2 

Metalloproteases and MM? inhibitor proteins 
TIMP-2 
MTl-MMP 

Stress-response proteins 

Dual specificity phosphatase (DUSP) 

Cytoskeletal component and binding proteins 
Moesin 
Vimentin 
Filamin B 

Cell-cycle regulation and growth/ differentiation! 'apoptosis proteins 
Cyclin B 1 
Cyclin E 
Cyclin A2 
Bcl2-like 1 protein 

Secreted proteins and growth factors 
Milk fat globule protein 
TGF-ot 

SMAD-specific E3 ub ligase 2 

Miscellaneous 
Prion protein 



Calcium-binding proteins 
CaJgranulin B 



Transcription factors/promoter-binding proteins: 
Chromosome 4 ORF 
Estrogen receptor 1 
RUNX2 variant (exon 8 deleted) 

Cell-cell adhesion I cell-surface receptor proteins 
Claudin 3 

Amyloid precursor protein 
Triose phosphate isomerase 
Plakoglobin 
Cdh I 
Cdh 3 

Annexin A9 
RAR-oe 
Connexin 31 

ATPasejGTPase and signal transduction proteins 

RhoD 

RhoB 

TGF-/? Rl 
Stress-response proteins 

Protein kinase H 1 1 

Cytoskeletal component and binding proteins 
Keratin 18 
Tubulin 5 1 

Microtubule-associated protein t 

Cell-cycle regulation and growth) differ entiationfapoptosis proteins 
S100A13 
S100C 

Aurora kinase AIK2 
Nucleosidediphosphate kinase 

Secreted proteins and growth factors 

Trefoil factor 3 (TFF3) 

Trefoil factor 1 (TFF1) 

Four and a half LIM domain 1 

Solute carrier family 16 SLCI6A6 
DNA replication 

DNA replication complex GINS-PSF2 
Miscellaneous 

Serine protease inhibitor type 1 (SPINT1) 

Human homolog of Xenopus protein XAG 

Hypothetical protein FLJ22222 

Hypothetical protein 20171 

Hypothetical protein MGC3265 



/? precursor protein, claudin 3 and cadherin 1 (down- 
regulated in MDA-MB-231). The distinction between 
MCF7 and MCF10A may be made based on keratin 19, 
serine protease, amyloid precursor, neuropeptide Y ; 
receptor Yl (upregulated in MCF7) and caldesmon, 
annexin ,A1, epithelial membrane protein 1, S100A2, 



keratin 15 (downregulated in MCF7). Likewise, 
MDA-MB-231 cells differ from MCF10A in vimentin, 
epithelial membrane protein 3, cadherin 1 1, GPCR 
116, collagen type XIII a 1, BcI2-associated athanogene 
2 (upregulated in MDA-MB-231) and keratin 15, 
cystatin A, cadherin 1, CD24, calcium- activated chlor- 
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ide channel, SI OOP, GPCR 87 (downregulated in 
MDA-MB-231). Thus, a small subset of transcripts 
may serve as an accurate signature of these cell lines. 
Several of these gene products have been shown to 
participate in tumorigenesis and invasiveness of breast 
carcinoma cell lines. 

The invasiveness phenotype of MDA-MB-231 cells 
specifically relates to changes in the following functional 
classes: (a) cell adhesion molecules, (b) Ca 2+ requiring, 
Ca 2+ binding or Ca 2+ regulatory genes, (c) copper- 
sensitive or copper-transporting proteins and (d) specific 
regulatory proteins of Rho signaling. Among these 
functional groups, 23 transcripts involved in cell-cell 
or cell-matrix interactions are underexpressed in MDA- 



MB-231 cells and 21 transcripts were overexpressed in 
this cell line (Table 2). The comparison of Ca 2+ - 
requiring/binding genes indicated downregulation of 
26 transcripts and upregulation of 26 transcripts in 
invasive cells as compared to noninvasive cells (Table 3). 
While Ca 2 + homeostasis is extensively investigated in 
human cancers, copper homeostasis is an underexplored 
area. The alterations in copper homeostasis in breast 
carcinoma cells were reflected by changes in trans- 
cripts corresponding to a variety of copper-binding or 
copper-sensitive proteins/enzymes (Tables 1 and 4). The 
deregulation of Rho signaling was evident from changes 
in various proteins involved in this pathway (Table 4). 



2331 




IKcyetev 30 cycles 

k(. ?* J >.♦ 5 b* '7 K* 9 JO* M ■ M* "T* -'13* J 4 15* Hi 17* 3*: 




Figure 1 Semiquantitative evaluation of selected transcripts, (a) 
RNA was isolated from confluent culture dishes containing 
MCF10A (Janes 1 and 4), MCF7 (lanes 2 and 5) or MDA-MB- 
231 (lanes 3 and 6) cells. The amount of RNA was first determined 
spectrophometrically. Equal amounts of RNA, as determined by 
absorbance at 260 nm, were amplified with primers specific to actin 
gene for 18 cycles (lanes 1-3) or 20 cycles (lanes 4-6). The lane 
containing size markers is labeled as M. (b) A set of primers 
corresponding to caveolin 2 (lanes 1 and 2), TGF-a (lanes 3 and 4), 
Moesin (lanes 5 and 6), LOX (lanes 7 and 8), Axl receptor (lanes 9 
and 10), RhoD (lanes 11 and 12), S100A13 (lanes 13 and 14), TFF3 
(lanes 15 and 16) and Claudin 3 (lanes 17 and 18) were amplified 
for 32 cycles. Lanes 1, 3, 5, 7, 9, 12, 14, 16 and 18 represent 
amplified products from MCF7 and lanes 2, 4, 6, 8, 10, 1 1, 13, 15 
and 17 represent MDA-MB-231 cells. The lanes containing PCR 
products from MDA-MB-231 cells are marked with an asterisk. 
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Figure 2 Analysis of selected transcripts by Northern hybridiza- 
tion. The probes specific to CD74 (a), CYR61 (b), SPINT1 (c) and 
DUSP (d) were labeled with a J2 P nucleotide and hybridized to 
blots containing RNA from MCF10A (lane 1), MCF7 (lane 2) 
and MDA-MB-231 (lane 3). The blots were washed stringently and 
developed as described. The amounts of RNA loaded were 
normalized as in Figure 1 . 
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Figure 3 Comparison of cell lines based on the presence or absence of transcripts. The absence or presence of a transcript in the 
AfTymetrix chip was scored by the fluorescence read-out as described in the Materials and methods section. 



Oncogene 



^332 



Molecular signatures of breast carcinoma cells 

GM Nagaraja et ai 




Gene Ontology 



1 Absent in MCF7 - Present in MD A MB 231 & MCF10A ■ Present in MCF7 - Absent in M0AMB231 & MCF10A 
Present in MDAMB231 - Absent in MCF10A & MCF7 □ Absent in MDAMB231 - Present in MCF10A & MCF7 



Figure 4 Functional classes of transcripts that differentiate a cell line pair. The transcripts identified as present or absent were 
classified based on their functional importance. 



Gene Expression Levels 




Table 2 Cell adhesion molecules altered in breast carcinoma cells 
Upregulated in MDA-MB-231 Downr emulated in MDA-MB-231 



Downregulated ) Upregutated 

No. of Folds 

| □ MCF7 vs 1 0A O MDAMB231 vs 1 0A ■ MDAMB231 vs MCF7 1 

Figure 5 Distribution of altered number of transcripts as a 
function of fold change. The altered transcripts were categorized in 
groups based on the magnitude of change in their abundance. 



Cadherin 4 
Cadherin 11 
Catenin 
Integrin a 6 

Transmembrane anchor protein 

Eph B2 

Dystouin 

Laminin ft 1 

Lam in 

Filamin B 

Filamin C 

Tailin 1 

Butyrophilin 

Spectrin-a 

Spectrin-/? 

Thrombospondin 

Plastin 3 

Adducin 3 y 

Lamin B receptor 

Laminin p 2 

Lamin A/C 



Claudin3 
Cadherin- 1 
Cadherin-3 
Cadherin- 18 

Cadherin, LAG seven pass receptor 

Down syndrome cell adhesion 

Catenin-^2 

Eph A4 

Ephrin A4 

Annexin A9 

Ankyrin 3 

Sarcoglycan 

Keratin 8 

MAP-7 

MAP-x 

Plakoglobin 

Plakophilin 

Discoidin domain receptor 
Zona occludens 3 
Periplakin 
Protocadherin a 9 
Laminin y 2 
Laminin a 3 



Proteome analysis 

To identify altered abundance of proteins and relate it to 
transcript profiles, we characterized the protein profiles 
of MCF-10A, MCF-7 and MDA-MB-231 cells. Typi- 
cally, > 300 protein spots could be visualized in silver- 
stained gels, and there were far fewer protein spots in 
gels that were stained with Coomassie blue. The 
comparison of MCF7 or MDA-MB-231 proteins with 
MCF-10A revealed that MCF-7 had li unique protein 
spots, while MDA-MB-231 had 15 spots that were not 
seen in MCF-10A. These proteins were either specific to 
or upregulated in these cell lines. The identity of these 
proteins is shown in Table 5. Out of these 26 protein 
spots, only 25 yielded amino-acid sequence. As shown in 
the table, the list includes proteins involved in stress 



response, protein- tagging activities, calcium-binding and 
calcium homeostasis proteins and some regulatory 
proteins. Prominent among these changes were proteins 
involved in calcium homeostasis such as crocalhin, 
calreticulin, calcyclin and reticulocalbin. The changes in 
signaling pathways between the two cell lines were 
indicated by altered levels of Rho GDP dissociation 
inhibitor 1 , an apoptosis/differentiation regula- 
ting protein galectin, Myc expression regulator far 
upstream binding protein- 1 and the microtubule reg- 
ulator protein stathmin. The translation initiation 
factors 5A and 4H were also selectively upregulated in 
MDA-MB-231 cells. 
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Table 3 Calcium binding or sensitive transcripts 
Upregulated in MDA-MB-231 Downregulated in MDA-MB-231 



Reticulocalbin 
Dystonin 
Folltstatin like 1 
Cullin 

Annexin A5 

EF hand domain containing- 2 

Hippocaicin like-2 

LDLR 

Steroid sulfatase 

MT-actin crosslinking factor 

Inositol- 1 ,4,5-triphosphate 

receptor 3 

Sorcin 

Guanine nucleotide- binding 

protein-y 

GAS6 

SWP-70 protein 
Jagged 1 

EGF-containing fibrulin-like 
ECM 

Transglutaminase 2 

Thrombin receptor-like 1 

Plastin 3 

FYN oncogene 

PKC-a 

Calmegin 

Calpain 

HEG homo log 

Cyr61 



S100A8 
S100A7 
S100A13 

Tumor associated Ca 2 + 
1 signal transducer 
Notch homolog 3 
PKD2 

Adenylate cyclase 
Phospholipase-C 
Chemokine ligand 1 2 
Ubquitin-specific protease 
PKC-^ 

Ret-pro tooncogene 

Signal peptide-CUB domain 

Mannosidase a 

Bradykinin receptor B2 

Solute carrier family 24 

Ca 2+ channel voltage-dependent /?3 

Regulator of G protein signaling 17 

Dystrobevin-oc 

Synaptogamin 1 

Matrix gla protein 

EF hand domain family member Dl 

PK-cyclic AMP dependent 

ATPase-Ca 2 + transporting 

Calmodulin 1 

CaM kinase 



Tabic 5 Proteomic profiles of MCF7 and MDA-MB-231 cells as 
compared to MCF10A cells 



MCF7 cells 



MDA-MB-231 cells 



Cell-cell adhesionj cell-surface 
receptor proteins 
Triose-phosphate isomerase 

Stress-response proteins 
Hsp27 

Superoxide dismutase 
Peroxiredoxin 2 



Cytoskeletal component /binding 
proteins 
Stathmin 

Cell-cycle regulation and growth/ 
differentiation/ apop to sis pro teins 

Nucleoside diphosphate kinase A 

S100C 

Secreted proteins and growth factors 
Macrophage migration inhibitory 
protein 

Miscellaneous 
Cyt c oxidase VIb 
Peptidyl-prolyl cis-trans isomerase 
Ubiquitin 



Calcium'binding proteins 
Calcyclin 
Calreticulin 
Crocalbin 
Reticulocalbin 

Transcription Factors/ 
Promoter-binding proteins 

Far upstream element 

binding protein- 1 

Far upstream element 

binding protein-2 

Cell-cell adhesion/ cell-surface 
receptor proteins 
Galectin 

A TPase/GTPasef signal trans- 
duction/ trafficing proteins 

Rho GDP dissociation 

inhibitor 1 

Stress-response proteins 
Hsp70 

Peroxiredoxin 2 

Cytoskeletal component/ 
binding proteins 
Stathmin 



Table 4 Altered transcripts involved in copper homeostasis and Rho 
signaling 



Copper-binding proteins 


Rho signaling proteins 


LOX 


Rho3 


LOX-I, LOX-2 


Rho/Rac GEF 18 


SCO cyt oxidase-deficient 


Rho GEF 12 


homolog 2 


Ras related C3 botulinum toxin 


COX 17 homolog 


substrate 2 


Metallothionein IE, IF and 2a 


Cdc 42 effector protein 3 


Ring finger protein7 


Rho GEF 3 


Amiloride- binding protein 1 


Rho GDP dissociation inhibitor p 


Neurotrypsin/motopsin 


RhoD 



Changes in proliferation characteristics and protein 
profiles in response to transfeciion with antisense 
constructs of selected transcripts 

We had observed significant upregulation of transcripts 
for DNA replication complex protein GINS PSF2, 
trefoil factor 3, aurora kinase AIK2, protein kinase HI 1 
and secreted protein XAG in MCF7 cells. We reasoned 
that antisense knockdown of the above genes in MCF7 
cells might indicate pathways involved in tumorigenesis 
and invasiveness. 

MCF7 cells were transfected with empty vector 
pCDNA3.1 or antisense constructs of the above genes. 
A semiquantitative amplification of pCDNA marker 
gene by RT-PCR confirmed the presence of the 
transfected construct in a significant proportion of the 
cell population. The transfected cells also showed a 
decrease in the target transcripts as observed by RT- 



Miscellaneous 
Heterogeneous nuclear 
ribonucleo protein H 
eIF4H (translation) 
elF5A (translation) 



PCR. The effects of antisense transfections were scored 
by growth characteristics of the transfectants. The cell 
proliferation was reduced between 15 and 40% when 
antisense transfectants were compared to cells trans- 
fected with empty vector. 

In order to relate decreased proliferation of transfec- 
tants to altered proteins, proteomic profiles of transfec- 
tants were compared with vector controls. The 
comparison of protein profiles of cells transfected with 
empty vector or antisense construct revealed alterations 
in several proteins for each transfectant (Table 6). These 
proteins included stress- response proteins, calcium- 
regulating proteins, translation factors, ubiquitin, pro- 
teins of electron transport chain and oxidative phos- 
phorylation, signaling proteins, cytokeratins, actin and 
actin regulating proteins and general regulatory factors. 
The number of altered proteins varied between 5 and 15 
for various transfections. Peptidyl prolyl cis-trans 
isomerase, calcium-regulating proteins, SOD, galectin, 
histidine triad protein and PKC substrate were promi- 
nent among altered proteins. We performed database 
searches to identify inter actors for all proteins that were 
altered in transfected cells and identified nearly' 350 
proteins (data not shown). A significant number of these 
interacting proteins are involved in transcriptional 
regulation. » 



Oncogene 



Molecular signatures of breast carcinoma cells 

GM Nagaraja et af 



^334 



Table 6 Altered proteins in MCF7 cells after antisense knockdown of specific transcripts 



After transfection with as- trefoil factor 3(TFF3) 



After transfection with as-protein kinase Hll 



Calcium • binding p ro teins 

Calmodulin 
Cell-cell adhesion) cell-surface receptor proteins 

Retinoic acid-binding protein II 
ATPase-GTPase J signal iransductionj 'trafficing proteins 

Raf kinase inhibitor 
Stress-response proteins 

Hsp27 

Peroxiredoxin 1 
Cytoskeletal component /binding proteins 

Cofilin-nonmuscle isoform 
Cell-cycle regulation and growth/ differentiation/apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

Translate onally controlled tumor protein 
Miscellaneous 

40S ribosomal protein S 12 

ATP synthase D chain 

Cancer-associated Sm-like protein 

Cyt c oxidase polypeptide Va 

eIF5A (translation) 

Ferritin heavy chain 

His triad nucleotide- binding protein 

Histone H2B.n 

Peptidyl-prolyl isomerase 

Proteosome subunit /Mype 6 
, RNA-binding protein 8A 

Thioredoxin 

Ubiquitin crossreactive protein 

After transfection with as-aurora kinase AIK2 

Calcium-binding proteins 

Calgranulin B 

Calgranulin A 
Stress-response proteins 

Hsp27 

Superoxide dismutase 
Miscellaneous 

60S acidic ribosomal protein P2 
Peptidyl-prolyl isomerase 
Ubiquitin-crossreactive protein 



Cell-cell adhesionj cell-surface receptor proteins 
Galectin 

Retinoic acid-binding protein II 
A TPQse- G TPasef signal transduction /trafficing proteins 

PKC substrate 
Stress-response proteins 

Hsp27 

Cytoskeletal component /binding proteins 

Actin-oc, skeletal muscle 

Cytokeratin 18 
Miscellaneous 

30H-acyl Co A dehydrogenase II 

ATP synthase A chain 

Cyt c oxidase peptide Va 

Enhancer of rudimentary homolog 

Thioredoxin 

Ubiquinol-cyt C reductase 
No match 

After transfection with as-DNA replication complex GINS PSF2 
Calcium Binding proteins; 
Calreticulin 

A TPase-GTPase/ signal transduction / traffi cing proteins 

14-3-3 r\ 

PKC substrate 
Cytoskeletal component /binding proteins: 

Cytokeratin 1 

Cytokeratin 18 

Cell-cycle regulation and & growth/ differ entiation/apoptosis proteins 

Chromobox protein homolog 1 

Chromobox protein homolog 3 

MAP/MT affinity regulator 
Miscellaneous 

ATP synthase D chain 

Peptidyl-prolyl isomerase 

Ubiquinol-cyt c reductase 

After transfection with as-human homolog of XAG 
Calcium-binding proteins 

Calreticulin 
Stress-response proteins 

Hsp27 
Miscellaneous 

ATP synthase A chain 

Peptidyl-prolyl isomerase 

Ubiquitin crossreactive protein 

No match 



Discussion 

The results presented here validate the gene profiles 
obtained from different expression platforms ranging 
from subtractive hybridization to gene microarrays and 
proteomic analysis. The RDA protocol is powerful 
' enough to yield important genes that show significant 
alterations in their expression between cell lines, and can 
lead to isolation of full-length cDNAs by using 
appropriate modifications (Baskaran et aL, 1996; Jacob 
el aL, 1997). The detection of RUNX2, variant of 
RUNX2, EphB6, prion protein, lysyi oxidase and a 
copper transporter ATP7A transcripts by RDA warrant 
specific mention. RUNX transcription factors bind 
specific motifs on target gene promoters and regulate 
gene expression leading to cell growth, proliferation and 



differentiation (Pratap et al 9 2003). RUNX2 and its 
variant have differential repression activity toward the 
promoter of the cyclin-dependent kinase inhibitor 
(p21CIPl) (Westendorf et aL, 2002). The loss of EphB6 
expression due to methylation of its promoter is related 
to invasiveness of MD A-MB-23 1 (Fox and Kandpal, 
2004: unpublished observations). Lysyl oxidase, a 
copper-sensitive enzyme, causes oxidative deamination 
of lysine and hydroxy lysines of collagen to aldehyde 
forms to stabilize collagen fibrils (Siegel, 1976) that are 
found in invasive breast carcinoma cells (Akiri et aL, 

2003) . The activation of LOX is dependent on copper 
that is internalized and then transported to trans golgi 
network by copper transporter ATP7A (Pase et * aL, 

2004) , a protein mutated in Menkes disease (Moller 
et aL, 2005). Prion protein has also been reported as a 
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chaperone for copper (Jones et al, 2004). Thus, EphB6 
can serve as a marker for invasiveness, and LOX and 
ATP7A may be exploited as relevant targets for 
therapeutic intervention. 

The downregulation of junctional proteins along with 
inactivation of TIMPs as shown here is in agreement 
with other reports describing their relationship with 
invasiveness of carcinoma cells (Johnson, 1991; Kousi- 
dou et al, 2004; Shao et al., 2005) and promoter 
methylation (Costa et al, 2004). As several transcripts 
coding for junctional proteins are downregulated in 
invasive cells, we postulate that methylation-specific 
PCR can be exploited to use these transcripts as 
biomarkers of tumor cells in general and invasiveness 
in particular. The changes in cell-cell interaction 
correlate to cell phenotypes because such interactions 
influence Rho/Ras signal transduction pathways and 
vice versa (Malliri and Collard, 2003; Nagaraja and 
Kandpal, 2004; Ridley, 2004), and lend credence to the 
significance of altered transcripts for Rho and Rho 
GEFs as presented here. 

Early changes in calcium homeostasis as measured by 
calcium excretion have been reported in breast cancer 
(Campbell et al, 1983), and altered calcium signaling has 
been shown in invasive lung carcinoma cells (Amuthan 
et al, 2002). Prominent among calcium-binding proteins 
are S-100 protein, a group of intracellular messengers 
that respond to transient changes in calcium concentra- 
tion by binding to specific receptors (Marenholz et al, 
2004) and regulate cell growth, differentiation and 
motility, transcription and cell cycle. The S-100 proteins 
detected in the present study map to chromosome lq21, 
a region of genome that is frequently altered in human 
breast cancer cells (Bieche et al, 1995). Calcium ions act 
as a second messenger in specific signaling pathways in a 
variety of cancers (Missiaen et al, 2000) and are known 
to alter calcineurin to activate transcription factors such 
as NFATc (Luo et al, 1996). 

As dictated by post-transcriptional regulation, protein 
profiles showed far fewer changes as compared to 
transcript profiles, and the knockdown of five selected 
genes in MCF7 cells produced interesting changes in 
protein profiles. These genes, namely, XAG (secretory 
Xenopus laevis protein), trefoil factors 3, human aurora2 
kinase AIK2, protein kinase HI 1 and DNA replication 
complex GINS PSF2, have been shown to be estrogen 
responsive, oncogenic or involved in tumorigenesis (Yu 
et al, 2001; Fletcher et al, 2003; Katoh, 2003; Warner 
et al, 2003; Takayama et al, 2003). The antisense 
constructs of these genes appeared to work as siRNAs 
as suggested by the reduction in the transcript detected 
in RT-PCR of RNA isolated from the transfected cells. 
The involvement of the above transcripts in invasive 
potential is apparent from the observed upregulation of 
calcium-binding proteins in transfected MCF7 cells, 
which is comparable to their levels in MDA-MB-231 
cells. The proteins that appear to mediate inhibition of 
proliferation in antisense-transfected cells include PKC 
substrate, Raf kinase -inhibitor, histidine triad nucleo- 
tide-binding protein and peptidy-prolyl isomerase 
(Pinl). We believe histidine triad protein effects are 



most likely mediated via its interaction with ATM 
protein. Raf kinase inhibitor (Keller et al, 2004) and 
ATM (Hall, 2005) have been conclusively linked to 
transformation of cells, and the activity of Pinl has been 
related to p53-mediated signaling pathways (Mantovani 
et al, 2004; Berger et al, 2005). In this context, p53 
activation has also been hypothesized by Cu-SOD 
prion-like enzyme (Wiseman, 2005). Thus, alterations 
in copper homeostasis and p53-mediated signaling may 
be considered as a significant regulatory mechanism in 
tumorigenesis. 

In summary, we have presented here a set of 
candidate genes that can serve as biomarkers for 
tumorigenesis and invasiveness, and some of these 
markers may be used to develop DNA -based diagnostic 
tests. The alterations in transcripts for copper home- 
ostasis genes suggest copper chelation or inhibition of 
copper transporter ATP7A as potential targets for 
therapeutic application. The modulation of RUWC2 
splicing variants by chemicals that affect splicing 
machinery may also be explored as a therapeutic 
modality. The changes in EphB6 expression, if con- 
firmed in tumor specimens, may have prognostic 
significance. 



Materials and methods 

Breast cancer cell lines 

We used MCF-10A, a cell line established from normal breast, 
and two breast carcinoma cell lines MCF-7 and MDA-MB- 
231 that vary in their in vitro and in vivo invasiveness. All cells 
were cultured at 37°C/7% C0 2 . MCF-10A cells were grown in 
1:1 DMEM.F12 media (Gibco) with 5% horse serum (Gibco), 
20mM HEPES, lOng/ml EGF (Invitrogen), 10ml/l PenStrep- 
Glutamine (lOOOOU/ml penicillin, 10000^g/ml streptomycin 
and 29.2mg/ml L-glutamine), 10/ig/ml insulin (Invitrogen), 
0.1/ig/ml Cholera Toxin (Sigma) and 500ng/ml hydrocorti- 
sone (Sigma). MCF-7 and MDA-MB-23 1 cells were grown in 
DMEM (Gibco) supplemented with 2mM L-glutamine (Gib- 
co), 1 mM sodium pyruvate (Gibco), 5 ml/1 penstrep (5000 U/ 
ml penicillin and 5000/ig/ml streptomycin), and 10% fetal 
bovine serum (Hyclone). 

Total RNA isolation 

RNA was isolated from 85 to 95% confluent 10 cm tissue 
culture dishes using TRI reagent (Molecular Research Center 
Inc.) with slight modifications to the recommended protocol. 
Approximately 10 million cells were mixed with 1.0 ml Tri 
Reagent, the mixture was extracted with chloroform and the 
aqueous phase containing RNA was separated. The RNA was 
precipitated with isopropanol, the pellet washed sequentially 
with 80 and 100% ethanol, then dried and resuspended in 
DEPC- treated water. RNA was stored in aliquots at -70°C. 
The quality of RNA was visualized by running on a 
formaldehyde gel. The appearance of ribosomal RNA bands 
indicated that RNA was not degraded during the procedure. 
The amount of RNA was determined by its absorbance at 
260 nm. 

DNAase treatment of total RNA 

To remove DNA contamination, 20 /xg of RNA (quantified 
speciiophotometrically) was treated with 500 ng DNAase I, 
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80 U RNasin (Promega) and 1 mM MgCl 2 in Tris buffer in a 
total volume of 50 /d. The reaction was carried out at 37°C for 
1 h and the DNAase inactivated by heating to 65 °C for 30 min. 

Representational difference analysis (RDA) 
RDA of cDNAs is a modification of genomic RDA. (Lisitsyn 
et al., 1993). We performed RDA in the following two ways. In 
one experiment, MCF-7 cDNA was used as a driver and 
MDA-MB-231 cDNA as a tester. In the second experiment, 
MCF-7 cDNA was used as a tester and MDA-MB-231 cDNA 
as a driver. The protocol has been describer previously (Jacob 
et al., 1997). Briefly, first-strand synthesis was carried out using 
a commercial cDNA synthesis kit as per the manufacturer's 
protocol. A linker with a BgR\ site was ligated to the tester as 
well as the driver cDNA. A primer specific to one of the linker 
strands was used to PCR amplify the linker-ligated cDNAs. 
The linkers were then removed by digesting the cDNA with 
BgH\ and the digested cDNA was gel purified. A second set of 
unphosphorylated BglU adaptors was ligated to the tester 
cDNA only. The tester and driver DNAs were hybridized in a 
5 /d reaction volume at a ratio of 1 :40. After hybridization, the 
ends of the tester homoduplexes were repaired with Klenow 
polymerase and 1 /d of the reaction mixture was diluted to 
100 pi. The difference product was obtained by amplifying 1 u\ 
of the diluted mixture using the top strand of the ligated 
adaptor as a primer. The amplified difference product was 
digested with EcoRl and cloned in a pBlueScript vector. 
Individual clones were picked up and sequenced by Sanger's 
dideoxy chain termination method. Representative clones were 
validated by Northern analysis and semiquantitaive RT-PCR. 

Microarray analysis 

The GeneChips, Human Genome U133A 2.0, (Affymetrix, 
Santa Clara, CA, USA) used in this study contained 
approximately 22000 probe sets corresponding to 18400 
transcripts and variants, including 14 500 well-characterized 
human genes. 

Total RNA was converted into double-stranded cDNA by 
using Superscript II (Invitrogen, Carlsbad, CA, USA) and an 
oligo-dT primer containing a heel of the T7 RNA polymerase 
promoter sequence. The reaction mixture containing double- 
stranded cDNA was extracted with phenol-chloroform, 
precipitated with ethanol, and dissolved in 12/d RNase-free 
water. The cDNA was transcribed in vitro by using a RNA 
transcription labeling kit (Enzo Biochem, Famingdale, NY, 
USA) with 6/A of double-stranded cDNA in the presence of 
ATP, CTP, GTP, UTP, bio-ll-CTP and bio-16-UTP. The 
biotinylated RNA was purified by using an affinity column 
(Qiagen, Valencia, CA, USA) and fragmented randomly, l?y 
heating to 95 °C in the presence of fragmentation buffer, 
between sizes of 35 and 200 bases. The GeneChips were 
hybridized overnight at 45°C in hybridization oven in a. 
solution containing fragmented cRNA, control oligonucleo- 
tide B2, 20 x eucaryotic hybridization controls, herring sperm 
DNA, acetylated BSA and 2 x hybridization buffer. The 
GeneChips were washed and stained with streptavidin- 
phycoerythrin and the antibody in 2 x MES stain buffer, 
acetylated BSA, and optically read at a resolution of 6fim with 
a Affymetrix GeneChip scanner 3000. Affymetrix MICRO- 
ARRAY SUITE : ' was used for initial data preparation 
(generation of .CHP files). Normalization (quantile method) 
and calculation of signal intensities was performed with the 
software package RMA from the R project (http://www.r- 
project.org/). For every cell line, three replicates were 5 
performed with Affymetrix Gene Chips. The Gene Chip data 
were used for further calculations after the raw image and 



MAS 5 analysis revealed a positive quality report. Ratios of 
average, signal intensity (log 2) were calculated for the probe 
sets between pairs of cell lines and then converted to an 
average fold change (AFC). Statistical validation was* per- 
formed on probe sets as described (Yoshida et al., 2004). The 
statistical method used to assign P-values to the fold changes 
of gene responses is described by Yoshida et al. (2004) and is a 
two-step procedure based on the Benjamini and Yekutieli 
construction of false discovery rate confidence intervals 
(FDRCI) (Reiner et al, 2003). Functional annotation of 
proteins was assigned through Gene Ontology (http://www. 
geneontology.org) or Locuslink (http://www.ncbi.nlm.nih.gov/ 
LocusLink) classifications obtained through appropriate pub- 
lic databases. 

Quantitative RT-PCR 

RNA was reverse transcribed with Superscript II (Invitrogen, 
Carlsabad, CA, USA) RT by priming with oligodT. The 
primers specific to validated genes were synthesized from the V 
untranslated region using Primer 3 software. PCR reactions 
were then performed in triplicates in an I-cycler Thermocycler ' 
with optical module (BioRad, Hercules, CA, USA). The 
amplified products were quantified by reading fluorescence of 
SybrGreen I (Molecular Probes, Eugene, OR, USA). Average 
fold changes were calculated by differences in threshold cycles 
(Ct) between pairs of samples to be compared. HPRT gene was 
used as a control. 

Semiquantitative RT-PCR 

The spectrophotometrically determined concentration of RNA 
was confirmed by amplifying actin message at different cycles. 
The cycling conditions that yielded proportional increment of 
amplified product was used to normalize the RNA concentra- 
tion- The normalized RNA was used as template to determine 
relative abundance of transcripts corresponding to clones 
identified by RDA experiments. The conditions were standar- 
dized in the range of cycles that yielded a PCR product for at 
least one of the pairs of compared RNAs. Such experiments 
dictated cycles between 30 and 35 to be appropriate to 
compare abundance of selected transcripts in MCF7 and 
MDA-MB-231 cells. 

Northern analysis 

The expression pattern of selected transcripts in cell lines was 
also analysed in Northern blots. RNA (20 /im), as determined 
by spectrophotometer and confirmed by actin amplification, 
was electrophoresed on a formaldehyde agarose gel. A RNA 
ladder was used as a size marker. The RNAs were transferred 
from the gel to a Hybond nylon membrane by capillary 
transfer. The RNA was fixed onto the membrane by 
irradiation in a Stratalinker. The blot was hybridized at 65 °C 
for 12-15 h with a radioactive probe and the blot was 
subsequently washed with 0.1 x SSC and 1% SDS at 65°C. 
The hybridized probe was detected by autoradiography. 

Transection of MCF-7 cells with antisense constructs 
The genes selected on the basis of their upregulation were 
cloned in antisense orientation in pCDNA3.1 vector (Inviro- 
gen). MCF-7 cells were grown to 70-80% confluence. 
Approximately 4 jig of DNA was transfected into MCF-7 
cells by using Lipofectamine 2000. The transfected cells were 
grown in the presence of G418 (400/ig/ml). The transfectants 
were processed for protein isolation. A control set of cells was 
transfected with an empty pcDNA3.1. The proteins were 
analysed by two-dimensional electrophoresis, and altered 
bands were excised for mass spectrometry. 
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Protein isolation 

The cultured cells were harvested by trypsinization and 
centrifuged at 220 g for 5min at 4°C. The cell pellet was 
washed once with ice-cold 1 x PBS. The proteins were isolated 
by using a commercial kit (Bio Rad, Hercules, CA, USA.). 
Briefly, pelleted cells (0.05 ml) were mixed with 0.5 ml ice-cold 
CPEB solution containing protease inhibitors coclctail 
(Roche), vortexed and stored on ice for 30min. The cell 
suspension was passed through a syringe needle (20 gauge) for 
10-20 strokes to ensure complete cell lysis. The cytoplasmic 
protein fraction was collected by centrifugation at lOOg for 
lOmin at 4°C. The nuclear pellet was washed once again with 
0.5 ml CPEB solution. The nuclear pellet was resuspended in 
0.75 ml PSB buffer, vortexed briefly and centrifuged at 1000 g 
for lOmin at 4°C, and the supernatant containing nuclear 
protein was collected into a new tube. The samples were 
quantified using 2D Quant kit (Amersham Biosciences), 
aliquoted and stored at -80°C to prevent protein degradation. 
To reduce streaking, background staining and the other gel 
artefacts associated with substances contaminating 2D/IEF 
samples, the samples were cleaned with 2D Clean up kit (Bio 
Rad, Hercules, CA, USA) before running on the gel. 

Two-dimensional gel electrophoresis 

The protein mixtures were separated based on isoelectric 
points by using commercial pre-cast pH gradient gel strips 
according to the manufacturer's instructions. The protein 
sample (175 /ig) in 185/il of sample buffer (8M urea, 2% 
CHAPS, 0.2% biolytes, 3/10 ampholytes, 65 mM DTT and 
0.002% bromophenol blue) was loaded in the sample loading 
trays at the end of 1 1 cm immobilized rehydrated strips (pH 3- 
10) (Bio Rad, Hercules, CA, USA). Following isoelectric 
focusing, proteins were reduced and alkylated by successive 
15min treatments with equilibration buffer (6M urea, 0.375 M 
Tris-HCl pH 8.8, 2% SDS, 20% glycerol, 2% DTT) and 2.5% 
(W/V) iodoacetamide, respectively. Proteins were then re- 
solved in the second dimension on 8-16% gradient SDS- 
PAGE gel (Bio Rad, Hercules, CA, USA). The protein spots 
were visualized by staining with either silver stain or 



Coomassie blue stain. The gel images were compared and 
bands showing significant (greater than twofold) alterations in 
intensity were excised and processed for mass spectrometry. 
Comparisons were made between protein ly sates from MCF- 
10A, MCF-7 and MDA-MB-231 cell lines or between MCF-7 
and MCF-7 cells transfected with specific antisense constructs. 

Protein identification by enzymatic digestion followed by mass 
spectrometry 

Prior to performing trypsin digestion, the gel pieces containing 
protein spots were washed sequentially once with water and 
twice with acetonitrile. The gel pieces were then allowed to 
swell in lOOmM ammonium bicarbonate and finally washed 
with acetonitrile. The washed slices were dried in a Speed Vac 
concentrator, and subsequently incubated with 20 ^1 of 
Promega's autocatalysis-resistant trypsin (12.5ng//d in 50 mM 
ammonium bicarbonate and 5mM CaCl 2( pH 8.0) overnight at 
37°C. The supernatant (5 fd) from tryptic digest was injected 
for peptide sequence analysis using on-line capillary liquid 
chroma to graphy-electro spray ionization- tandem mass spectro- 
metry (LC-MS/MS). The front end HPLC utilized a Dionex 
(San Francisco, CA, USA) Vydac 300 ^m inner diameter 
x 15 mm C18 column. The linear acetonitrile gradient (3%/ 
min, containing 0.02% TFA) was developed using a Hewlett- 
Packard 1 100 pump operating at 0.1 ml/min, and the flow was 
split before the injector such that the flow rate through the 
column was 3 ^1/min. Peptides were detected at 215nM. The in- 
line mass spectrometer was a ThermoElectron LCQ-DECA 
instrument operated in data-dependent MS/MS mode, and 
proteins were identified by searching a nonredundant protein 
database using the Sequest program. 
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Identification of androgen-regulated genes in the 
prostate cancer cell line LNCaP by serial analysis 
of gene expression and proteomic analysis 

A common therapy for nonorgan-confined prostate cancer involves androgen depriva- 
tion. To develop a better understanding of the effect of androgen on prostatic cells, we 
have analyzed gene expression changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer line LNCaP f at both RNA and protein levels. 
Changes at the RNA level induced by DHT were determined by means of serial analysis 
of gene expression (SAGE), and protein profiling was done by means of quantitative 
two-dimensional polyacrylamide gel electrophoresis. Among>123 371 transcripts anal- 
yzed, a total of 28844 distinct SAGE tags were identified representing 16570 genes. 
Some 351 genes were significantly affected by DHT treatment at the RNA level 
(p < 0.05), of which 1 47 were induced and 204 repressed by androgen. In two indepen- 
dent experiments, the integrated intensity of 32 protein spots increased and 12 
decreased at least two-fold in response to androgen, out of a total of 1031 protein 
spots analyzed. The change in intensity for most of the affected proteins identified 
could not be predicted based on the level of their corresponding RNA. Our study pro- 
vides a global assessment of genes regulated by DHT and suggests a need for profiling 
at both RNA and protein levels for a comprehensive evaluation of patterns of gene 
expression. 
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1 Introduction 

Androgens affect numerous aspects of prostate biology 
including development, growth, and maintenance. They 
also play a critical role in tumorigenesis and progression 
of prostate cancer. Androgen deprivation is an estab- 
lished treatment modality for prostate cancer. However, 
the disease eventually progresses into a hormone refrac- 
tory cancer. Several mechanisms have been identified 
which may contribute to androgen independence: (1) 
Mutations in the androgen receptor (AR) lead to ligand- 
independent activation or promiscuity of the receptor 
[1 , 2]. The ability of the receptor to activate or repress 
downstream genes can also be affected by mutation. AR 
mutation is associated with advanced phases of prostate 
cancer [3, 4]; (2) AR can be activated in a ligand-indepen- 
dent manner by specific growth factors and cytokines [5]; 
(3) AR gene amplification has been found to occur in 
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28-30% of tumors that recurred post androgen -ablation 
therapy [6, 7]. Walten et a/. [8] have recently shown that 
one -third of locally recurrent hormone refractory prostate 
cancer contain AR gene amplification; (4) Coactivator f 
amplification and corepressor down-regulation have 
been shown to increase receptor transactivation [9, 10]. 
Regardless of which pathway(s) is taken by the tumor 
cells, androgen-regulated genes may ultimately be the 
key players in the development of hormone refractory 
cancer. As a step towards a better understanding of the 
effect of androgen on gene expression, we have under- 
taken a comprehensive assessment of gene expression 
changes induced by dihydrotestosterone (DHT) in the 
androgen responsive prostate cancer cell line LNCaP. An 
important feature of our study is the parallel assessment 
of expression changes at both RNA and protein levels. 



2 Materials and methods 

2.1 Prostate eel! line and DHT treatment 

LNCaP human prostate cancer eel I line (American Type Cul- 
ture Collection, Rockville, MD, USA) was cultured for 3 d in 
phenol-free RPMI 1640 supplemented with 5% charcoal- 
stripped fetal bovine serum (FBS) at 37°C. Half of the cul- 
tures were then treated with 1 0" 9 m DHT for 24 h. TotaJ R NA 
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and protein were extracted from untreated and DHT treated 
cells using TRIzol (Invitrpgen, Carlsbad, CA, USA) reagent 
for RNA and a solubilization cocktail for proteins, consisting 
of 9 m urea, p-mercaptoethanol and 1 % NP-40. For kinetics 
experiments 1 x 10 6 cells were plated in phenol-free RPMI 
medium with 5% charcoal-stripped FBS. On day three, 
cells were treated with 5 mL of media containing 10" 9 m 
DHT and total RNA was extracted at different time points 
from 0-96 h.To determine the direct/indirect effect of andro- 
gen on RNA levels, cells were plated as above and treated 
with 5 ^ig/mL of cyclohexamide in the presence or absence 
of 1 0- 9 m DHT for 24 h and total R NA was extracted. 

2.2 Serial analysis of gene expression 

Serial analysis of gene expression (SAGE) was performed 
as described previously [11] with the following modifi- 
cations: ditags were PGR amplified using biotinylated pri- 
mers and digested with Nla III enzyme [12]. Concatemers 
were heated for 15 min at 65°C and chilled on ice for 
10 min before separation on an 8% polyacrylamide gel 
[13]. The concatemers were then cloned into the Sphl 
site of the pZero vector (Invrtrogen). Concatenated tags 
were screened by PGR using M13 forward and M13 
reverse primers. PGR products with inserts greater than 
500 bp were isolated and sequenced with M13 forward 
primer on an automated 3700 DNA sequencer (Perkin- 
Elmer, Norwatk, CT, USA). For microSAGE, 1 \ig of total 
RNA per tube was used for cDNA synthesis in two tubes, 
with the mRNA Capture Kit (Boehringer Mannheim, 
Indianapolis, !N, USA. cDNAs were cleaved with Affalll, 
ligated to linkers and then digested with Bsm Fl enzyme. 
The released tags were ligated, and processed for the 
rest of steps as with the standard SAGE protocol. 

2.3 SAGE data analysis 

SAGE tags were extracted using the SAGE software 
V 4.12 [11]. Tags were matched to the SAGE reliable 
map (release 10-26-2000) (http://www.ncbi.nlm.nih.gov/ 
SAGE/). Due to the fact that some tags map to multiple 
genes and some genes have multiple tags, SAGE data 
were analyzed in two different ways: (1) exclusion 
method: tags that match to multiple genes were dis- 
carded. Only tags that match to a single gene were tabu- 
lated and composite counts analyzed for their signifi- 
cance; (2) inclusion method: tags that match to multiple 
genes were counted at 100% toward each gene. All tags 
were tabulated and composite counts analyzed for their 
statistical significance. Lists of differentially expressed 
genes (p<0.05) obtained from the exclusion and inclu- 
sion methods were compared, and finally only genes that 
have a p value <0.05 in both lists were considered statis- 



tically significant. The total number of genes identified 
was estimated by N m + (N um - 0.1 N um )/3.5, where N m is 
the number of genes matched to SAGE tags, IM um is the 
number of SAGE tags that do not match to known genes 
or ESTs, with 10% representing the estimated sequencing 
error per SAGE tag and 3.5, the average number of tags 
per gene in the SAGE reliable map (release 10-26-2000). 



2.4 RT-PCR and real-time PCR quantification 

For reverse transcription-PC R (RT-PCR), 1 ^g of total RNA 
was reverse transcribed into cONA. One fortieth of cDNA 
was used for PCR reaction. Samples were collected at 
different cycles and separated on a 2% agarose gel with 
ethidium bromide. Image was captured and quantified 
using NucleoVision 760 Image Workstation (Nucleotech, 
CA, USA). Amplification curves were generated. Sub- 
sequently, RT-PCR was done at cycles within the log 
phase of amplification. 

Real-time quantification was performed in the iCycler (Bio- 
Rad, Hercules, CA, USA). Briefly, one fortieth of cDNA was 
used in each reaction. Six reactions were carried out for 
each gene and three independent experiments were per- 
formed. PCR mix comprised of 1X PCR buffer, 1.5 mw 
MgCI 2 , 0.1 mM dNTP, 200 nM primers (listed below), 0.05 U 
platinum Taq polymerase (Invrtrogen) and 0.1 x SYBR green 
(Molecular Probes, Eugene, OR, USA). PCR was carried out 
at 94°C for 2 min, and 40 cycles of 94°C for 30 s, 55°C for 
30 s, and 72°C for 30 s. Primers used for real-time PCR 
quantification: PSA (prostate specific antigen) (Hs.1 71995, 
5 ' -GGAAATGACCAGGCCAAGAC-3\ 5-CAACCCTG GAC 
C TCACACCTA-3'), SCMH1 (Hs.57475, S'-GCCTTGACC 
ACATCACTCCAT-3\ 5 , -AGGCCTAGGGCTGCAAAAG-3 f ), 
and clusterin (Hs.75106, 5 '-GCAGGAATACCGCAAAAA 
GC-3\ 5'-GACTCAAGATGCCCCCGTAAG-3'). Standard 
samples (50, 25, 1 0, 5, 2.5, 1 and 0.5 uLcDNA) and experi- 
mental samples were used in real-time quantification PCR. 
Each sample was run in quadruple reactions. Standard 
curve (Ct = mX +• B) was obtained, where Ct is threshold 
cycle number, X is log quantity of target molecules, m is 
curve slope and B is Y-axis intercept value. Number of 
fold induction or repression for a given target molecule 
was calculated by Qa/Qb, where Qa is the target quantity 
in experimental sample A and Qb is the target quantity 
in experimental sample 8. Cta = mlogQa + B or Qa = 
10 (ct a -Bm l( therefore, Qa/Qb = 10^^". 



2.5 2-D PAGE 

The procedure followed was as previously described 
[14]. Cells were solubilized in 200 nL of lysis buffer con- 
taining 9.5 m urea .(Bio-Rad), 2% NP-40, 2% carrier 
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ampholytes pH 4-8 (Gallard/Schlessinger, Carle Place, 
NX USA), 2% p-mercaptoethanol and 10 mM PMSR 
Aliquots containing approximately 5x10 6 cells, were 
applied onto isofocusing gels. IEF was conducted using 
pH 4 to 8 carrier ampholytes at 700 V for 16 h, followed 
by 1000 V for an additional 2 h. The 1-D gel was loaded 
onto the 2-D gel, after equilibration in 1 25 mM Tris, pH 6.8, 
10% glycerol, 2% SDS, 1 % DTTand bromophenol blue. 
For the second dimension separation, a gradient of 
11-14% of acrylamide (Serva, Hauppauge, NY, USA) 
was used. Following 2-D PAGE, proteins were visualized 
by silver staining of the gels or transferred to an Immobi- 
lon-P PVDF membrane (Millipore, Bedford, MA). 



2.6 Protein identification by mass spectrometry 

The 2-D gels were silver stained by successive incuba- 
tions in 0.02% sodium thiosulfate for 2 min, 0.1% silver 
nitrate for 40 min and 0.014% formaldehyde plus 2% 
sodium carbonate. The proteins of interest were excised 
from the 2-D gels and destained for 5 min in 15 mM 
potassium ferricyanide and 50 mM sodium thiosulfate 
as described [15], Following three washes with water, 
the gel pieces were dehydrated in 100% acetonitrile for 
5 min and dried for 30 min in a vacuum centrifuge. 
Digestion was performed by addition of 100 ng of trypsin 
(Promega, Madison, Wl, USA) in 200 mM ammonium 
bicarbonate or by addition of 100 ng of the endopro- 
teinase Glu-C (Promega) in 100 mM ammonium bicarbo- 
nate. The Lys-C digestion was performed with 500 ng of 
the endoproteinase Lys-C (Roche, Mannheim, Germany) 
in 100 mM Tris-HCI, pH 9. Following enzymatic digestion 
overnight at 37°C, the peptides were extracted twice 
with 50 \iL of 60% acetonitrile/1 % TFA. After removal of 
acetonitrile by centrifugation in a vacuum centrifuge, the 
peptides were concentrated by using pipette tips C18 
(Millipore). 

Analyses were performed primarily using a PerSeptive 
Biosystem MALDI-TOF Voyager- DE mass spectrometer 
(Framingham, MA, USA), operated in delayed extraction 
mode. Peptide mixtures were analyzed using a saturated 
solution of cyano-4-hydroxycinnamic acid (CHA) (Sigma, 
St. Louis, MO, USA) in acetone containing 1% TPA 
(Sigma). Peptides were selected in the mass range of 
800-4000 Da. Spectra were calibrated using calibration 
mixture 2 of the Sequazyme peptide mass standards kit 
(PerSeptive Biosystems). The search program MS-Fit, 
developed by the University of California at San Francisco 
(http://prospector.ucsf.edu), was used for searches in the 
NCBI database. Search parameters were as follows: 
maximum allowed peptide mass error of 400 ppm, con- 
sideration of one incomplete cleavage per peptide and 



pH range between 4 and 8. MALDI-TOF mass spectro- 
metry was also used for molecular weight determina- 
tion as described [16]. In some cases, the amino acid 
sequence of some peptides of interest was determined 
by ESI MS analysis. 

3 Results 

3.1 SAGE analysis of the effect of androgen 
on gene expression 

SAGE libraries were generated from LNCaP cells cul- 
tured in the presence or absence of DHT. A total of 
123371 tags were generated of which 62 878 were from 
the LNCaP cell line and 60 493 from the LNCaP cells 
treated with DHT for 24 h (Table 1). Sequence analysis 
identified a total of 28844 distinct tags representing 
16570 genes, 1 1 243 from LNCaP and 12203 from DHT 
treated cells. A total of 351 transcripts were differentially 
expressed at a significant level (p<0.05). Eighty-seven 
percent of transcripts matched GenBank entries; 79% 
corresponded to known sequences and 8% to ESTs. 
RNA levels for 1 47 genes were increased and 204 genes 
were decreased after DHT treatment (Table 2). Therefore, 
at the RNA level more genes were repressed than 
activated by androgens. Of these androgen-regulated 
genes, 1 49 were changed > five-fold by DHT treatment 
flable 3). 



Table 1. Summary of SAGE analysis in LNCaP cells 



Sample 


Total 


Distinct 


Number 




transcripts 


tags 


of genes 


-DHT 


62 878 


17 050 


11 243 


+ DHT 


60 493 


18510 


12 203 


Total 


123 371 


28 844 


16 570 


- DHT: LNCaP cells without dihydrotestosterone; + DHT: 


LNCaP cells treated with 1 0 -9 m dihydrotestosterone for 


24 h 








Table 2. DHT regulated genes in LNCaP cells 




Genes 




LNCaP vs LNCaP + DHT 






(gene #) 








p 0.05 


p 0.01 


Known genes 




277 


147 


ESTs 




29 


8 


No match 




45 


13 


Total 




351 


168 


Up-regulated by DHT (24 h) 


147 


65 ■ 


Down-regulated by DHT (24 h) 


204 


103 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


283305 


1 


82 


Induced 


U 


immunoglobulin heavy contant alpha 1 


183 752 


0 


42 


Induced 


0 


microsemino protein, beta- 


140 


1 


44 


Induced 


0 


immunoglobulin heavy constant gamma 3 (Grn marker) 


75415 


18 


95 


Induced 


0 


beta-2-microglobulin 


84298 


0 


17 


Induced 


1 .OOE-05 


CD74 antigen (invariant polypeptide of major histocompability 
complex, class II antigen-associated) 


9615 


0 


16 


Induced 


1 .OOfc-OO 


myosin regulatory iignicnain d, smooin rnuscie isoiurrn 


77443 


0 


16 


Induced 


1 .OOE-05 


actin, gamma 2, smooth muscle, enteric 


1119 


0 


15 


Induced 


O nrtr AC 

2.33b-0o 


nuclear recspxor suuTamiiy h, group m, mernuer \ 


84753 


23 


2 


Repressed 


o.OUb-Uo 


r\iAAU^4b protein 


75 777 


0 


14 


Induced 


O.DDC-UO 


transgeun 




0 


14 


Induced 


3.66E-05 


immunoglobulin J polypeptide, (inker protein for immuno- 


76325 










giODUim aipna ana mu poiypeuaes 


1852 


1 


17 


Induced 


4.33E-05 


acid phosphatase, prostate 


78465 


0 


13 


Induced 


9.UUfc-0o 


v-jun avian sarcoma virus 1 7 oncogene homolog 


78344 


1 


16 


Induced 


1 .20E-04 


myosin, heavy polypeptide 1 1 , smooth muscle 


83006 


19 


2 


Repressed 


1 .50E-04 


CGM 39 protein 


263 812 


15 


1 


Repressed 


3.19E-04 


nuclear distribution gene u \r, niauiansj nomoiog 


TACGGGGATA 


0 


11 


Induced 


4.20E-04 


Novel 


GCCTGGGTGG 


11 


0 


Repressed 


6.63E-04 


Novel 


GACTGACACT 


16 


2 


Repressed 


8.40E-04 


Novel 


284296 


0 


10 


Induced 


8.60E-04 


Homo sapiens SURF-4 mRNA, complete cds 


75105 


13 


1 


Repressed 


1.19E-03 


emopamil-binding protein (sterol isomerase) 


128075 


10 


0 


Repressed 


1.30E-03 


ESTs 


154162 


15 


2 


Repressed 


1.43E-03 


ADP-ribosylation factor-like 2 


6895 


1 


12 


Induced 


1.49E-03 


actin related protein 2/3 complex, subunit 3 (21 kD) 


143 240 


17 


3 


Repressed 


1.63 E-03 


ESTs 


211582 


0 


9 


Induced 


1.66E-03 


myosin, light polypeptide kinase 


180266 


0 


9 


Induced 


1.66E-03 


tropomyosin 2 (beta) 


93002 


14 


2 


Repressed 


2.53E-03 


ubiquitin carrier protein E2-C 


17883 


14 


2 


Repressed 


2.53E-03 


protein phosphatase 1G (formerly 2C), magnesium- 










dependent, gamma isoform 


126023 


14 


2 


Repressed 


2.53E-03 


ESTs, Highly similar to NTC1 J-IUMAN NEUROGENIC LOCUS 










NOTCH rROI fcIN HUMULUo \ rntUUnoUn 


69469 


2 


13 


Induced 


2.92E-03 


dendritic cell protein 


77899 


2 


13 


Induced 


2.92E-03 


tropomyosin 1 (alpha) 


285501 


2 


13 


Induced 


3.07E-03 


Human rearrangeo immunogioounn ramuua iiyni unain mmN/A 


119209 


0 


8 


Induced 


3.07E-03 


insulin -like growth factor binding protein 7 


173043 


0 


8 


Induced 


3.07E-03 


metastasis-associated 1 -like 1 


75866 


0 


8 


Induced 


3.07 E-03 


dimethylarginine dirnethylaminohydrolase 1 


171695 


0 


8 


Induced 


3.07E-03 


dual specificity phosphatase 1 


TACGGGGATT 


0 


8 


induced 


3.34E-03 


Novel 


172791 


13 


2 


Repressed 


4.19E-03 


ubiquitously-expressed transcript 


13561 


13 


2 


Repressed 


4.19E-03 


ESTs, Weakly similar to dJ37E16.5 [H. sapiens] 


98260 


8 


0 


Repressed 


4.34E-03 


. ESTs 


CATAAGACTT 


8 


0 


Repressed 


4.64E-03 


Novel 


TACGGGGACA 


2 


12 


Induced 


5.76E-03 


Novel 


180034 


12 


2 


Repressed 


6.91 E-03 


cleavage stimulation factor, 3* pre-RNA, subunit 3, 77kD 


5753 


12 


2 


Repressed 


6.91 E-03 


inosito(myo)-1 (4)-mono phosphatase 2 


278941 


10 


1 


Repressed 


6.94E-03 


PRO0628 protein 


19 500 


0 


7 


induced 


7.25E-03 


nuclear localization signal deleted in velocardiofacal syndrome 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


yuuo 


n 
U 


7 


inauctJu 


7 occ_nq 


VAMP fvAQirlp-flQ<inriatAd m^mhrane nrotein^-associated 
protein A (33kD) 




o 


7 


Induced 


7.25E-03 


Homo sspiens clone HQ0692 




12 


2 


1 lO i*J 1 COOvU 


7.63E-03 


Novel 




7 


n 
\j 


ntijJi coocu 




Novel 


CCCCCCTGI C 


-7 

( 


U 


Repressed 


o.oyc-uo 


INUVcl 


OHO C\Cl4 




1 1 


maucea 


ft ftOP-o/* 


Nnmn <pn/pn<; rDNA Fl 1207^8 fis clone HEP08257 


1 1 y uuu 


1 


Q 


InHi marl 


9.12E-03 


artinin aloha 1 

C4w LI 1 III If C4I yjl Id 1 


78596 


1 


y 


Induced 


Q IOC OQ 


nrntaaenma /nrnenma marrrinain^ qi ihiinit hpfrfl t\/OP 
prOlcaSQIIIU ^|JIUoLMMW, 1 1 IdUl (JfJaJl I) ouuui Ml, ucva lypo, o 


1 71 955 


7 


0 


Repressed 


o C"i c no. 


uopninin associaieu protein \iasunj 


180545 


7 


0 


Repressed 


9.61 E-03 


Homo sapiens mRNA for hypothetical protein (TR2/D1 5 gene) 


77719 


7 


0 


Repressed 


9.61 E-03 


gamma-glutamyl carboxylase 


4877 


7 


0 


Repressed 


9.61 E-03 


CGI -oi protein 


22795 


7 


0 


Repressed 


9.61 E-03 


ESTs 


79 335 


7 


0 


Repressed 


n cmc no 
9.01 c-Uo 


nomo sapiens oVVi/oNr-reiatea, rnairiA-assuciaiou, dcun 










HononHont ran i il ntnr nf rhrnmfltin D1 f^MARCDI^ mRNA 
UcpalKJoMl lOyUlaHJI \Il L>I IFUI Malll 1 U 1 (J limine 


119 177 


9 


1 


Repressed 




ML/"-nD05ylaUn TaCUJl o 


CGGGAGCACC 


9 


1 


Repressed 


■* one no 


Novel 


182217 


0 


6 


Induced 


1 .32E-02 


succinate-CoA ligase, ADP-forming, beta subunit 


75106 


0 


6 


Induced 


1 .32E-02 


clustenn (complement lysis inniDitor, csr-^u, *^u, suiTaieu giy- 
mnrntoin ? tp^toQtfironft-rflnrR<i^pfi nrostate messaoe 2^ 


256311 


0 


6 


Induced 


•i qoc no 


granin-UK© neuroenciocrine pcpiiua prcL-ur&ui 


1 03 1 80 


0 


6 


Induced 




DV/£ protein 


TACGGGGATG 


1 


8 


Induced 


•i 7AC no 


Novel 


AAACAAATCA 


2 


10 


Induced 


1 .70E-02 


Novel 


136644 


2 


10 


Induced 


1.71E-02 


CS box-containing WD protein 


8036 


2 


10 


Induced 


1.71E-02 


glioblastoma overexpressed 


74284 


2 


10 


Induced 


1.71E-02 


ESTs, WeaKly similar to oimiiarxo o. cerevisiae nypotneucai 
protein L31 1 1 


ATGGCTGATC 


6 


0 


Repressed 


1 .75E-02 


Novel 


ATCACTGCCC 


6 


0 


Repressed 


1 .75 E-02 


Novel 


ACATCATCAG 


6 


0 


Repressed 


1.75E-02 


Novel 


CCAGTCCAAG 


6 


0 


Repressed 


1 .75E-02 


Novel 


54842 


1 


8 


Induced 


1 .78E-02 


ESTs 


1526 


1 


8 


Induced 


1 .78E-02 


ATPase, Ca++ transporting, cardiac muscle, slow twitch 2 


7911 


1 


8 


Induced 


1 78E-02 


KIAA0323 protein 


227 400 




8 


Induced 


1.78E-02 


mrtogen-activated protein kinase kinase kinase kinase 3 


77 269 


\ 


8 


Induced 


1 .78E-02 


guanine nucleotide binding protein (G protein), alpha inhibiting 
activity polypeptide 2 


7943 


6 


0 


Repressed 


1.81E-02 


R PBS -mediating protein 


278544 


6 


0 


Repressed 


1.81 E-02 


acetyl -Coenzyme A acetyltransferase 2 (acetoacetyl 
uoenzyme a tnioiasej 


23111 ■ 


6 


0 


Repressed 


1 .81 E-02 


phenylalanine-tRNA synthetase-like 


31442 


6 


0 


Repressed 


1.81 E-02 


RecQ protein-like 4 


38628 


6 


0 


Repressed 


1.81 E-02 


hypothetical protein 


77422 


6 


0 


Repressed 


1.81 E-02 


proteolipid protein 2 (colonic epithelium-enriched) 


211973 


6 


0 


Repressed 


1.81 E-02 


homolog of Yeast RRP4 (ribosomal RNA processing 4), 
3'-5'-exoribonuclease 


171 075 


6 


0 


. Repressed 


1.81 E-02 


replication factor C (activator 1) 5 (36.5kD) 


89781 


6 


0 


Repressed 


1.81 E-02 


upstream binding transcription factor, RNA polymerase I 


154149 


6 


0 


Repressed 


1 .81 E-02 


apurinic/apyrimidinic endonuclease(APEX nuclease)-like 2 



protein 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


279 772 


8 


1 


Repressed 


2.28E-02 


brain specific protein 


205091 


8 


1 


Repressed 


2.28E-02 


ESTs, Weakly similar to WW domain binding protein 1 1 
(M. muscu/us) 


75658 


8 


1 


Repressed 


2.28E-02 


phosphorylase, glycogen; brain 


GGGCAGCTGT 


8 


1 


Repressed 


2.37E-02 


Novel 


105440 


0 


5 


Induced 


2.43E-02 


hepatocyte nuclear factor 3, alpha 


118244 


0 


5 


Induced 


2.43E-02 


protein phosphatase 2, regulatory subunit B (B56), delta iso- 
form 


82389 


0 


5 


Induced 


2.43E-02 


CGI-1 18 protein 


31638 


0 


5 


Induced 


2.43E-02 


restin (Reed-Steinberg cell-expressed intermediate filament- 
associated protein) 


12013 


0 


5 


Induced 


2.43E-02 


ATP-binding cassette, sub-family E (OABP), member 1 


12797 


0 


5 


Induced 


2.43E-02 


DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 


7736 


0 


5 


Induced 


2.43E-02 


hypothetical protein 


274479 


0 


5 


Induced 


2.43E-02 


NME7 


153138 . 


0 


5 


Induced 


2.43E-02 


origin recognition complex, subunit 5 (yeast homolog)-like 


16034 


0 


5 


Induced 


2.43E-02 


ESTs 


270072 


0 


5 


Induced 


2.43E-02 


ESTs 


187035 


0 


5 ! 


Induced 


2.43E-02 


ESTs 


140452 


0 


5 


Induced 


2.43E-02 


cargo selection protein (mannose 6 phosphate receptor bin- 
ding protein) 


182265 


0 


5 


Induced 


2.43E-02 


keratin 19 


19762 


0 


5 


Induced 


2.43E-02 


ESTs, Weakly similar to unknown (D. melanogastet) 


317 


0 


5 


Induced 


2.43E-02 


topoisomerase (DNA) 


6236 


0 


5 


Induced 


2.43E-02 


ESTs 


GCTGGAGCCT 


5 


0 


Repressed 


3.03E-02 


Novel 


CCAGTGCTCA 


5 


0 


Repressed 


3.03E-02 


Novel 


ACCCTACATA 


5 


0 


Repressed 


3.03E-02 


Novel 


GGGGAAATCT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACTGGTACTG 


5 


0 


Repressed 


3.03E-02 


Novel 


GCTCCGGTGT 


5 


0 


Repressed 


3.03E-02 


Novel 


ACAGTGGTGA 


5 


0 


Repressed 


3.03E-02 


Novel 


7869 




7 


Induced 


3.04E-02 


lysophosphatidic acid acyltransferase-delta 


12101 


1 


7 


Induced 


3.04E-02 


hypothetical protein 


266940 




7 


Induced 


3.04E-02 


t-comptex-associated-testis-expressed 1 -like 1 


6196 


; 


7 


Induced 


3.04E-02 


integrin-linked kinase 


366 


1 


7 


Induced 


3.04E-02 


6-pyruvoyltetrahydropterin synthase 


173611 


1 


7 


Induced 


3.04E-02 


NADH dehydrogenase (ubiquinone) Fe-S protein 2 (49kD) 
(NADH-ceonzyme Q reductase) 


102469 


1 


7 


Induced 


3.04E-02 


putative nuclear protein 


78825 




7 


Induced 


3.04E-02 


matrin 3 


284465 


; 


7 


Induced 


. 3.04E-02 


ESTs 


30738 


1 


7 


Induced 


3.04E-02 


hypothetical protein FU10407 


78687 




T 


Induced 


3.04E-02 


neutral sphingomyelinase (N-SMase) activation assoviated 
factor 


92381 


1 


7 


Induced 


3.04E-02 


nudix (nucleoside diphosphate linked moiety X)-type motif 4 


242039 


5 


0 


Repressed 


3.26E-02 


EST 


4766 


5 


0 


Repressed 


3.26E-02 


DKF2P586O0120 protein 


283109 


5 


0 


Repressed 


3.26E-02 


hypothetical protein DKFZp762L1710 


192853 


5 


0 


Repressed 


3.26E-02 


ubiquitin-conjugating entyme E2G 2 (homologous to yeast 
UBC7) 


153678. 


5 


0 


Repressed 


3.26E-02 


reproduction 8 
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UGD or Tag 


CAP 


DHT 


Change 


P value 


Description 


dulOl f 


0 


U 


nepressea 


o occ no 


CQT 

to 1 




c 
O 


u 


nepressea 


q ooc no 


selenoprotein a 


•i cni i a 

1 503 iy 


0 


U 


Repressed 




bo 1 s 


102456 




r\ 

y 


Repressed 


o occ no 


survival of motor protein interacting protein 1 


284250 


5 


U 


Repressed 




au-uuo proiein 


251871 


5 


o 


Repressed 


3.26E-02 


CTP synthase 


270480 


7 


1 


Repressed 


3.92E-02 


ESTs, Weakly similar to ALU5JHUMAN ALU SUBFAMILY SC 
SEQUENCE 


26655 


7 


1 


Repressed 


3.92E-02 


glucose-6-phosphatase, transport (glucose-6-phosphate) 
protein 1 


8118 


7 


1 


Repressed 


3.92E-02 


leukotriene A4 hydrolase 


CTCCGCCGGC 


7 


1 


Repressed 


4.47E-02 


Novel 


AGGAAATGCT 


7 


1 


Repressed 


4.47E-02 


Novel 


GCTGACCGAGG 7 


1 


Repressed 


4.47E-02 


Novel 


CCCATAGTCC 


7 


1, 


Repressed 


4.47E-02 


Novel 



CAP: LNCaP cells without dihydrotestosterone 

DHT: LNCaP cells treated with 10" 9 dihydrotestosterone for 24 h 

UGD: unigene ID 

Numbers in CAP and DHT columns are the number of tags observed in respective samples 



3.2 Confirmation of SAGE data 

To confirm the differential expression pattern at the RNA 
level, semiquantitative RT-PCR assay was performed on a 
group of selected genes. One \ig of total RNA was reverse 
transcribed into cDNAwith oligo dT. cDNA was amplified by 
PCR for various cycles. PCR products were separated on 
agarose gels and quantified by densitometry. PCR ampli- 
fication curves were plotted for each gene, and data within 
the logarithmic phase of amplification were used for quanti- 
fication (Fig. 1 A, B and C). The reproducibility of gene regu- 
lation by androgen was confirmed in three independent 
experiments by monitoring the induction of PSA in the pres- 
ence or absence of cyclohexamide (Fig. 1 B). The kinetics 
of gene induction or repression was determined for PSA 
(Hs.171995), clusterin (Hs.75106) and SCMH1 (Hs.57475) 
genes. Typical examples of kinetics are shown in Figure 
1C. PSA was induced at 4-6 h t peaked between 6-20 h, 
and gradually declined after 20 h post-treatment of DHT. 
Clusterin was induced within 0.5-1 h and gradually declined 
after 6-1 2 h. SCMH1 was repressed 2-4 h post-treatment. 
Expression of PSA, clusterin, and SCMH1 was quantified by 
real-time PCR (Fig. 1D). 

3.3 Changes in protein expression induced 
by DHT 

Protein lysates of LNCaP cells cultured in the presence of 
DHT for 72 h and cells cultured in parallel in the absence 
of DHT were subjected to 2-D PAGE (Fig. 2). Following sil- 



ver staining, gels were digitized prior to computer-based 
matching and quantitative analysis, as previously des- 
cribed [17]. Of a total of 1031 protein spots matched 
between 2-D patterns of DHT treated and untreated cells, 
a set of 32 protein spots increased in intensity by at least 
two-fold in two independent experiments. Likewise, a set 
of 1 2 protein spots decreased in intensity by at least two- 
fold in these two independent experiments. A two-fold 
change in integrated intensity by silver staining represents 
on average a three-fold change in amount of protein, 
based on prior quantitative studies [1 7]. 

The 44 protein spots that changed in intensity as a result 
of DHT treatment were excised from the gels, digested 
with trypsin, and subsequently analyzed by MALDI-TOF 
MS. The resulting spectra were used to identify the pro- 
teins using the MS -Fit search program. Of the 44 spots 
excised from the gels, 29 were identified without ambi- 
guity and consisted of 21 up -regulated and eight down- 
regulated proteins (Table 4). The identified proteins repres- 
ented a heterogeneous group that included cytoskeletal 
proteins (e.g. tropomyosin, a tubulin), metabolic enzymes 
(e.g. adenine phosphoribosyl transferase, (3 1 ,4 galactosyl 
transferase, galactocerebrosidase), and the products of 
previously described androgen responsive genes (e.g. 
SRY, selenium binding protein) [18, 19]. Specific anti- 
bodies confirmed the identification based on MS for all 
proteins analyzed by Western blotting, which included a 
tubulin, myosin light chain isoforms, nucleoside diphos- 
phate kinase A, glyceraldehyde 3-phosphate dehydro- 
genase (G3PD), and tropomyosin (data not shown). 
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Figure 1. Kinetics and quanti- 
fication of androg en-regulated 
genes. Total RNA from un- 
treated and DHT treated 
LNCaP cells was reverse 
transcribed. cDNA was PCR 
amplified for various cycles. 
PCR amplification curves were 
plotted for each gene. A cycle 
number within the logarithmic 
phase of amplification was 
selected for semi -quantitative 
analysis. cDNA was also used 
for real-time quantitative PCR. 
(A) An example of RT-PCR 
amplification of PSA at cycles 
between 20-30. (B) RT-PCR 
of PSA in three independently 
treated LNCaP samples, h: 
hours; 0: no DHT; 24: treated 
with 10-* M DHT for 24 h; -: 



without cycloheximide; -k with 5 ^ig/mL of cycloheximide for 24 h. (C) An example of three independent kinetics experi- 
ments for PSA, clusterin and SCMH1. (D) Comparison of SAGE results with that of real-time quantitative PCR for PSA, 
clusterinand SCMH1. |: induction; |: repression. 



Corresponding SAGE data were available for most of the 
proteins affected by DHT treatment that were identified. It 
was therefore of interest to determine if the changes 
observed at the protein levels were matched by concor- 
dant changes at the mRNA level. Remarkably, for most of 
the proteins identified, there was no appreciable concor- 
dant change at the RNA level (Table 4). 



4 Discussion 

We have compared the gene expression profile of 
LNCaP, an androgen responsive prostate cancer cell 
line, with or without androgen (DHT) treatment. Approxi- 
mately 350 of 16570 expressed genes detected at the 
RNA level were affected by dihydrotestosterone at the 
p<0.05 level. The DHT responsive genes included 
known genes, ESTs, and novel genes. As expected, we 
saw an induction of genes that are well known to be 
regulated by androgens. For instance, we found a 1 .7- 
fold induction in kallikrein 3/PSA, a 7.6-fold induction in 
prostatic kallikrein 2, and a 15.7 -fold induction in pro- 
static acid phosphatase by DHT in LNCaP cells. We 
also saw a five-fold induction in NKX3.1/NKX3A and 
three -fold in fatty acid synthase; two previously identified 
androgen-regulated genes [20, 21]. Interestingly p-micro- 
seminoprotein, reported to be reduced or lost in prostate 
tumor [22], was up-regulated more than 40-fold in 



LNCaP cells by DHT. More significantly, our data indicate 
that genes involved in a variety of tumor cell functions 
such as growth, apoptosis, and metastasis, are directly 
or indirectly regulated by androgens. In addition, it is 
noteworthy that both p-actin and G3PD t the two most 
frequently used loading controls, were up-regulated 
approximately two-fold by DHT (p<0.01). Thus, these 
two genes may not be appropriate controls in some 
experiments. 

Serial analysis of androgen-regulated gene expression 
provides us with a list of candidate genes. However, 
many factors such as the dose of DHT and the time of 
cell exposure to DHT and other unknown experimental 
variations will affect the level of gene expression. There- 
fore, it is important to confirm the SAGE results with alter- 
native methods. We performed semiquantitative RT-PCR 
(Fig. 1 B) on approximately 20 genes and real-time quanti- 
tative PCR on 10 genes in at least three independently 
DHT treated LNCaP samples. We noticed experimental 
variations in every gene determined. Some genes such 
as PSA (Fig. 1 B) have smaller variations and some have 
larger variations. In addition, the fold of alteration in 
expression identified by SAGE is different for some genes 
compared to that identified by real-time quantitative PCR. 
This is probably due to the technical limitation of SAGE. 
Serial analysis of gene expression is highly quantitative 
for genes whose tags are detected at large numbers. 
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Figure 2. 2-D PAGE patterns of 
untreated (A) and LNCaP cells 
treated with DHT for 72 h (B). 
Identified proteins that were 
reduced in levels following DHT 
treatments are shown in (A) and 
induced proteins' are shown in 
(B). Numbered spots represent 
either proteins that were chan- 
ged in level that have not been 
identified or isoforms of identi- 
fied proteins. 
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Table 4. List of proteins that are altered by DHT (> 2-fold) 



Proteomics 2001, 7, 1327-1338 



Protein name or 


Unigene 


Protein 




Change 


mRNA 




Change 


sport number 


ID 


0 h 


72 h 


in protein 


0 h 


24 h 


in mRNA 


Adenine phosphoribosyl transferase 


28914 


0.448 


1.084 


T 


36 


18 


1 


Alpha Endosulfine 


111680 


0.066 


0.679 


T 


2 


5 


t 


Alpha Tubluin 


278 242 


0.924 


1.932 


T 


48 


38 


I 


Nucleophosmin, B23 


173 205 


. 0.256 


0.522 


T 


16 


5 


1 


CGI -83 


118554 


0.063 


0.291 


t 


2 


3 


T 


Creatine Kinase B chain (brain) 


173 724 


0.508 


1.142 


T 


59 


32 


1 


Cytokeratin 8 


242 463 


0.134 


0.393 


T 


35 


40 


T 


Cytoskeletal tropomyosin, NTRK1 


85 844 


0.414 


1.174 


t 


14 


18 


t 


Galactocerebrosidase 


273 


0.095 


0.405 


T 


3 


3 




Lactoyl Glutathione Lyase 


75 207 


0 


1.256 


T 


27 


22 


i 


Myosin Light Chain (1000) 


77 385 


0.724 


6.102 


T 


• 118 


88 


i 


Myosin Light Chain (1001) 


77 385 


0.6 


3.212 


T 


118 


88 


I 


Myosin Light Chain (1032) 


77385 


2.573 


5.694 


t 


118 


88 


I 


Nm23 


118638 


0.583 


1.978 


t 


35 


25 


I 


Protein phosphatase 6 ( catalytic 


279563 


0.287 


0.737 


T 


2 


2 




Protein 


7016 


0 


0.448 


T 


2 


5 


t 


Slow Skeletal Muscle Troponin 


84673 


0.067 


0.195 


t 


1 


0 


i 


SRY maJe gonadal sex determining 


1992 


0 


0.735 


t 


ND 


ND 




Synovial sarcoma, X breakpoint 2 


289105 


0.225 


0.593 


T 


ND 


ND 




Terminal deoxynucleotidyl 


272 537 


0.392 


0.995 


T 


ND 


ND 




Transformer-2 alpha 


119523 


0.074 


0.201 


t 


7 


13 


t 


284 




0.173 


0.461 


T 








402 




0 


0.33 


T 








456 




0.154 


0.314 


t 








664 




0.297 


0.629 


t 








807 




0 


0.377 










863 




0.246 


0.665 


T 








903 




0.271 


0.632 


T 








1018 




0 


0.725 


r 








1054 




0.47 


1.394 


T 








1057 




. 0.451 


0.959 


t 








1082 




0.609 


1.738 


T 








14-3-3 eta 


75 544 


0.232 


0.114 


I 


1 


5 


t 


Beta-1 ,4-galactosyl transferase 


158540 


1.306 


0.707 


I 


ND 


ND 




Cox 11 


241515 


1.643 


1.453 


i ' 


3 


0 


i 


Dihydrolipoamide dehydrogenase 


74635 


0.844 


0.132 




9 


8 




G3PD (456) 


169476 


1.191 


0.279 




75 


137 




G3PD (457) 


169 479 


2.051 


0.336 




75 


137 




Selenium Binding Protein (234) 


7833 


0.411 


0.201 




26 


31 




Selenium Binding Protein (237) 


7 833 


01574 


0.22 




26 


31 




651 




0.31 


0.115 










745 




0.2t 


0.067 










892 




0.674 


0.454 










1178 




0.933 


0.305 











Numbers in protein are the intensity of protein spots 

Numbers in mRNA columns are the number of SAGE tags detected 

ND: not detected 

f : increase; J: decrease; =: equal 
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When the tag number detected for a given gene is close 
to zero, the quantitative nature of SAGE is compromised. 
For instance, in approximately 60 000 transcripts of each 
sample, the PSA tag was detected 42 times in LNCaP and 
70 times in LNCaP treated with DHT. However, the tag for 
clusterin was detected 0 times and 6 times, and the tag 
for SCMH1 5 times and 0 times in the corresponding sam- 
ples. Therefore, the SAGE data of PSA is more reliable 
than that of clusterin and SCMH1 . Indeed, SAGE data of 
PSA is virtually identical to that of real-time quantitative 
PCR, whereas the SAGE data of clusterin and SCMH1 dif- 
fer from that of quantitative PCR (Fig. 1 D). 

PSA is probably the best-known androgen-regulated 
gene. We were surprised to see only 1.7-fold induction 
by DHT. However, kinetics experiments indicate that PSA 
is induced at 4-6 h, peaked between 6-20 h t and gradu- 
ally declined 20 h post-treatment of DHT. Since the SAGE 
experiment was done in samples treated for 24 h with 
DHT, PSA mRNA level was likely to have declined from 
its peak level. Indeed, we could detect 5-1 0 fold induction 
of PSA at 6-8 h post-treatment. Clusterin was reported as 
an androgen-repressed gene in the rat prostate [23]. 
Recent evidence indicates that clusterin may not be 
directly androgen-repressed, but regulated by apoptotic 
stimuli [24]. Our. results suggest that clusterin was 
induced within 0.5-1 h, gradually declined after 6-12 h, 
and after 24 h reduced to a lower level than that of 
untreated cells (Fig. 1C). 

Another important question is how many of the andro- 
gen-regulated genes Identified are directly induced or 
repressed by androgens. In order to address this ques- 
tion, detailed analyses must be done for each gene. As 
the first step, we will divide the androgen-regulated 
genes identified in this study into two groups; the cyclo- 
hexamide-sensitive group whose induction or reduction 
of expression by DHT is blocked by the protein synthesis 
inhibitor and the cyclohexamide-insensrtive group whose 
induction or reduction is not affected by cyclohexamide. 
We consider that genes in the latter group are directly 
regulated by DHT. Preliminary results suggest that 
approximately 20% of the 20 genes studied by RT-PCR 
are cyclohexamide-insensitive. Experiments are under 
way to determine the cyclohexamide sensitivity of all of 
the 149 genes listed in Table 3. Further characterization 
of androgen-regulated genes may provide some clues 
on the transition from hormone sensitive to hormone 
refractory prostate cancer. 

A relatively small set of genes could be analyzed at the 
protein level, largely due to the limited sensitivity of 2-D 
PAGE. Nevertheless, a substantial number of detected 
proteins (44 of 1031 proteins analyzed, 4.3%) were 
affected by DHT treatment. Some of the proteins in this 



subset that were identified included the products of 
genes that were previously shown to be affected by 
androgens, namely selenium binding protein, brain speci- 
fic creatine kinase, SRY protein, B23 and G3PD. Using 
a subtract! ve approach, the human selenium-binding 
protein gene was shown to be differentially expressed 
in LNCaP and reversibly down -regulated by exogenous 
androgen in a concentration-dependent manner, in con- 
cordance with our findings at the protein level [19]. An 
increase in levels of the brain specific creatine kinase B 
chain in response to androgen has also been described 
[25]. Likewise the SRY gene has been found to be respon- 
sive to androgen stimulation in LNCaP cells [18]. Also, a 
number of glycolytic enzymes including G3PD which 
were affected at the protein level have been found to be 
responsive to androgen stimulation [26]. Androgenic reg- 
ulation of the amount and phosphorylation of the protein 
B23, included in our list of affected proteins, has been 
previously described and found to be related to the early 
changes associated with androgen mediated growth of 
the prostate [27]. 

Corresponding SAGE data was available for most of the 
proteins affected by DHT treatment that were identified. 
Interestingly, for most of the proteins identified, there 
was no appreciable concordant change at the RNA level. 
There are several potential explanations for this lack of 
concordance. A particular gene may be represented by 
more than one protein isoform in 2-D gels. For example, 
in the identified group, three proteins (myosin light chain, 
G3PD and selenium binding protein) were represented 
by more than one isoform. Thus, a source of discordance 
between RNA and protein data may be that the protein 
change is limited to a particular isoform of a protein and 
not to overall protein products of a particular gene. 
Nevertheless, a change in a particular isoform is inform- 
ative and biologically meaningful and may not be pre- 
dictable from RNA data. A lack of concordance between 
RNA and protein data may also reflect either translational 
control, post-translational modifications, or changes in 
protein turnover due to DHT treatment. Yet another 
explanation for a lack of concordance could be a lag 
time for changes at the RNA level to be reflected in a 
protein change. It follows from the above considerations 
that monitoring gene expression at both RNA and protein 
levels may provide complementary information that could 
not be ascertained by solely measuring RNA or protein. 
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Web-based data warehouse on gene expression in 
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Based on biomedical literature databases, we tried a first step for constructing a gene expression 
"data warehouse" specific to human colorectal cancer (CRC). Results of genome-wide tran- 
scriptomic research were available from 12 studies, using various technologies, namely, SAGE, 
cDNA and oligonucleotide arrays, and adaptor-tagged amplification. Three studies analyzed CRC 
cell lines and nine studies of human samples. The total number of patients was 144. Out of 982 
up- or down-regulated genes, 863 (88%) were found to be differentially expressed in a single 
study, 88 in two studies, 22 in three studies, 7 in four studies, and only 2 genes in six studies. 
Eight large-scale proteomics studies were published in CRC, using 2-D-, SDS* or free-flow elec- 
trophoresis, involving only 11 patients. Out of 408 differentially expressed proteins, 339 (83%) 
were found to be differentially expressed only in a single study, 16 in three studies, 10 in four 
studies, 3 in five, and 1 in eight studies. Confirmation at proteome level of results obtained with 
large-scale transcriptomics studies was possible in 25%. This proportion was higher (67%) for 
reproducing proteome results using transcriptomics technologies. Obviously, reproducibility and 
overlapping between published gene expression results at proteome and transcriptome level are 
low in human CRC. Thus, the development of standardized processes for collecting samples, 
storing, retrieving, and querying gene expression data obtained with different technologies is of 
central importance in translational research. 
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1 Introduction 

Translational research in human colorectal cancer (CRC) is 
applying a large spectrum of molecular biology, cellular bi- 
ology, and advanced validation tools. In particular, genome- 
wide techniques are now applied to decipher modifications 
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in gene expression. In recent years, transcriptomics and 
proteomics tools have been broadly applied in CRC. The 
hope is that the new data obtained will now allow a classifi- 
cation of disease on molecular basis, deep insights into the 
pathophysiology of CRC, prognostic statements, and finally a 
systematic search for diagnostic and therapeutic targets. 

Because of the complexity of the biological system under 
investigation, the most significant contribution of transla- 
tional research in CRC is expected to derive not from the 
analysis of single experiments but from libraries of experi- 
ments. In other words, the results obtained so far by trans- 
lational research tools in different clinical and experimental 
settings need to be compared, contrasted, and if possible 
synthesized. Thus, based on the biomedical literature data- 
bases, we tried a first step for constructing such "data ware- 
house" specific to human CRC. 
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The specific aims of the present study were to screen 
transcriptomics and proteomics studies published in CRC, 
and to elaborate a kind of "meta-analysis of gene expression" 
in CRC. Our hypothesis was mat we would be able to deter- 
mine a common set of genes and gene products that are up- 
or down-regulated in human CRC by overlapping results 
obtained by different authors. 



2 Materials and methods 

This is a literature survey of gene expression data published 
in human CRC. A Medline search of reports published in 
English from 1990 to 2004 using the terms "colon cancer" 
and "gene expression" with the limit "human" was performed 
and yielded 1979 articles. All abstracts were reviewed and a 
related article search was performed on appropriate abstracts. 
Articles were selected by a consensus of two reviewers (E.S., 
M.R.) that satisfied these predetermined criteria: sample ori- 
gin (human and/or cell lines) and preparation detailed, tech- 
nology for gene expression studies defined (transcriptomics 
and/or proteomics studies), and quantitative results on over- 
expression or underexpression available. Studies concerning 
^ single genes or arbitrarily selected genes were discarded 
Results obtained in animal models were not considered. 

Data were entered into an Excel working sheet (Micro- 
soft, Seattle, USA). Since gene expression data were not 
always obtained with quantitative, but in most cases semi- 
quantitative gene expression analysis technologies, no 
expression values or ratios were entered into the database. 
Only genes reported as being over- or underexpressed were 
entered. No threshold was defined so that some genes 
defined as differentially expressed might have shown only 



marginal differences. Only descriptive statistics are provided, 
which were obtained with the in-built tools of the Excel soft- 
ware. 

Various methods were used for large-scale translational 
research in CRC, including transcriptomics and proteomics 
technologies. Results of genome-wide transcriptomic re- 
search are available from 12 studies, using various technolo- 
gies, namely, SAGE (five studies), complementary DNA 
(cDNA) arrays (five studies), and oligonucleotide array (one 
study), as well as adaptor- tagged amplification (one study). 
Three studies analyzed CRC cell lines and nine other studies 
analyzed human samples. The total number of patients pro- 
filed in all studies was 144. Out of the nine studies with hu- 
man samples, only two studies with a total of only 22 patients 
were performed using purified epithelial cells (one of them 
using cDNA arrays, the other one using SAGE technology). 
An overview of these, studies is provided in Table 1. 

Eight large-scale proteomics studies were published in 
CRC using various technologies, namely, 2-D PAGE (six 
studies), SDS-PAGE (one study), and free-flow electrophore- 
sis (one study). In total, all proteome studies in CRC involved 
only 11 patients. An overview of these studies is provided in 
Table 2. 

3 Results 

One thousand two-hundred and forty genes have been 
reported to be dysregulated (up- and/or down-regulated) in 
human CRC, representing about 5% of the 20000-25 000 
human genes. 



* The complete dataset can be consulted: 
http ://www.chimr giebethel.de 



Table 1. Large-scale or genome-wide transcriptomics studies in human CRC, using cell lines or human tissues. Alt studies together 
involved only 144 patients 

Author Year Sample Preparation Method Number 

of patients 



Zhang etal. [181 


1997 


Normal mucosa and primary tumors 


Whole tissue 


SAGE 


2 


SAGE-NETM91 


1997 


Primary tumors and cell lines 


Cell lines 


SAGE 




Zhang etal. [181 


1997 


Cell lines 


Cell lines 


SAGE 




Parle-McDermott ef a/. [20] 


2000 


Cell lines 


Cell lines 


SAGE 




Yanagawa etal. [21] 


2001 


Paired normal and cancer 


Whole tissue 


cDNA 


20 


Takemasa etal. [22] 


2001 


Paired normal and primary tumors 


Whole tissue 


cDNA 


20 


Nottermann etal. [231 


2001 


Paired normal mucosa and primary 


Whole tissue 


Oligonuc- 


22 






tumors, or adenoma 




leotide array 




Buckhaults etal. [241 


2001 


Normal mucosa, adenoma and 


Purified epithelial 


SAGE 


6 






primary tumors 


cells 






Birkenkamp etal. [25] 


2002 


Paired normal and primary tumors 


Whole tissue 


cDNA 


27 


Lin etal. [26) 


2002 


Paired normal mucosa and primary 


Purified epithelial 


cDNA 


16 






tumors, or adenoma 


cells 






Muro etal. [271 


2003 


Normal and cancer 


Whole tissue 


Adaptor-tagged 


11 


Williams et ai [28] 


2003 


Paired normal mucosa and primary 


Whole tissue 


cDNA 


20 



tumors, normal and adenoma 
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Table 2. Large-scale or proteomics studies in human CRC, using cell lines or human tissues. All studies together 
involved only nine patients 



Author 


Publication 


Sample 


Method 


Number 




year 






of patients 


Raymond etal. [9] 


1997 


Purified epithelial cells 


2-D PAGE 


1 


Simpson er al. [291 


2000 


Cell lines 


SDS-PAGE 




Lawrie etal. [30| 


2001 


Purified epithelial cells 


2-D PAGE 


4 


Simpson etal. [31 1 


2001 


Cell lines 


Free-flow electrophoresis 




Medjahed etal. [32] 


2003 


In silico 


2-D PAGE 




Demalte etal. [33] 


2003 


Cell lines 


2-D PAGE 




Stierum etai [341 


2003 


Cell lines 


2-D PAGE 




Friedman etai [35] 


2004 


Human whole tissue 


2-D PAGE 


6 



The vast majority of dysregulated genes in human CRC 
was found using transcriptomics tools: a total of 982 genes 
was found to be differentially expressed in at least one of 
these 12 transcriptomics studies. Out of these 982 genes, 863 
(88%) were found to be differentially expressed only in a 
single study, in other words these results have not been 
reproduced so far. The other findings could be reproduced in 
two or more transcriptomics studies: 88 genes were found to 
be differentially expressed in two studies, 22 in three studies, 
7 in four studies, no gene in five studies, and 2 genes in six 
studies. The most cited genes are listed in Tables 3 (up-reg- 
ulation) and 4 (down-regulation). 

A total of 408 proteins were found to be differentially 
expressed in human CRC in at least one study. O ut of these 408 
molecules, 339 (83%) were mentioned in a single study, in 
other words these results could not be reproduced. Differential 
expression of the remaining 70 proteins were mentioned with 
following frequenc: 40 were mentioned in two proteomics 
studies, 16 in three studies, 10 in four studies, 3 in five studies, 
and a single protein in eight studies. The most cited proteins 
are listed in Table 5. It has to be noted that only a single study 
[1] provided differential display protein expression data 
obtained in the human patient, using whole tissue biopsy. 

It is also difficult to reproduce transcriptomics results 
with proteomics tools. Out of 982 genes found to be differ- 
entially expressed in human CRC by .genome-wide tran- 
scriptomics technologies (Table 6a), only 177 (18%) have 
been confirmed using proteome technologies. When the 
genes reported to be differentially expressed in three and 
more transcriptomics studies (tt = 31) are compared with 
those reported to be differentially expressed in three and 
more proteomics studies (n*=30), only two genes (actin: 
ACTB.HUMAN and creatin-kinase: KCRB.HUMAN) can 
be matched. Thus, the probability of reproducing a gene 
expression result obtained at transcriptorne level is low when 
using proteomics technologies. 

In contrast, there is a better reproducibility when prote- 
omics results are verified using transcriptomics tools 
(Table 6b). In fact, when the subset of 30 proteins that are 
consistently (in three studies 6r more) reported to be dysreg- 
ulated in CRC is compared with transcriptomics results, 20 
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genes can be matched, representing about two-thirds. Thus, 
the probability of being able to reproduce a proteomics result 
using transcriptomics tools is more than three times higher 
than the opposite way (67 vs. 18%). However, it has to be 
noted that in 16% of cases, results remain unclear or even 
contradictory. 

Both in the transcriptomics and proteomics studies, 
many genes and factors were found to be differentially regu- 
lated that obviously do not play a causal role in CRC carci- 
nogenesis. 

4 Discussion 

This study was aimed at screening transcriptomics and pro- 
5 teomics studies published in CRC, and to elaborate a kind of 
meta-analysis of gene expression in CRC in order to generate 
a data warehouse that would be useful in translational 
research, A significant number of translational research 
studies have been published in CRC, starting in 1997 with 
SAGE technology, followed by an increasing number of array- 
based transcriptomics studies since 2001, and more recently 
by several proteomics studies. This has created a significant 
amount of data which we were then able to compile. 

Our starting hypothesis was that we would be able to 
determine a common set of genes and gene products that are 
up- or down -regulated in human CRC by comparing results 
obtained by different authors. This endeavor was not very 
successful; obviously, overlapping between published gene 
expression results both at proteome and transcriptorne level 
is low in human CRC. In fact, more than 80% of results 
could not be reproduced. Several explanations can be pro- 
vided to explain this lack of reproducibility. 

First, the number of patients included in the studies is 
low (alltogether 144 patients in transcriptomics, 11 in prote- 
omics studies); some high impact publications having been 
conducted on samples of only two patients [18]. This is a 
problem because interindividual genetic variability is high in 
human CRC [2]. Thus, it is allowed to hypothesize that some 
results attributed to gene dysregulation might be caused by 
genetic diversity rather than by cancer-specific traits. 
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Table 3. Results of transcriptomics research in human CRC. Most cited up-regulated genes out of 12 genome-wide or large-scale studies 

TRANSCRIPTOMICS 







Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


BuckhauHs et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Takemasa et al 2001 [22] 


Un et al 2002 [26] 


5* 

§ 

t3 
o 

i 

© 

a 

o 
£ 

i 

a 
a. 


Zhang et al 1997 [18] 


57 

i 

z 
fc 

z 

Cfl 
< 


en 

91 

a» 

CO 
C 

ts 
m 


Williams et al 2003 [28] 


Notterman et al 2001 [23] | 




Paired normal and cancer 


, Normal and cancer 


N.mucadenoma and pr.tum 


Paired n. and pr.tumors 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


Norm.muc and Prim.tum 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tum, n and adenoma 


Paired n.muc and pr.tum or adenoma 


Accession 

Number Swiss-Prot Entry Name Name 


I 

a 

o 


| Adaptor-tagged 


SAGE 


S 

o 

o 


< 

2! 

o 

o 


< 
z 
o 

« 


| SAGE 


SAGE 


SAGE 


{ SAGE 


I 


Oligonucleotide Array 


AF041260 


075363 


IFM2_HUMAN 


Interferon-inducible protein 1-Bd 


1 




1 




1 






1 










AA232508 


Q9Y397 


SAHH.HUMAN 


S-Adenosylhomocysteine hydrolase AHCY) 






1 


1 










1 


1 






AA007218 


Q36QY8 


BGH3.HUMAN 


Transforming growth factor, beta-induzed, 66 kDa 


1 




1 


1 










1 








G01119 




TCPD_HUMAN 


Chaperonin containing TCP-1, subunit 4 (delta) 












1 








1 


1 




X75821 


Q04984 


CA11_HUMAN 


C0L1A1 










1 


1 




1 










U22055 


Q96AG0 


TF3A„HUMAN 


General transcription factor IMA 








1 




1 










1 




U14631 


P80365 


GR0_HUMAN 


GR01 oncogene (melanoma growth stimulating 
activity, alpha) 








1 












1 


1 




M92439 


P42704 


MDP1.HUMAN 


MPDP4, MDP7 microsomal dipeptidase 






1 


1 








1 










AF029082 


P31947 


MPI2_HUMAN 


M-phase inducer phosphatase 2 






1 




















M8640Q 


P29312 


PR0C_HUMAN 


Pyrroline 5-carboxilate reductase mRMA 






1 




















L76465 


P 15428 


SPRC.HUMAN 


Secreted protein, acidic, cysteine-rich 
(osteonectin) \ 






1 




















X57352 


Q01628 


Q96AG0 


100 kDa coactivator mRN ; A . 


























X03205 


P62888 


RL30_HUMAN 


50S Ribosomal protein L30* . 


























M95787 


Q01995 


RL23.HUMAN 


60s ribosomal protein L23 i 


























Z25821 


P42126 


HS9B_HUMAN 


90-kDaHSP 


























U96132 


Q99714 


RLA2_HUMAN 


Acidic ribosomal phosphoprotein P2 mRNA 


























AB028893 


P04643 


CA14_HUMAN 


Alpha-1 chain of collagen IV 


























X79239 


Q02546 


APALHUMAN 


Apolipoprotein A-l 


























Ml 3934 


P06366 


BMP4.HUMAN 


Bone morphogenetic protein-2B (BMP-2B) 


























M60854 


P17008 y 


CSEI.HUMAN 


CAS chromosome segregation gene homologue 


























Ml 3932 


P08708 


CD81_HUMAN 


CD81 antigen 


























X6915Q 


P25232 


TCPZ_HUMAN 


Chaperonin containing TCP1, subunit 6A(zeta1 1 


























L06498 


P 17075 


CA21 .HUMAN 


Collagen alpha-2 type 1 
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Table 3. Continued 

TRANSCR1PT0MICS 



Accession 

Number Swiss-Prot Entry Name 




Yanagawa et al 2001 [21] 


Muro et at 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


1 

n 

s 

i 


£4 

"to 
c 


Parle-McDermott 2000 [20] 


Zhang et at 1997 [18] 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] j 


Name 


cDNA 


Adaptor-tagged 


SAGE 


< 
z 

£> 

o 


§ 

o 

o 


§ 

o 

o 


SAGE 


SAGE 


SAGE 


SAGE 


cDNA 


Oligonucleotide Array 


104483 


335265 


;02_HUMAN 


Complement component C2 


1 


1 






















M64716 


P25111 


R10A HUMAN 


Csa-19mRNA 


1 


1 






















X77770 


P02383 




DNA binding protein UPI, liver mRISIA fragment 


























W52460 


PI 4798 


TP2A_HUMAN 


DNAtopoisomerase It (top2) 


1 
























X557I5 


P23396 


IF39_HUMAN 


Eukaryotic translation initialionfactoriEIF3|rnRNA 


1 


— 






















M77234 


P49241 


IF2B_HUMAN 


Eukaryotic translation initiation factor 2, subunit 2 


























M 58458 


P12750 


GGH.HUMAN 


Gamma-glytamyl hydrolase 


























D16992 


PI 0660 


G3P2_HUMAN 


Glyceraldehyde 3 phosphate dehydrogenase 


























M77233 


P23821 


SYGJWMAN 


Glycyl-tRNA synthetase 


























F16294 


P09058 


RS8_HUMAN 


H19RNA 


























X61156 


P08865 


HS9A_HUMAN 


Heat shock Protein HSP 90-Alpha 


























136055 


Q13541 


HMGLHUMAN 


Hmgi mRNA for high mobility group protein 1 


























AA316619 


P04645 


RS23_HUMAN 


Human homolog of yeast ribosomal protein S23 
(D14530) 


























LI 4599 


Q15233 


IMD2.HUMAN 


IMP (inosine monophosphate) dehydrogenase 2 


























X79234 


P39026 


IFMLHUMAN 


Interferon induced transmembrane protein 1(9- 
27) 


























L06505 


P30050 


LDHB.HUMAN 


Lactate dehydrogenase B(LDH-B) 


























X64707 


P26373 


RS2.HUMAN 


LLRgd 3 


























X56932 


P40429 


4F2.HUMAN 


Lymphocyte activation antigen 4F2 large subunit 


























L25899 


P39030 


NPL1JHUMAN 


Nucleosome assembly protein 1 -like 1 


























X63527 


P14118 


GSHH_HUMAN 


Phospholipid hydroperoxide glutathione per- 
oxidase 


























X89401 


P46778 


GDFF_HUMAN 


Prostate drfferentation factor 


























H59771 


P23131 


RL6 HUMAN 


Ribosomal protein L 6 


























M94314 


P38663 




Ribosomal protein L18(RPL18| mRNA 


























LI 9527 


P08526 




1 Ribosomal protein L18a mRNA, complete cds 


























U 14968 


P46776 


RL7_HUMAN 


Ribosomal protein L7 


























UH969 


P46779 


RL8JWMAN 


Ribosomal protein L8 


























L38941 


P49207 


RS5_HUMAN 


Ribosomal protein S5 


























U 12465 


P42766 


RS19.HUMAN 


S 19 ribosomal protein 


























F19234 


P 18077 


MYC_HUMAN 


V-myc myelocytomatosis viral oncogene 
homolog (avain) 


























X66699 


PI 2751 


Q99497 


EST(PARK7;DJ1) 


1 


1 




1 
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Table 4. Results of transcriptomics research in human CRC. Most cited down-regulated genes out of 12 genome-wide or large-scale studies 

TRANSCRIPTOMICS 







S 

i 

CO 

% 

to 

% 

TO 

c 
to 

>- 


Muro et al 2003 [27] 


Buckhaults et al 2001 [24] 


Birkenkamp et al 2002 [25] 


<s 

% 

40 
M 

ce 
s 
5 


Un et ai 2002 [26] 


| 

a 

I 

a 

o 
S 

_2 

CO 

a. 


Zhang et al 1997 [18] 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


j Notterman et al 2001 [23] | 




Paired normal and cancer 


Normal and cancer 


e 

_ 
a. 

XJ 

es 
at 

<B 

E 
o 
e 
« 

T3 
fC 
U 
3 

E 
2 


Paired n. and pr.tumors 


Paired n. and pr.tumors 


Paired n.muc and pr.tum or adenoma 


Cell lines 


E 
3 
§ 

-o 
c 
to 
o 

i 

i 

o 
Z 


Primary tumors and cell lines 


Cell line 


Paired n.muc and pr.tum,n and adenoma 


Paired n.muc and pr.tum or adenoma 


Entrv Name Name 


< 
z 
a 

o 


Adaptor-tagged 


SAGE 


< 
z 
a 

o 


< 
z 
o 

o 


< 
z 
a 

o 


SAGE 


LU 

to 


SAGE 


SAGE 


< 
z 
a 

o 


Oligonucleotide Array 


CAH2_HUMAN 


Carbonic anhydrase II 


1 




1 




1 












1 


1 


CEA1.HUMAN 


EST (BILIARY GLYCOPROTEIN} 






1 


1 


1 












1 


1 


BENE_HUMAN 


BENE 


1 








1 






1 








1 


CAH1_HUMAN 


Carbonic anhydrase 1 


1 




1 


















1 


GUAU_HUMAN 


GCAP-tl guanylate cyclase activator 2B; Uroguanylin; UGN 


1 






















1 


FABLJWMAN 


L-FABP liver fatty acid-binding protein 1 


1 








1 
















ACDS.HUMAN 


Acyl-CoenzymeA dehydrogenase C-2 to C-3 short chain 






















1 


1 


ADHG_HUMAN 


ADH gama 2 subunit (aa-1-375| 










1 














\ 


T4S3_HUMAN 


Co-029; transmembrane 4 superfamily member 3 






1 




















DRAJWMAN 


Colon mucosa-associated (DRA) 
























1 


KCRB_HUMAN 


Creatine kinase 


1 
























GUAN.HUMAN 


GUCA1B guanulate cyclase activator 1B/guanylin 
























1 


095784 


IgG Fc-binding protein 


1 








1 
















ITMC_HUMAN 


Integral membrane protein 2C 










1 


1 














MK03_HUMAN 


MAPK3 






















1 




FXY3.HUMAN 


MAT8 protein 










1 
















ACTB_HUMAN 


mRNA fragment encoding cytoplasmic actin 


1 












1 


1 










SELP_HUMAN 


Selenoprotein P 










1 










1 






EST1_HUMAIM 


Liver carboxylesterase 1 [Precursor] 


















1 


1 




1 


CFAD.HUMAN 


Adipsin, complement factor DCDA (EST) 
























1 


ATPB.HUMAN 


ATP5B ATP synthase 










1 
















CAHC.HUMAN 


CA12 










1 
















CANS_HUMAN 


Calcium dependent protease (small subunit) 
















1 










CALM_HUMAN 


Calmodulin-I (CALM1) mRNA, 3' UTR, partial sequence 
















1 






1 




CAH4_HUMAN 


Carbonic anhydrase IV » 








1 
















1 



© 2005 WIL£Y-VCH,Verlag GmbH &Co. KGaA, Weinheim 



www.proteomics-journal.de 



3072 



E, Sagynaliev et al. 



Proteowics 2005, 5,3066-3078 



Table 4. Continued 

TRANSCRIPTOMICS 







Yanagawa et al 2001 [21] 


Muro et al 2003 [27] 


Buckhautts et al 2001 [24] 


Birkenkamp et al 2002 [25] 


Taketnasa et al 2001 [22] 


I 

% 

c 
3 


I 
1 

« 

o 
£ 

JO 

ca 
0. 


Zhany et al 1997 [18] 


SAGE-NET et al 1997 [19] 


Zhang et al 1997 [18] 


Williams et al 2003 [28] 


Notterman et al 2001 [23] | 


Entry Name 


Name 


cDNA 


1 

o> 
J3 

s 

to 
3 


SAGE 


Q 

o 


cONA 


«X 

z 
a 

a 


SAGE 


SAGE 


SAGE 


SAGE 


z 

O 


Oligonucleotide Array 


MT1E_HUMAN 


cDNA similar to gb: M10942_cds1 human 








— - 










1 






1 


000748 


CES 2 carboxylesterase 2 










1 
















CMGA HUMAN 


CgA 








— j— 
















1 


CLUSJHUMAN 


clusterin 
























1 


KlCT.HUMAN 


Cytokeratin 20 









— 










1 






1 


KDGA_HUMAN 


Diacylglycerol kinase 








... 
















1 


DTDJWMAN 


DTD sulfate transporter 










1 
















PLA8_HUMAN 


EST 122594 5 








— - 










1 






1 


ATPA.HUMAN 


FI-ATPase alpha subunit 










1 
















LEG3_HUMAN 


Galectin-3 (Galactose-specific lectin 3) (MAC-2 antigen) 










1 
















ABP.HUMAN 


HP-DA01 (diamine oxidase) 




















1 




1 ' 


K1CS.HU MAN 


Keratin 19 












1 








1 






K2C8_HUMAN 


Keratin, type 11 cytoskeletal 8 (cytokeratin 8) {K 8) (CK 8) 














1 • 






1 






DHB2_HUMAN 


LI 1708 Estradiol-17 beta- dehydrogenase 2 












1 












1 


MT1H_HUMAN 


Metallothionein 1 H 
























1 , 


MUC2_HUMAN 


Mucin 2, tntestinal/tracheal 














1 


1 










MEPA_HUMAN 


PPH alpha gene 




















1 




1 


PA2A_HUMAN 


RASF-A PLA2 gene 












1 










1 




MT1FJ1UMAN 


RNAhelicase-related protein 
























1 


SBP1_HUMAN 


SBP selenium-binding protein 












] 


1 












MT1L_HUMAN 


Serine theonine kinase 39 (STE20/SPS1 homolog.yeast) 












1 












1 


MYL6.HU MAN 


Myosin light polypeptide 6 


















1 




1 




TETN_HUMAN 


TNA tetranectin 












1 












1 


VIPR.HUMAN 


VIPRt vasoactive intestinal polyoeptide receptor 1 












1 












1 


PLA8.HUMAN 


Placenta-specific gene 8 protein 




















i 




1 


Q9NXM9 


Hypothetical protein FU20151 












1 












1 


T4S1„HUMAN 


Transmembrane 4 superfamily member 1 








1 












i 






ALU2_HUMAN 


Alu subfamily SB sequence contamination warning entry 






1 






1 














S116JHJMAN 


S100 calcium-binding protein A16 


















1 


i 






C14A HUMAN 


Dual specificity protein phosphatase CDC14A 




















i 


1 





Second, results have been obtained using heterogeneous 
samples in particular cell lines, whole tissue biopsies, and 
epithelial cells purified from surgical specimens. Results 
obtained in cell lines do not allow accurate comparison be- 
tween normal and cancer cells, and the presence/absence of 
proteins of interest has to be confirmed in biopsies. For 
example, when 2-D PAGE protein patterns of normal human 
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colonic crypts were compared with the CRC cell line LIM 
1863, the proteins spots from normal crypts matched only 
75-80% of the cell line spots and the relative expression 
levels of a large number of proteins differed [3]. When clin- 
ical biopsies are examined, results of phenotypic compar- 
isons depend on the type of samples examined (e.g., hetero- 
geneous whole tissue biopsies with inflammatory cells, 
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Table 5. Results of proteomics research in human CRC. Most cited up-regulated (orange), down-regulated (green), or mentioned (yellow) 
proteins out of 12 large-scale proteomics studies 



Accession 

Number Swiss-Prot Entry Name Name 


Simpson et al 2000 129] 


Simpson et al 2001 [31] 


Demalte et al 2003 [33] 


Reymond et al 1999 [9] 


"5 

to 

f 
s 


Lawrie et al 2001 [30] 


Friedman et ai 2004(35] 


Stierum et al 2003(34] 


UJ 

CD 
CL. 
CO 
O 
C/3 


Free-flow electrophoresis 


2-0 PAGE 


2-D PAGE 


2-D PAGE 


LU 

9 


2-0 PAGE 


3 

CL 

9 

Wl 


AF041260 


075363 


APTR HI IMAM 




Vl.;. 




'J': 






w\ 




AA232508 


Q9Y397 


APTfJ Ml IMAM 


Anfin Pwtn nl a cm atir> 7 Iftamma-Artinl 












$n 


AA007218 


Q96QY8 


RTP Mil MAN 
0 1 r_nuiviHiii 


ftlittathinnn ^-trancf pracp M*3 fhrainl 
LI llild lI iiui la J uaiidiciaso ivio \uianif 




-■Mi 


:.::J| 


w 








X77584 


G01119 


Tt-lin Ml IMAM 

I niu_nuiviMiM 


Thinrartnvin i ATI -rioriwori fartnrl I ADFt 














X75821 


Q04984 


AMY'S Ml IMAM 
AIM AJ_n U IVI MIM 
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Table 6. Reproducibility of transcriptome results using proteomics tools (a) and vice versa (b). Proteomics results have been reproduced in 
67% of cases in transcriptomics studies. Results from transcriptomics studies are more difficult to reproduce using proteomics 
tools (25%). Only two gene products can be retrieved in three and more proteomics and transcriptomics studies (actin and creatin- 
kinase) 



a. Transcriptomics > Proteomics 
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necrosis, blood, stool, etc.), so that standardized sample 
preparation procedures are critical for obtaining reproduc- 
ible results. Several sample preparation methods have been 
described, in particular fluorescence-activated cell-sorting 
(FACS) [4], laser capture microdissection (LCM) [5-8], 
immunomagnetic beads separation [9], and cellular frac- 
tionation [10]. Unfortunately, these sample preparation 
procedures were barely applied in CRC. In our experience, 
the beads-based method is characterized by several advan- 
tages when compared with other cell purification proce- 
dures [11]. 

Third, gene expression patterns depend on the arrays 
technology platform. In transcriptomics studies any factors 
may affect the outcome of a microarray experiment, in par- 
ticular technical instrumental, computational, and inter- 
pretative factors. In fact, lack of reproducibility and accuracy 
is a major concern in microarray studies [12). When cross- 
platform comparison was performed, reproducibility was 
insufficient: only four genes from a set of 185 common genes 
selected behaved consistently on three array platforms, and 
agreement of about 30% was found between two brands [1 3]. 

Fourth, in proteomics studies, 2-D PAGE or 2-D DIGE 
have well-known technological limitations. In CRC, even 
after epithelial cell enrichment using magnetic beads, the 
mean CV of repeated 2-D PAGE analysis with silver staining 
was found to lie between 20 and 28%. Only 47% (interrun) to 
76% (intrarun) of spots could be matched within a triplicate 
experiment. Interindividual phenotypic variability was high. 
Thus, even under well-defined experimental conditions, 2-D 
PAGE parallel analysis of paired CRC samples is hampered 
by a significant variability [2]. 

Fifth, the methods applied for generating, formatting, 
storing, retrieving, and querying data are of outmost impor- 
tance to assess methodological and biological variation in 
gene expression analysis, Unfortunately, due to the small 
sample size (number of patients), large number of variables 
examined at once, and absence of double or triple experi- 
ments (arrays and gels are expensive and samples are rare) 
statistical analysis is often not valid. In. particular, assessing 
the reproducibility of a variable is necessary (e.g., using the 
intraclass correlation coefficient) for comparing multiple 
samples at once. The use of median values instead of mean 
values has been shown to improve data correction [14]. It has 
also been proposed to use housekeeping genes as endoge- 
nous controls [15]. A dedicated society, The Microarray Gene 
Expression Data (MGED) Society, has been formed to facil- 
itate the sharing of gene expression data generated by func- 
tional genomics and proteomics experiments [16]. 

Finally, correlation between results of transcriptomics 
versus proteomics results is low. For CRC, there is no pub- 
lication comparing mRNA and protein expression for a 
cohort of genes. However, extrapolation is reasonable from 
another epithelial cancer (lung adenocarcinoma), where 
such comparison has been performed. Only a subset of the 
proteins (17%) exhibited a significant correlation with 
mRNA abundance [17]. 
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Obviously, many genes and factors found to be differen- 
tially regulated (both in transcriptomics and proteomics 
studies) do not play a causal role in CRC carcinogenesis. For 
example in the studies under investigation, at least 17 
mRNAs encoding ribosomal proteins were identified to be 
dysregulated using cDNA arrays and 39 ESTs using SAGE 
technology. This broad dysfunction of protein synthesis, in 
particular of small molecules synthesis, has been reported 
not only in cancer but also in several other human diseases, 
where etiologies have been linked to mutations in genes of 
the translational control machinery [36]. 

However, some findings might be of particular interest 
in human CRC. For example, a small molecule group found 
to be dysregulated in human CRC is the 14-3-3 proteins 
family. 14-3-3 proteins are ubiquitous within all eukaryotic 
cells and participate in protein kinase signaling pathways. In 
particular, they are involved in phosphorylation-dependent 
protein-protein interactions that control progression 
through the cell cycle, initiation and maintenance of DNA 
damage checkpoints, activation of MAP kinases, prevention 
of apoptosis, and coordination of integrin signaling and 
cytoskeletal dynamics [37]. Given the prevalence of specific 
14_3-3 isoforms expression in several human epithelial can- 
cers [38, 39], these proteins may be involved in cancer 
tumorigenesis and particular isoforms may be useful as 
therapeutic targets inhuman CRC. 

In summary, we propose a gene expression data ware- 
house in human CRC that is intended to help researchers 
active in the field to get an overview of the data available. 
However, reproducibility of results obtained in different 
studies was disappointing. 

As a matter of fact, the development of some types of 
unified processes for generating information, formatting, 
storing, retrieving, and querying data, regardless of the tech- 
nology used to generate it, is of central importance for 
building gene expression databases. Such unified processes 
have been proposed (see above) but have found only limited 
recognition so far. This is unfortunate because these metho- 
dological issues must be solved before trying to integrate 
experimental data from different sources into functional 
proteomics studies at the bench or in silico. The present 
meta-analysis of gene expression highlights the need for 
including a sufficient number of patients, analyzing only 
purified epithelial cells, using clinical standards (such as the 
international classification of disease, histopathology, and 
staging), assessing the variability of the technology applied, 
considering common standards for microarray data annota- 
tion and exchange, and finally for developing software 
implementing these standards and promoting the sharing of 
these high quality, well-annotated data within the life sci-. 
ences community. 

In the absence of such unified processes, reproducibility of 
results within a laboratory will continue to be low, confirma- 
tion of results between different groups will stay difficult, and 
identification of diagnostic and therapeutic targets will remain 
a lottery. These prpblems are challenging the whole drug dis- 
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covery process from the very beginning, namely target identi- 
fication, so that target validation, assay development, and 
prioritization of compounds remains a high risk endeavor. 

We are very grateful to C. Nestler for her excellent technical 
assistance. 

5 References 

[I] Friedman, D. B., Hilt, S., Keller, J. W ef al., Proteomics 2004, 
4,793-811. 

I2l Ott, V., Guenther, K., Steinert, R. etal, Pharmacogenomics J. 
2001, 7, 142-151. 

[3) Ji, H., Whitehead, R. H., Reid, G. E. etal., Electrophoresis, 
1994, 75,391-405. 

[4] Reymond, M. A., Sanchez, J. C, Schneider, C. etal., Electro- 
phoresis 1997, 78,622-624. 

(51 Kondo, T, Seike, M., Mori, Y. etal., Proteomics 2003, 3, 1758- 
1766. 

[6] Banks, R. E„ Dunn, M. J., Forbes, M. A. etal., Electrophoresis 

1999, 20, 689-700. 
[7] Craven, R. A., Banks, R. E., Prot eomics 2001, 7, 1200-1204. 
[8] Craven, R.A,Totty, N., Harnden, P. etal., Am. J. Pathol. 2002, 

750,815-822. 

[91 Reymond, M. A., Sanchez, J. C., Hughes, G. J. etal., Electro- 
phoresis 1997, 18, 2842-2848. 
[10] Szymczyk, P., Krajewska, W. M„ Jakubik, J. ef al, Tumori 
1996, 82, 376-381. 

[II] Kellner, U., Steinert, R., Seibert, V. etal., Pathol. Res. Pract. 
2004*200, 155-163. 

[12] Shi, L.,Tong, W., Goodsald;F. etal, ExpertRev. Mo/. Diagn. 
2004, 4, 761-777. 

[13] Tan, P. K., Downey, T. J., Spitznagel, E. L. Jr. etal., Nucleic 
Acids Res. 2003, 57, 5676-5684. 

[14] Pellis, L., Franssen-van Hal, N. L, Burema, J., Keijer, J., Phy- 
siol. Genomics 2003, 16, 99-106. 

[15] Janssens, N., Janicot, M., Perera,T. ( Bakker, A., Mot. Diagn. 
2004, 8, 107-113. 

[16] www.mged.org/, consulted on December 25, 2004. 

[17] Chen, G„ Gharib,T. G., Huang, C. C. ef al., Mol. Cell. Prote- 
omics 2002, 7,304-313. 



[18] Zhang, L, Zhou, W., Velculescu, V. E. ef al., Science 1997, 
276, 1268-1272. 

[19] www.sagenet.org/cancerAable1.htm, consulted on Decem- 
ber 1,2003. 

[20] Parle-McDermott, A., McWilliam, P. f Tighe, O., Dunican, D., 

Croke, D. T„ Br. J. Cancer 2000, 83, 725-728. 
[21] Yanagawa, R., Furukawa, Y., Tsunoda, T. ef al., Neoplasia 

2001,3,395-401. 
[22] Takemasa, t., Higuchi, H., Yamamoto, H. et al., Biochem. 

Biophys. Res. Commun. 2001, 285, 1244-1249. 
[23J Notterman, D. A., Alon, U., Sierk, A. J., Levine, A. J., Cancer 

Res. 2001,57, 3124-3130. 
[24] Buckhaurts, P., Rago,C, St. Croix, B. etal., Cancer Res. 2001, 

61, 6996-7001. 

[25] Birkenkamp-Derntroder, K., Christensen, L. L, Olesen, S. H. 

etal., Cancer Res. 2002, 61, 4352-4363. 
[26] Lin, Y. M., Furukawa, Y., Tsunoda, T. ef al., Oncogene 2002, 

27,4120-4128. 

[27] Muro, S., Takemasa, I., Oba, S. etal., Genome Biol. 2003, 4, 
R21. 

[28] Williams, ISL S., Gaynor, R. B f Scoggin, S. etal., Clin. Cancer 

Res. 2003, 9, 931-946. 
[29] Simpson, R. J„ Connolly, L. M., Eddes, J. S. ef al., Electro- 

phoresis 2000, 21, 1707-1732. 
[30] Lawrie, L C, Curran, S., McLeod, H. L, Fothergill, J. E., 

Murray, G. I., Mol. Pathol 2001, 54, 253-258. 
[31] Simpson, R. J., Proteomics 2001, 1, 807-818. 
[32] Medjahed, D., Luke, B. T., Tontesh, T. S. ef al., Proteomics 

2003,3, 1445-1453. 
[33] Dematte, I., Swiss-2-D PAGE database: Search for 

DLD1_Human. http://www.expa sy.org, Release 16 and 

updates up to 17-Nov-2003. 
[34] Stierurn, Gaspari, M- t Dommete, Y M et al., Biochim. Bio- 
phys. Acta 2003, 1650, 73-91. 
[35] Friedman, D. B., Hill, S., Keller, J. W. etal., Proteomics 2004, 

4,793-811. 

[36] Calkhoven, C. F., Muller, C, Leutz, A-, Trends Mol. Med. 2002, 
8, 577-583. 

[37] Wilker, E., Yaffe, M. B., J. Mol. Cell Cardiol. 2004, 37, 633-642 
(review). 

[38] Qi, W., Liu, X., Qiao, D. ef al., Int. J. Cancer 2005, 7 73, 359- 
363. 

[39] Moreira, J. M., Gromov, P., Celis, J. E., Mol. Cell, Proteomics 
2004, 3, 410-419. 



© 2005 WIUEY-VCH Verlag GmbH & Co. KGaA, Weinheim 



www.proteomics-journal.de 



17 

Proteomics 



Kathryn Lilley. Azani Razzaq and Michad J. Deery 



17.1 Introduction 

There are now numerous organisms whose genome sequences are known, for 
example Arabidopsis (Arabidopsis Genome Initiative, 2000), Drosophila (Adams 
et al. f 2000), human (International Human Genome Sequencing Consortium, 
2001) and rice (Yu et al. 9 2002). The prediction of open-reading frames from 
these genomic sequences has enabled the comprehensive identification of many 
putative protein sequences. These proteins can be arranged into three categories, 
namely those of known function, those with recognisable motifs and hence a 
vague idea of function, and those with no sequence similarities to any pro- 
tein (Gabor Miklos and Malenszka, 2001). Many proteins reside in this latter 
'functional vacuum' which could represent as much as 30% of the predicted 
proteins. Determining protein function is key to nnHmtajrf\ n g cellular mecha- 
nism. Studying how protein expression is modulated in response to a given set 
of circumstances, such as infection, disease, developmental stage, senescence or 
response to drugs, will facilitate the elucidation of disease pathways and thus 
provide a mechanism for diagnosis and therapy. 

DNA chips (mRNA profiling studies) can contribute to the study of gene 
expression in response to a particular biological perturbation. However, the 
extrapolation that changes in transcript level will also result in corresponding 
changes in protein amount or activity cannot always be made. Tb understand 
fully, we need integrated data sets from a variety of protein expression studies, 
providing information on relative abundances, sub-cellular locations, protein 
complex formation and the profiling of isofonns generated by either alternate 
mRNA splicing or post-translational modifications. Proteomics is the word now 
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commonly used to describe the discipline associated with the acquisition of 
these data sets. 

17.2 Definitions and applications 

Before embarking on an overview of die techniques used in the field of pro- 
teomics, it is necessary to define a few terminologies; pioteomics, functional 
genomics, structural genomics and post-genomics are terms that have crept into 
the scientific vocabulary with alarming stealth and are used freely, and in some 
cases interchangeably, especially in cases where the use of such *buzz words 9 
is likely to increase the attendance £t departmental seminars. 

• Genome: A genome represents the entire DNA content in a particular cell, 
whether or not it is coding, non-coding, or is located either chromosomaliy 
or extra-chromosomally. 

• Genomics: Genomics is the study of the genome, interrogating the complete 
genome sequence using both DNA and RNA methodologies. 

• Proteome: The proteome represents the complete set of proteins encoded by 
the genome. 

• Proteomics: Proteomics is the study of die proteome and investigates the cel- 
lular levels of all the isoforms and post-translational modifications of proteins 
that are encoded by the genome of that cell under a given set of circumstances. 

• Functional genomics: This is the study of die functions of genes and their 
inter-relationships. 

Whilst a genome is more or less static, die protein levels in a particular cell can 
change dramatically as genes get turned on and off during the cell's response to 
its environment The proteome originally was defined over seven years ago as 
'all the proteins coded by the genome of an organism' (Wasinger et a/., 1995). 
Nowadays the term 'proteomics* is used more widely and implies an effort to 
link structure to function by whatever means are appropriate. We can expect die 
definition to change again with time as die field and die investigator's view of 
it evolves. 

17 3 Stages in proteome analysis 

Proteomic analysis can ascertain function either by looking for changes in the 
expression of either all or a subset of proteins, or by identifying binding partners 
for particular proteins and seeing how their interaction is affected by biological 
perturbation. Whatever the rationale of the investigation, or the number of pro- 
teins involved, the study of the proteome can be broken down into the following 
stages of analysis. 
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Gene Expression Analysis 
Using Microarrays 

Sophie E. Wildsmith and Fiona J. Spence 



13.1 Introduction 

Microarrays are of increasing interest to both industry and academia as tools 
for 'gene hunting' and also as quantitative methods for routine analysis of large 
numbers of genes. Techniques such as real-time polymerase chain reaction (RT- 
PCR) TaqMan™ and SybrMan™ are generally considered to be more accurate, 
robust, larger in dynamic range and less capital intensive, but for rapid, large- 
scale gene expression analysis using limited mRNA, microarrays and gene chips 
are preferred. 



13.2 Microarray experiments 
Platforms 

Global gene expression platforms are now available in multiple formats, 
including cDNA anrays, oligonucleotides spotted onto slides or in situ 
synthesised oligonucleotide arrays manufactured using photolithography. 
Commercial sources for these include Stratagene (La Jolla, CA), Memorec 
(Kttln, Germany) and BD Biosciences (Oxford, UK) for cDNA microarrays, 
Mergen Ltd (San Leandro, CA) for spotted oligomers and Affymetrix (Palo 
Alto, CA) for oligoarrays synthesised in situ. Purchasing from a supplier is more 
expensive than generating microarrays in-house ? although the latter is beneficial 
in labour-intensive institutions or when proprietary gene information is utilised. 
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'Off-the-shelf microarrays may also have the advantage of rigorous quality 
control and standardised protocols. 

It is possible to produce oligonucleotide spotted arrays in-house, by design- 
ing oligonucleotide sequences that match genes of interest and then purchasing 
purified oligonucleotides to spot down on glass or other substrates. Alternatively 
there are new systems such as that available from CombiMatrix (Mukilteo, 
WA, USA) for computer-aided design and in situ synthesis of oligonucleotides. 
However, production of cDNA microarrays is currently the most affordable and 
popular method and is now well established. Numerous sources of informa- 
tion on cDNA microanay fabrication are available in the literature and on the 
internet (Bowtell, 1999; Cheung etal % 1999; Wildsmith and Elcock, 2001 and 
http://crngm.stanford.edu/pbrovm). Thus, this chapter will focus on the 
implementation of experiments and analysis of data ftojn cDNA microarrays. 
The experimental procedure differs slightly according to the number of fluo- 
rophores (or channels) and the type and manufacturer of the array. We have 
attempted to describe a generic process, indicating where possible the different 
options. Figure 13.1 demonstrates the procedure for a two-colour hybridisation. 




Figure 13.1 The microanay experimental process for two-colour hybridisations 



RNA Extraction 
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First, RNA is extracted from the tissue or cells of interest. The quality of the 
RNA extracted is paramount to the overall success of the microarray experi- 
ment, as impurities in the sample can effect both the probe labelling efficiency 
and also stability of the fluorescent label (Hegde et al. % 2000). Snap-freezing 
of tissue in liquid nitrogen, immediately after harvesting, is used to preserve 
RNA integrity. Any further sectioning of the tissues should be carried out under 
RNAse-free conditions (Fernandez et al. t 1997). Total RNA can be extracted 
using kits such as TRIzol® (Invitrogen, Paisley, Scodand) and Rneasy (Qia- 
gen, GmbH, Hilden, Germany). Some researchers perform a further extraction 
of mRNA; this results in a purer starting material but has the disadvantage of 
lower yields. Affymetrix recommend between 5 and 40 |xg of total RNA is 
required for their GeneChips™ and 10 \ig or less is the required amount of 
starting material for cDNA microarrays (Hegde et a/., 2000). 



Sample .Labelling 

Hie mRNA is transcribed in vitro, with the concomitant inclusion of labelled 
nucleotides. The labels may be fluorescent or radioactive. In the case of dual 
channel/colour hybridisations, two samples will be labelled with dyes that 
fluoresce at different wavelengths, with different emission spectra. Example 
fluorophores, available coupled to nucleotides, are Cy3, Cy5, fluorescein 
and lissamine. Wildsmith etal (2001) have demonstrated that AlexaFluor 
546dUTP™ (Molecular Probes, Leiden, The Netherlands) gives a significantly 
higher signal than Cy3dCTP (Amersham Biosciences, Piscataway, NJ, USA). 
When performing two-colour hybridisations the control sample and 'test* sample 
are labelled with different fluorophores and the subsequent cDNA is then mixed 
together and hybridised simultaneously (Nuwaysir et al, f 1999). An advantage of 
simultaneously hybridisjng control and treated sample is that it obviates the need 
to control for differences in hybridisation conditions or between microanays. A 
specific example of the huge impact this technique has had includes its use 
in the first published account of gene expression data of the entire genome of 
Saccharomyces cerevisiae (DeRisi et a/., 1997). In two-colour hybridisations, 
one would assume that the properties of the two fluorescent dyes being used 
arc equivocal. In fact, for Cy5 and Cy3 this is not the case as Cy5 has been 
reported to give higher background fluorescence and also is more sensitive to 
photobleaching than Cy3 (Van Hal et a/., 2000). In addition, there is evidence 
from several independent sources that the combination of Cy3 and Cy5 dye 
labelling can affect data in certain genes. That is to say, when experiments 
are repeated and the dye combination for the* two probes reversed, inconsistent 
results are obtained with certain genes (Taniguchi etal f 2001). Despite these 
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Treated : Cy 3-532 nm Control : Cy 5-635 nm 




Figure 13.2 Two-colour fluorescent scan of human gene cDNA array. The probe mix con- 
sists of DNA from HcpG2 control cells and cells treated with buthionine sulfoximine for 6 h. 
A colour version of this figure appears in the colour plate section 

facts, two-colour hybridisations are widely accepted throughout die microarray 
community and an example of an image is shown in Figure 13.2. 

Hybridisation and Processing 

After labelling, the cDNA is purified (to remove unincorporated nucleotides), 
mixed with a hybridisation buffer and then applied to a cDNA microarray slide. 
The sample and the slide are heated prior to hybridisation in order to separate 
double-stranded DNA. A coverslip is applied (Shalon et a/., 1996), or prefer- 
ably a hybridisation chamber is used to avoid evaporation and enable an even 
hybridisation. The hybridisation and subsequent wash steps are carried out at a 
buffer stringency and temperature that enables hybridisation of complementary 
strands of DNA but reduces non-specific binding. 

Image Capture and Image Analysis 

After hybridisation the microarray slides are scanned, using either a laser or a 
phosphorimager (depending on the type of label used). There are many different 
suppliers and models of fluorescence scanner, for example the ScanArray 5000 
(Perkin Elmer Life Sciences, Zaventem, Belgium), GenePix 4000B (Axon GRI, 
Essex, UK) and the GeneArray® (Affymetrix, Santa Clara, CA). The choice of 
scanner is determined by sensitivity, resolution, flexible wavelength, file size 
generated, throughput and technical support available. 

Images are analysed using software that measures the intensity of the sig- 
nal from the hybridised spotted genes (spots), which provides a measurement 
of the amount of cDNA bound. Thus the initial concentration of messenger 
RNA is inferred. Early software packages 'drew* grids around the spots and 
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integrated across the whole area of the grid. This overcame problems associ- 
ated with accurate location of the spots, which is problematic, especially if the 
spots on the printed arrays are poorly aligned. More recent versions of software 
'draw' circles around the spots themselves and perform measurements within 
and outside of this boundary. For example, the background may be calculated 
from a region outside the spot boundary. The intensity of the signal from the 
spot may be calculated using median, mode or mean values of the pixels within 
the spot. Researchers differ in their preferences regarding using median or mean 
values (Hegde et al. f 2000) and this is likely to depend upon the protocols and 
software used. 

Image analysis software commonly is supplied with scanners or can be 
bought from the same supplier. This has the advantage of being opti- 
mised for that specific type of microarray and the benefit of upgrades 
and technical support. Software for microarray analysis is available 
from BioDiscovery (http://www.biodiscovery.com), Imaging Research 
(http: //imagingreaearch. com), GenePix Pro (Amersham Biosciences, Pis- 
cataway, NJ), anaySCOUT 2.0 (http://www.lionbioscience.com), NM 
(http : //www. nhgri .nih.gov/DIR/LC0/15K/HTML/imgjinalysis .html), 
Stanford University (http : //rana . Stanford . EDU/sof tware) Media Cyber- 
netics (Silver Spring, MD t USA) and TIGR (http//www . tigr . org/sof t lab). 
Important criteria for image analysis software include speed, ease of use, 
automation and the ability to distinguish artefact from real signal (Wildsmith 
and Elcock, 2001). 

As the technology has evolved and more experience gained, it has become 
more and more apparent that the most significant issues facing microarray users 
are the processing of the vast quantities of data generated and deciding exactly 
what tools are the most appropriate for data analysis. Because of the enormity 
of this, we have dedicated a complete section to describing the current status of 
this area. 



13.3 Data analysis 

It is important to be cognisant of the fact that the practical laboratory aspects of 
using microarrays are only part of gene expression analysis. Many researchers 
generate vast volumes of data, without a clear understanding of how to manage 
and interpret them. Furthermore, the variability in microarray data confers addi- 
tional problems for analysis. In some cases the purpose of the experiment will 
be a gene-hunting exercise, in which case a cursory indication of potential gene 
biomarkers is sufficient analysis. In other instances, such as pathway mapping 
and screening studies, it is paramount that results are statistically meaningful 
and valid. The next few sections detail some relatively simple analysis meth- 
ods and recommendations for the benefit of researchers with minimal statistical 
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Figure 13 J The ideal microarray experimental design and process 

training. There are also suggestions for more advanced analysis for those who 
have the assistance of a statistician or specialist data analyst. 

Most of the steps before, during and after performing a microarray experi- 
ment are optimally conducted with regard for statistics and data analysis. Careful 
planning before implementation facilitates the downstream analysis and inter- 
pretation of data. The following model summarises the entire microarray process 
with integration of the biological and data analysis components (Figure 13.3). 

Hypothesis Generation 

Any study is conceived for the purpose of investigating or obtaining supporting 
evidence for a biological hypothesis. Giving time at this early stage to consider 
downstream implications will pay dividends later. It is helpful if, rather than 
simply stating the aims of the experiment, the researcher asks the question 'What 
results do I expect?' or 'what answer will validate/invalidate my hypothesis?'. 
This 'reverse-engineering' proves useful in focusing the project, assessing the 
feasibility of the woric, providing early preparation for data management and 
analysis and, importantly, in managing expectations with regard to outcomes. 

A good example of carcfiil experimental planning is demonstrated by Golub 
etal. (1999) in the classification of acute leukaemias in order to distinguish 
between acute lymphoblastic leukaemia (ALL) and acute myeloid leukaemia 
(AML). Distinguishing between ALL and AML using conventional techniques 
is known to be a difficult task. The researchers maximised their probability of 
success by choosing an easier, more defined model (normal kidney vs. renal 
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cell carcinoma), on which to validate their analytical methods. In doing so they 
established that their techniques were suitable for classifying tissues according 
to disease and gained confidence in their approach before using the samples of 
real interest. 

Optimisation Experiments 

Although microarrays are becoming increasingly accessible to all, using these 
tools requires experience and it is unlikely that successful experiments will be 
conducted immediately. It is usual that some time is given to optimising a sys- 
tem for any specific application, for example for a given tissue or cell type. 
Additionally, the requirements for a given system may warrant some modifica- 
tions. The standard approach for a scientist to take is to vary one parameter, 
whilst keeping all others constant. This is time-consuming and does not take into 
account the interactions between different factors. Well-designed, multifactorial 
experiments (Box et a/., 1978), provide a faster route for optimisation, with a 
statistical measure of confidence. An example of this technique is in the opti- 
misation of microarray experimental conditions for preparation of fluorescent 
probes from rat liver tissue (Wildsmith et a/., 2001). When a major source of 
variation is revealed this can be investigated further with a view to minimising 
it or providing sufficient replicates to account for it 

Design of Experiments 

Once confidence in the experimental procedure has been obtained the researcher 
is likely to have gained an insight into the reproducibility of the system. This 
assists in the design of the experiments, in particular in determining the mini- 
mal number of replicates necessary. Replication can be implemented at many 
stages - from biological samples through to microarray slides. 

Owing to the enzyme-catalysed transcription reactions, a large amount of 
variation occurs during the probe-making stages in microarray experiments. Our 
work indicated that replicates should be made at this step and a minimum of six 
replicate probes are made for microarray experiments (Wildsmith et ai f 2001). 
These can be pooled or hybridised separately onto six microarray slides. 

Lee et al. (2000) have examined the effect of the different location of cDNA 
spots oh the glass slides and concluded that replicates are essential to provide 
meaningful data and to enable reliable inferences to be drawn. 

With regard to commercially available gene chip systems, such as that avail- 
able from Affymetrix (see Figure 13.4), the variation between chips, within a 
batch, is likely to be low due to stringent quality control and highly automated 
manufacture. The use of an automated wash station also reduces variability in 
intensities between chips. However, using a multi-step approach in the probe 
preparation and subsequent antibody binding steps may lead to variation between 
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Figure 13.4 The GeneCbipft Instrument System. From left to right, the hybridisation sta- 
tion, scanner and workstation. Image courtesy of Asymetrix. A colour version of this figure 
appears in the colour plate section 



replicate, samples prepared on different days. Pooling of reagents within an 
experiment, and analysing controls together with treated samples, will both 
reduce the variability within a given experiment. 

Conduct of Experiment 

At this stage some attention may be required for verifying and validating pro- 
cesses. For example, checking that the imaging instruments give consistent 
results across the slide, on repeat use and from day to day. If two imagers 
are used it is important to verify that the results from both machines are com- 
parable. Some laboratories read fluorescence of one channel and then adjust the 
laser intensity of the second channel in order to obtain comparable readings. 
This is a method of normalising for the difference in intensities of fluorophores. 
It is important to be aware that this approach has a number of drawbacks. The 
arbitrary value of the second laser intensity setting will vary from experiment 
to experiment; thus comparisons of this channel cannot be made across experi- 
ments. Also the response of the fluorophore may not be linear across the laser 
intensity settings and this can lead to additional errors. 

Another area for investigation prior to running the study itself is the image 
analysis component. Depending on the software used, the image analysis pack- 
age may process the data to some extent, for example automatic background 
subtraction. Full understanding of the software is required so that it is clear at 
what point the data are 'raw', and the extent of inherent, inseparable manip- 
ulation. Effort may be required to determine the optimum settings for any 
software parameters. 

As data are generated it is important to be aware of the data integrity - for 
example ensuring that all data are collected, so that there are no missing data that 
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Figure 13.5 The relative fluorescence units (RFUs) of 1248 genes on seven microar- 
ray slides that were, hybridised with cDNfA made from the liver of .a rat. treated with 
acetaminophen. Note the gene outliers at approximately gene number 480 

can complicate analysis later. The researcher may be intuitively aware of any 
spurious results and should be alert for anything extraordinary that could indi- 
cate problems, for example hybridisation intensities appearing inconsistent from 
sample to sample. Data analysis at this point can be a rapid indicator of dubious 
results. For example, Figure 13.5 shows a plot of the fluorescent intensities of 
1248 genes that were hybridised with probe derived from acetaminophen-treated 
rat liver tissue. The data appear consistent, with the exception of peaks in inten- 
sity on one slide at around gene 4800. Further investigation of the microarray 
revealed a large artefact that had been missed by the image analysis process 
(Figure 13.6). 

Raw Data Generation and Storage 

One issue that arises when carrying out microarray analyses is how much data 
to store and in what form. For Good Laboratory Practice (GLP) purposes, often 
required in industry, storage of the raw data is necessary. This could be construed 
as the microarray image. Storing the image analysis results requires far less 
storage space and is easier to visualise, but it has the drawback that image 
analysis cannot be redone should superior software be available in the future. 
In reality, the methods used for microarrays are continually changing and the 
likelihood of revisiting old images on which the analysis has been performed, 
using outdated protocols is quite small. 
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Figure 13,6 Portion of scanned image showing region where artefact occurred that caused 
very high signals, which were classed as outliers 



Pre-processing 

A number of pre-processing steps are often used in microarray analysis. These 
include filtering, log transformation, normalisation and background subtraction. 
Filtering may be used before or after transformation in order to extract data 
from preferred regions of interest, or in order to remove outliers (see the 
above example relating to image analysis artefact). One example of filtering 
is the removal of individual gene replicates that lie outside a given number (for 
example 5) of standard deviations from the mean. Alternatively, data points that 
lie in the top/bottom few percentiles (e.g. 0.1%) of the data can be removed. This 
method of removing outliers is also called 'trimming'. It is acceptable if there 
is a large volume of data where only a small proportion of data is removed and 
if the same method is applied consistently across all data. Care must be taken 
in the way in which this is carried out in order not to delete genuine data. For 
example, if one gene is consistently high or low in expression across replicates, 
then it is unlikely to be an outlier 

Another method of detecting outliers is to plot all genes (see the section 
'Conduct of experiment , above) or to perform PCA analysis (see the section 
'Multivariate analysis' below) to detect replicate outliers. The use of a PCA 
plot to detect outliers is shown in Figure 13.7. 

Log transformation of data is accepted universally because the fluorescence 
data that are generated from microarrays tend to be skewed towards lower val- 
ues. There are scientifically valid reasons why ratios of raw expression values 
should not be used (Nadon and Shoemaker, 2002). When using two-colour 
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Figure 13.7 PCA plot of data used id Figure 13.4 showing one microarray (number 2) as 
an outlier 

hybridisations it is common to express the ratio of treated to control as a loga- 
rithm in base 2 (Quackenbush, 2001). Thus genes up-regulated by a factor of 2 
have a log 2 (ratio) of 1, and genes down-regulated by a factor of 2 have a log 2 
(ratio) of —1. 

Normalisation and background subtraction techniques are methods of data 
manipulation and their use is more subjective and often debated. The purpose of 
these techniques is to reduce the error (variability) that occurs between replicates 
and thus enable a comparison of data across samples. 

The theory behind background subtraction is that during hybridisation there 
will be non-specific binding to the slide. This will effectively 'darken 1 the 
image and give falsely high readings of fluorescent intensity. Correcting for 
the non-specific hybridisation should reduce error due to background staining. 
Background subtraction often occurs automatically in microarray image analy- 
sis packages. The software may circle the spot of interest and use the region 
beyond the periphery as the measurement of background. In cases of uneven 
hybridisation this method enables locally high background to be subtracted on 
a regional basis. One criticism of this approach is that the slide surface beyond 
the periphery is not similar, in chemical terms, to that where the nucleic acid 
has been deposited, and therefore cannot act as a real control for non-specific 
binding. A more accurate measurement of non-specific binding can be gained 
from using a region where spotting chemicals have been deposited, but no target 
is present. This concept is the basis of a method using local 'blank spots* (Wu 
e/ a/., 2001). 

A number of methods exist for normalisation of data. These include normal- 
ising to total signal or to a 'known* spot or gene, standardisation, or proprietary 
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methods. Normalising to total signal is the simplest approach, whereby die gene 
intensity is expressed as a percentage or proportion of the signal intensity for the 
entire array. This method works best when the total intensities for the microar- 
rays are similar and the number of changes is small compared with the number 
of genes. However, we often find, when using arrays of around 1000 genes, that 
pathological disease can up-regulate a large number of genes simultaneously. In 
this case, when total signal normalisation is applied, highly up-regulated genes 
wiU appear less up-regulated and genes that do not change from the control will 
appear down-regulated. 

Normalisation to a control value is a more popular technique. A control value 
can be obtained from using a gene known to remain constant under the condi- 
tions of the experiment. DeRisi et ai (1997) used a panel of 90 housekeeping 
genes for. normalisation, but found considerable variation in their gene expres- 
sion. Unfortunately it is very difficult to know with certainty that a gene will 
not change and there is evidence to suggest that so-called 'housekeeping genes' 
are variable (Savonet et al. % 1997). Other control genes can be derived from an 
alternative species; these should not be expected to hybridise. We have used 
yeast and Arabidopsis genes as negative controls for hybridisations of rat tis- 
sues. No orthologs were known to die genes selected; however, in. most cases 
non-specific binding occurred. 

If two or more microarray replicates appear to be different, but they are 
expected to be the same, then they can to? standardised. An example of this 
might occur if the total intensity of one microarray is greater than another, but 
the genes are proportionally equally up- or down-regulated. If the microarray 
sample spot data are assumed to be drawn from a normal distribution, then the 'z- 
transform' can be used. This requires that the mean and the standard deviation of 
the intensity values for each microarray are determined. The mean is subtracted 
from each individual gene value and the remainder is divided by the standard 
deviation. The intensity values from each microarray will then have the same 
mean and standard deviation. This has the advantage of facilitating comparison 
of microarrays with different dynamic ranges as well as total intensities. If the 
data are not normally distributed, then alternative non-parametric methods can 
be used, such as normalising to the median. 

Univariate Analysis 

Univariate methods of analysis involve examining one variable, or gene, at a 
time. This can be a very laborious task when examining a large volume of 
data, yet it is the preferred method of biologists. The simplest technique is to 
compare control and treated values and express the result as a 'fold-change* 
ratio. Typically, when examining small volumes of data, fold changes greater 
than 2 and less than 0.5 are considered meaningful (Quackenbush, 2001). This 
cut-off is essentially arbitrary and has the distinct drawback that microarray data 
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are not homoscedastic; that is, there is more variation about the mean at low 
values than there is at high values (Draghici, 2002). 

A second method for finding up- or down-regulated genes uses the standard 
deviation (SD) of the replicate gene data. Thus if changes greater than, say, 2 
SD from the log mean ratio are considerably greater than changes associated 
with 'noise', then they are considered significant This technique means that 
when looking at a large number of genes that are normally distributed there will 
be up- and down-regulated genes, regardless of whether there are (biological) 
changes (Draghici, 2002). 

Rather than using arbitrary cut-off values it is far more meaningful to express 
the fold-change in terms of either confidence intervals, or a 'p- value' (that 
is, the probability of the value occurring by chance). Thus, a fold-change of 
1.1 may be associated with a p- value of 0.001 and thus the probability that 
the gene is not up-regulated is 1 in 1000. Naturally, such small fold-changes 
may then be queried in terms of biological significance. One must then ask 
the question: Are large fold-changes more important (biologically) than small 
ones? Simple calculation of p-values for two data sets can be obtained using 
f-test functions in standard spreadsheet software. A number of replicates are 
necessary for this approach, and the data must be normally distributed. We 
have recently developed a method for calculating p-values for fold-changes that 
is not influenced by the distribution of the data or outliers and applied it to 
TaqMan™ and microarray data. Other complex and computationally intensive 
methods for calculating p-values are described in Draghici (2002) and Nadon 
and Shoemaker (2002). 

Multivariate Analysis 

Multivariate analysis of gene expression data is becoming increasingly popu- 
lar in the microarray community and in other biological domains where large 
volumes of data are generated. Multivariate analysis methods include princi- 
pal component analysis (PCA), factor analysis, multivariate analysis of variance 
(MANOVA) and cluster analysis. Currently, cluster analysis is the most widely- 
used method in the microarray community but PCA is growing in popularity 
(Crescenzi and Giuliani, 2001; Komi et al. % 2001). 

Quackenbush (2001) provides a good review of clustering tools that is rather 
unique in the regard that different clustering algorithms and linkage methods 
are presented. Clustering methods are unsupervised, and they are powerful tools 
for gaining insight into huge data sets. They enable the data to be partitioned 
in order to facilitate interpretation; however, they do suffer from subjectiv- 
ity. This is because the user selects various parameters, such as the algorithm 
used, linkage type, distance metric and, sometimes, cluster size. Whatever the 
data, clusters will always be identified, thus there is also a tendency to over- 
interpret the data - trying to attach meaning to clusters that may have no bio- 
logical relevance. 



282 GENE EXPRESSION ANALYSIS USING MICROARRAYS 



Software available for cluster analysis includes Cluster, the output of which 
is viewed in Treeview; both available from http://rana.stanford.edu/ 
software. This tool is particularly useful for clustering genes to identify genes 
that are co-regulated. 

The PCA is a visualisation tool that enables complex, high-dimensional data 
to be represented in two or three dimensions. It facilitates identification of groups 
of similar data, thus enabling inferences to be made about the samples. 

An example is shown in Figure 13.8. The figure shows the gene data for one 
sample (control rat liver) that was hybridised to seven microairays according 
to the method used in Wildsmith et at. (2001). Each microarray contained two 
replicate gene sets; thus there were 14 replicate gene sets in total. The gene sets 
comprised 1248 genes (with controls). All the data (14 x 1248 data points) was 
input into the analysis and the PCA plot displays the 14 replicates individually. 
The two axes are principal components 1 and 2. Principal component 1 (PCI) 
accounts for 65.5% of the variation in the data, whereas PC2 represents only 
13%. This means that the model accounts for 78.5% of variation in the data. 

The first principal component (PCI) accounts for as much as possible of 
the variation in the.original data and subsequent components (e.g. PC2) are of 
decreasing importance. Ttras, samples 8 and 9 are wy different from samples 1 
and 2. In terms of interpreting the PCA plot, it is immediately clear that there are 
three or four distinct clusters of data. These are marked by circles. Datapoints 
tend to cluster in pairs; for example replicates 1 and 2, 3 and 4, 5 and 6, 
etc. These are the duplicate gene sets on the same microarray. This indicates 
that the variation within the microarray is lower than the variation between 
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Figure 13.8 PCA plot of the microanray results from seven slides (2690, 2692, 2723, 2869, 
2876, 2879, 2880), each with two replicate spot sets (labelled 1-14), after hybridisation with 
control rat liver. See text for explanation 
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replicate microarrays. However, given that the same sample is applied to all 
the microarrays, we must ask why we get further separation of the replicates. 
The answer lies in the associated data. The four-digit numbers associated with 
the clusters are the microarray slide numbers that indicate when they were 
printed. Numbers that are more similar seem to be more closely related, and thus 
we can hypothesise that there were some differences between the slides, such 
as differences in slide backgrounds or changes during the printing process or 
change-over of batches, between slides in the 2600-2700 region and the 2800s. 

The PC A provides a clearer overview of the data than does cluster analy- 
sis. It is a rapid method for gaining an insight into the results, in particular 
where biological meaning can be attached to the components (Crescenzi et ai, 
2001). There are a number of packages for multivariate data analysis, including 
SIMCA-P (Umetrics, AB, Umea, Sweden) and The Unscrambler (Camo ASA, 
Norway), both of which are useful for PCA. 

Other tools for data visualisation include software packages such as Spot- 
fire.net (Spotfirc Inc., Cambridge, MA, USA) and GeneSpring (Silicon Graphics, 
San Carlos, CA, USA). Spotfire is particularly useful for visualisation of mul- 
tidimensional data and for visualisation of temporal data. It is possible to use 
these tools to identify genes (hat are co-ordinately expressed over time. 

Biological Interpretation 

After developing a sound experimental strategy, ensuring that the results are 
statistically valid, and after analysis of the data, it is down to the biologist to 
assemble die pieces of information that have been obtained. This intertwined 
information may include unexpected results that are contradictory to intuition or 
to published literature. One way to untangle the data is to map the relevant genes 
onto existing pathways and known functions. The Kyoto Encyclopedia of Genes 
and Genomes (KEGG), available at http://www.genome.ad.jp/kegg/, is a 
useful source of information, especially where the gene products are enzymes. It 
enables visualisation of the position of up- or down-regulated genes in metabolic 
pathways. 

The gene expression data obtained may differ from protein expression data, 
or information on gene product activity or location. When initiating a study 
it is useful to consider additional endpoints that can assist in the interpreta- 
tion of the data. For in vitro studies, these might include cytotoxicity endpoints, 
metabolites, key signalling molecules or perhaps protein expression. Waring 
et aL (2001) used tetrazolium dye reduction (MTT) as a measure of hepatocyte 
cell viability for their studies of gene expression in response to hepatotoxic 
insult. For in vivo studies, expression information on the tissue of interest could 
be supported by pathology, histology and blood chemistry measurements. Gene 
expression results could be confirmed by in situ hybridisations or protein activity 
assays. 
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13.4 Recent examples of microarray applications 

One area of rapid progress using microarray technology is the increased under- 
standing of cancer. Molecular pathologists are subgrouping cancers of tissues 
such as blood, skin and breast, based on differential gene expression patterns. 
For example, within a small group of breast cancer tissue samples, Perou ex a/. 
(2000) distinguished two broad subgroups representing those expressing or alter- 
natively lacking expression of the oestrogen receptor-a gene. The work was not 
conclusive, but never has progress in this field been so rapid when compared 
with the previous methods of gene identification. 

Another example of the impact of this technology is in the identification of 
two biomarkers for prostate cancer, namely hepsin and PIM1 (Dhanasekaran 
etal., 2001). 

Microarray technology has also accelerated the understanding of the molecu- 
lar events surrounding pulmonary fibrosis. Specifically, two distinct clusters of 
genes associated with inflammation and fibrosis have been identified in a dis- 
ease where, for years, the pathogenesis and treatment have remained unknown 
(Katsuma et al„ 2001). 

13.5 Conclusions 

Important factors in gene expression experiments include sensitivity, precision 
and reproducibility in the measurement of specific mRNA sequences (Schmittgen 
et al. t 2000). These quality metrics can be maximised by using, or fabricat- 
ing, high-quality microarray s, and by optimising each step of the microarray 
process. From conception to conclusion it is important to bear in mind the 
original hypothesis. 

Having considered the complexity of the microarray experiment, the value 
obtained from a meticulously designed experiment should not be underesti- 
mated. As the number of high-quality gene expression studies increases, we hope 
that the literature will contain increasingly detailed information that will help 
interpret complex gene expression changes, and thus elucidate the mechanisms 
of disease. 
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http : //rana . Stanford . EDU/sof tware 
http //www . tigr . org/ so f t lab 
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INCREASING NUMBERS OF HUMAN Dis- 
eases, both acquired and genetic, are 
being considered to be based at least 
in part on alterations in DNA se- 
quence. For most diseases, inheritance 
and acquisition are likely to be com- 
plex and polygenic. The efforts of the 
Human Genome Project to elucidate the 
structural genetic background by iden- 
tifying the chromosomal positions and 
genomic organization of between ap- 
proximately 30000 and 35000 human 
genes are nearly complete. 1 Based on this 
structural knowledge, a byproduct 
should be a better "scaffolding n to help 
link specific genes to susceptibility to 
various human diseases. However, to un- 
derstand how the products of these ge- 
netic linkages work together to orches- 
trate the initiation and progression of 
particular complex diseases, there will 
be a need to apply a functional genetic 
rather than a structural genetic ap- 
proach. 14 

Until recently, functional genetic 
studies have generally been of limited 
scope, only able to elucidate the role of 
1 or a few genes at a time in 1 system. 
Information on the specificity and rela- 
tive abundance of expression prod- 
ucts has traditionally been obtained by 
techniques such as RNA Northern blot 
hybridization and ribonuclease protec- 
tion assays. Somewhat more sophisti- 
cated methods, such as differential dis- 
play' and Serial Analysis of Gene 
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DNA microarrays represent a technological Intersection between biology and 
computers that enables gene expression analysis In human tissues on a ge- 
nome-wide scale. This application can be expected to prove extremely valu- 
able for the study of the genetic basis of complex diseases. Despite the enor- 
mous promise of this revolutionary technology, there are several Issues and 
possible pitfalls that may undermine the authority of the mlcroarray plat- 
form, We discuss some of the conceptual, practical, statistical, and logisti- 
cal Issues surrounding the use of microarrays for gene expression profiling. 
These Issues Include the Imprecise definition of normal In expression com- 
parisons; the cellular and subcellular heterogeneity of the tissues being stud- 
led; the difficulty In establishing the statistically valid comparability of ar- 
rays; the logistical logjam In analysis, presentation, and archiving of the vast 
quantities of data generated; and the need for conflrmatlonal studies that 
address the functional relevance of findings. Although several complicated 
Issues must be resolved, the potential payoff remains large. 
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Expression, 5 have been used to screen 
larger numbers of complementary DNA 
(cDNA) clones. However, technical 
limitations render these techniques 
nonconducive to large-scale genetic 
survey. 

To this end, a powerful new tech- 
nology is emerging, using hybridiza- 
tion to nucleotide arrays, the so-called 
gene chips. 6,7 This technological inter- 
section of biology and computers en- 
ables the reliable screening of a vast 
number of genes simultaneously and is 
amenable to automation. On a nylon 
membrane or glass surface, gene- 
specific cDNAs can be spotted, or oli- 
gonucleotides can be synthesized in situ 
by a combination of photolithography 
and oligonucleotide chemistry. This 



permits simultaneous monitoring of the 
expression of thousands of genes in a 
single step. Individual chips can be cus- 
tomized to include any chosen set of 
fully or partially characterized ge- 
nomic or expressed sequences. Chips 
can monitor over 50000 unique se- 
quences. The power of these chips lies 
in the potential for comparative expres- 
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sion studies in diseased vs normal 
samples, and in documenting changes 
at different stages during the natural 
course of the disease or in response to 
treatment. It provides the researcher 
with a new arsenal to analyze under- 
lying pathomechanisms on a grand scale 
and also to review the rationale of thera- 
peutic concepts. 

However, despite the enormous po- 
tential of this revolutionary technol- 
ogy, there are several issues and pos- 
sible pitfalls that attenuate the power 
of microarrays. First, the definition of 
normal in expression comparisons is 
neither precise nor unambiguous. Sec- 
ond, the heterogeneity of the tissues 
being studied complicates the mean- 
ing of the expression profiles. Third, the 
statistically valid comparability of ar- 
rays is an unresolved problem. Fourth, 
the vast quantities of data create a lo- 
gistical logjam for analysis* presenta- 
tion, and archiving. Finally, conflrma- 
donal studies are needed to corroborate 
the biological significance of micr oar- 
ray data (Figure 1). 

TROUBLE WITH NORMAL 

The standard normal vs diseased tis- 
sue type of comparison, which is the ba- 
sic design foundation of profiling stud- 
ies, may be more quicksand than 
bedrock. Normal is not so easy to de- 
fine—neither is diseased. Gene expres- 
sion in normal tissue is likely to be de- 
pendent on several factors involving 
patient and sample variation. These fac- 
tors will also have an impact on expres- 
sion profiles of diseased tissue. 

Patient Variation: Ethnicity, 
Sex, Age, Genetic Background, 
Disease States 

The ethnicity, sex, age, and genetic back- 
ground of a patient are likely to affect 
the gene expression profiles of many tis- 
sues to varying extents.* 9 A simple ex- 
ample is provided by the expression pro- 
files of genes involved in scalp and body 
hair follicle activity, which can be ex- 
pected to vary over a normal range un- 
der the influence of all of these sources 
of patient variation. The effects of these 
parameters on gene expression are likely 
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Figure 1. Possible Pitfalls in MJcroarray Gene Expression Profiling Analysis 
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to be subtle but pervasive, not fully un- 
derstood at this time, and quite prob- 
lematic for defining normal. 

The presence of disease in a subject 
who is the source of tissue for control 
purposes, presents further potential 
variabilities. For example, there may be 
a significant difference in the conclu- 
sions reached by 2 similar microarray 
expression profiling studies. One may 
compare genes expressed in a pa- 
tient's diseased lung tissue with those 
expressed in normal, nondiseased lung 
tissue from the same patient, and an- 
other may compare genes expressed in 
the same patient's diseased lung tissue 
with those expressed in normal, non- 
diseased lung dssue from a healthy con- 
trol or normal individual. Moreover, it 
is also possible that seemingly unre- 
lated disease states may influence gene 
expression at distant sites. For in- 
stance, the presence of diabetes in 1 of 
2 renal cancer patients may compli- 
cate the direct comparison of renal 
tissues. 



Sample Variation; Proximity 
to Disease, Anatomic Location, 
and Developmental Range 

Yet another complication derives from 
ihe proximity of the normal tissue used 
as a control for.the diseased tissue. Tis- 
sue adjacent to an area of disease may 
not be normal despite absence of evi- 
dence of disease clinically or under the 
light microscope. Normal-appearing tis- 
sue near a tumor could, for example, 
be genotypically altered or exhibit an 
altered gene expression profile. 10 ' 1 * 



Moreover, factors such as the degree of 
disease-associated inflammation may 
have a significant impact on gene ex- 
pression profiles. Other bystander ef- 
fects, epiphenomena, or secondary dis- 
ease processes could all play important 
roles in determining expression pro- 
files within these adjacent, so-called 
normal tissues. These factors must be 
considered in the choice of normal. 

The precise location within a par- 
ticular organ may be another impor- 
tant factor that affects gene expres- 
sion. 9 For example, just as location 
relative to the urethra may influence ex- 
pression profiles in the prostate, 14 skin 
from the nose, back, and palm are cer- 
tain to have different expression pro- 
files as well, despite all being from the 
same organ. Thus, site and specific ana- 
tomic location must also be taken into 
account in a description of normal 

U must also be kept in mind that the 
definition of normal actually repre- 
sents a dynamic state. 14 All tissues, 
which are composed of early and late- 
stage cells, have a normal developmen- 
tal range. For example, normal epithe- 
lium in prostatic ducts ranges from 
atrophic to resting to hyperplastic, and 
each has a unique pattern of gene ex- 
pression. 14 

A 3-dimensional analytic approach is 
a strategy that has been used to address 
some of these concerns about defining 
normal. Cole et al 14 used a 3-dimen- 
sional model to characterize the entire 
prostate gland in their study of gene ex- 
pression profiles in prostate cancer. In 
this study, whole-mount prostactec- 
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torny specimens were divided into trans- 
verse cross sections such that the entire 
prostate gland, including the complete 
spectrum of normal epithelium and tu- 
mor progression, was available for view- 
ing, microdissection, and microarray 
analysis. This method was used to de- 
termine the exact physical relationship 
of the normal ducts, premalignant le- 
sions, and tumors — thus obtaining an 
anatomic framework on which to over- 
lay gene expression data. This tech- 
nique offers several advantages over the 
normal vs lumor comparison. Previous 
studies had used normal epithelium in 
prostatic ducts as a baseline control 
against which to compare and contrast 
tumor gene expression profiles. 15 * 16 How- 
ever, the expression profile of this nor- 
mal epithelium is affected by proximity 
to tumor, location within the gland, and 
developmental state. 11 These factors can 
be better appreciated using a 3-dimen- 
sional approach. 

Disease-Related Variation 

Of course, many of the parameters that 
affect normal expression profiles (pa- 
tient ethnicity, age, sex, and genetic 
background, location within an or- 
gan, and developmental stage) will also 
affect disease expression profiles. 17 "" 
Disease heterogeneity, including sub- 
type, activity, severity, stage, and pre- 
vious as well as current treatments, also 
may have a significant impact on gene 
expression. 15 "" 

Categorizing and subgrouping pa- 
tients on entry into a study may be use- 
ful to control for as many of these fac- 
tors as possible. However, there may be 
problems surrounding attempts to de- 
fine microarray-based categorization on 
ihe basis of another imperfect categori- 
zation system, such as histology, as these 
groups are sometimes arbitrary or in- 
consistently designated. Nevertheless, de- 
termining whether gene expression pro- 
files correlate with existing clinical or 
histological categories can provide new 
insights into the meaning of these cat- 
egories as can new methods of classify- 
ing cancers or other diseases into spe- 
cific diagnostic categories based on their 
gene expression signatures. Several stud- 
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ies have been able to establish expression- 
based criteria (class predictors) for pre- 
existing categories and then use these 
new criteria to categorize new cases (class 
prediction).*" 8 Global profiling may also 
allow the development of new classifi- 
cation systems based on gene expres- 
sion alone (class discovery). 2 *- 10 

Thus, when possible, it will be of 
value to profile a range of normal and 
diseased cell populations from a num- 
ber of patients to distinguish between 
differences in expression that are rel- 
evant to the disease process and those 
reflecting the biological spectrum of the 
normal tissue or that have occurred for 
reasons unrelated to the disease. The 
significance of this distinction is fur- 
ther appreciated when taking into ac- 
count the vast quantity of data gener- 
ated from microarrays and the potential 
for confounding interpretation from the 
inclusion of differential expression un- 
related to disease processes. 

It is worth noting, however, that the 
issues of patient and sample variability 
are not unique to microarray experi- 
ments. In fact, microarray experiments, 
in contrast to classic single-gene experi- 
ments, may actually provide the tools for 
identifying this heterogeneity. For ex- 
ample, DNA microarrays have been used 
to explore physiological variation in gene 
expression on a genomic scale in 60 cell 
lines derived from diverse tumor tis- 
sues. 51 Cluster analysis allows the iden- 
tification of prominent features in gene 
expression patterns that appear to re- 
flect molecular signatures of the tissue 
from which the cells originated. 31 

HETEROGENEOUS CELL 
POPULATIONS 

A further complication encountered with 
expression profiles is that any given tis- 
sue is composed of several cell types, 
members of which are likely to be within 
a spectrum of dynamic functional states. 
For example, a simple punch biopsy of 
the skin may contain keratinocytes, me- 
lanocytes, Langerhans cells, Merkel cells, 
adipocytes, smooth muscle cells of ar- 
rector pili, striated muscle cells of the 
panniculus camosus, blood cells includ- 
ing immune system cells, and cellular el- 



ements of blood vessels, nerves, hair fol- 
licles, sebaceous glands, and sweat 
glands. Moreover, cells from each of these 
populations will be at various stages of 
development and levels of activation, per- 
forming different functions and respond- 
ing to disease processes or treatments in 
different ways and to varying extents. The 
result is a highly heterogeneous sam- 
pling of cells, each expressing a unique 
set of genes. An expression profile gen- 
erated from a microarray study of the 
RNA in such a sample will thus repre- 
sent merely a snapshot of the genes ex- 
pressed by a plethora of cells at a mo- 
ment in time. Such extensive cellular 
heterogeneity complicates the ability to 
draw conclusions about specific pro- 
cesses occurring within a tissue speci- 
men. An illustrative example is pro- 
vided by Stanton et al, n who used 
microarrays to identify genes differen- 
tially expressed during myocardial in- 
farction. The expression profiles they 
studied represented transcripts from cell 
populations as diverse as immune sys- 
tem cells, which migrated to the infarct 
region and are responsible for the in- 
flammatory response, cardiac myocytes 
within the ischemic area undergoing 
apoptosis and necrosis, fibroblasts un- 
dergoing proliferation and participat- 
ing in the formation of scar tissue to re- 
place the infarct, and cardiac myocytes 
undergoing hypertrophy to compen- 
sate for the loss of cells in the infarct 
area." The issue of such cellular hetero- 
geneity was avoided by categorizing the 
differentially expressed genes into func- 
tional categories to look for patterns in- 
dicative of cardiac remodeling without 
attempting to attribute specific tran- 
scripts to specific cell types. For gene ex- 
pression studies involving samples with 
mixed cellular populations, further in- 
vestigation, such as with in situ messen- 
ger RNA (mRNA) hybridization, maybe 
necessary to localize the transcripts be- 
fore conclusions can be drawn about the 
roles of specific genes in specific cell types 
during the disease process. 

Laser Capture Microdissection 

An ingenious but technically delicate 
approach to the study of complex bio- 
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logical samples has become possible 
with the development of laser capture 
microdissection (FIGURE 2) * This tech- 
nique allows for the rapid and accu- 
rate procurement of cells from spe- 
cific areas of tissue under direct 
microscopic visualization, and thus 
makes the molecular genetic analysis 
of defined populations in their native 
tissue environment possible. 15 Sgroi et 
al 36 demonstrated the feasibility of com- 
bining laser capture microdissection 
with high- throughput cDNA arrays. 
They showed that in vivo subpopula- 
tions of malignant cells from multiple 
stages of breast cancer progression 
could be separated from nonmalig- 
nanl populations, and their expres- 
sion profiles could subsequendy be ana- 
lyzed using microarrays. 

The potential is powerful. Specimens 
could be separated into tissue layers; for 
example, separating a skin biopsy into 
epidermis, dermis, and hypodermis. Tis- 
sues could be further differentiated into 
specific structural components, such as 
dermis into blood vessels, adipose, ar- 
rector pilt, and sebaceous glands. Struc- 
tures could be separated into defined cell 
types, such as blood vessels into endo- 
thelial cells, erythrocytes, and lympho- 
cytes. Cell types could even be sepa- 
rated into marker-defined subtypes, such 
as lymphocytes into CD4 and CDS cells. 
Expression profiles from refined and de- 
fined structures and cell types likely 
would be extremely valuable in the study 
of disease. 

Potential aside, there are significant 
limitations to this technology at the pres- 
ent time. The standard protocols for fix- 
ing and embedding tissue samples from 
surgical resections were not designed 
to be compatible with microarray ex- 
periments, with or without laser cap- 
ture microdissection. Typically, tissue 
suspected of being important for diag- 
nosis and staging is processed through 
aldehyde-based fixatives, such as for- 
malin, which damage mRNA integ- 
rity." If frozen tissue is available, mRNA 
can be recovered and studied from dis- 
sected cell populations. However, fro- 
zen tissue sections are technically dif- 
ficult to prepare, the histology is often 



Figure 2. Laser Capture Microdissection 



Slide 




A, Tissue sections are processed tnd placed on a microscope slide under a thin, transparent the/mopbsHc fibn 
which H attached to a movable cap. Visualizing the tissue mlcroscopfcally, a short-duration, focused pulse from 
tn Infrared carbon-dloxlde laser is used to activate and melt the film to selectively adhere celts within targeted 
areas of Interest B. When the cap b lifted, the film, with selected ceOs stilt bound, Is removed from the tissue 
section for further processing U> retrieve cellular materials (eg, DNA, RNA, proteins). 



severely compromised, and the tissue 
available may contain only a Limited 
portion of the lesion. 14 

Moreover, the sample amounts gen- 
erated from laser capture microdissec- 
tion can be small, even as minis cuie as 
a single cell* Consequently, the yields 
of RNA are low. Arrays have a thresh- 
old for the quantity of molecular start- 
ing material: at least 5 to 15 ug for oli- 
gonucleotide arrays and between 2 and 
100 ug for cDNA arrays, depending on 
the manufacturer, the source of the 
RNA, and the use of signal amplifica- 
tion. 1 ^ 0 Studies that have successfully 
integrated laser capture microdissec- 
tion with microarray technology have 
used samples of approximately 1 X 10* 
to 1 X 10 5 cells with 95% to 98% ho- 
mogeneity as determined by micro- 
scopic visualization If needed, am- 
plification techniques may be used to 
generate sufficient genetic material for 
microarray hybridizations. 11 Laser cap- 
ture microdissection is an intriguing 
technology, but time will tell whether 
its potential is realized. 

Although some biological issues re- 
lated to gene expression may be com- 
plicated by the presence of heterog- 
eneous cell populations in studied 
samples, it is also true that some bio- 
logical conditions can be understood 
only in the context of these heterog- 
eneous cell populations. The nature of 
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global gene expression experiments is 
to uncover differences between 2 bio- 
logical samples, including those differ- 
ences based on diverse cell popula- 
tions. For example, to appreciate a 
disease that is characterized by an in- 
flammatory infiltrate, it must be un- 
derstood that the inflammatory infil- 
trate is part of the disease and is part 
of the difference between diseased and 
nondiseased tissue. Thus, the isola- 
tion of specific cell populations for 
study is not necessarily required or even 
desirable in all instances. 

MAKING MICROARRAYS 
COMPARABLE 

Ideally, rnicroarray experiments should 
be comparable both within and be- 
tween laboratory or manufacturing sys- 
tems, but obtaining consistent and com- 
parable data is a critical challenge for 
microarray-based expression analysis. 
Major sources for the observed vari- 
ability of microarray data include the 
normal physiological gene expression 
variations in different samples and the 
noise introduced in the microarray as- 
say process/ 1 

Physiological Gene 
Expression Variation 

Inextricably linked to the issues of pa- 
tient and sample variability and tissue 
heterogeneity discussed above, is the 
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problem of normal gene expression 
variations and how to distinguish these 
variations from significant disease- 
associated changes. Few studies have 
systematically investigated physiologi- 
cal expression changes, but data from in 
situ hybridizations suggest that normal 
variance for many tightly regulated tis- 
sue-specific genes can be within 20% to 
30%. 41 However, there can be as much 
as 2- to 4-fold random fluctuations for 
many genes in yeast. 0 - 14 Affymetrix 
(Santa Clara, Calif) guidelines have sug- 
gested that for most of the "housekeep- 
ing" genes in human tissues, which are 
likely to be less tightly regulated, differ- 
ences of less than 4-fold are probably not 
biologically significant/ 3 Conse- 
quently, a significant portion of micro- 
array data variability for high- or me- 
dium-abundance mRNAs may simply be 
due to normal expression variations. Sev- 
eral previous studies have used arbi- 
trary 2-fold change criteria to define sig- 
nificant expression change. 16 However, 
the 2-fold threshold has been shown to 
be statistically invalid even for dupli- 
cate experiments. 16 In a recent study that 
used cDNA microarrays to profile gene 
expression in samples of normal skin 
from breast-reduction surgeries, 71 of 
4400 genes were found to demonstrate 
variability in expression greater than 2 
SDs from die mean of each gene. 17 These 
included genes coding for transport pro- 
teins, gene transcription, cell-signaling 
proteins, and cell-surface proteins. Thus, 
physiological variation should be con- 
sidered in the analysis and interpreta- 
tion of microarray data. More stringent 
criteria for defining significant expres- 
sion change may be useful. 

Noise In the Microarray 
Assay Process 

For the tightly regulated (mostly low 
abundance) mRNA species, inconsis- 
tencies introduced at any stage of the 
microarray-based assay process may 
play a major role in data variability. 12 
Due to the miniaturization and the large 
number of genes involved, it is diffi- 
cult to maintain consistent processing 
conditions for each sequence across 
multiple assays, and obtaining accu- 
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rate absolute signals is unlikely. 42 Noise 
may be introduced by slide heteroge- 
neities, printing irregularities (eg, 
pin-to-pin variations), and spotting 
volume fluctuations. 18 Some of the sys- 
tematic variations may be reduced by 
the inclusion of controls, but random 
fluctuation at various manufacturing 
stages cannot be completely con- 
trolled and can accumulate quickly in 
a complicated assay. 46 

In certain microarray systems, 2 
samples are competitively hybridized to 
1 array using different fluors for label- 
ing. In other systems, there is only single- 
sample hybridization. A 2-color system 
might be expected to be more reliable 
since variations in spot size or amount 
of cDNA probe on the chip should not 
affect the signal ratio (both signals are de- 
rived from the same spot). However, this 
only holds true if signals are well above 
the background in both detection chan- 
nels. 41 In fact, the signal level for most 
of the tightly regulated genes will likely 
be close to the background level. 42 In ad- 
dition, background level on a slide can 
also vary significantly from spot to spot 
due to factors such as unevenness in slide 
surface properties, dust contamination, 
and incomplete washing, leading to high 
levels of signal variability for lowt 
abundance mRNA species even in 
2-color systems. 42 

The high levels of variability of mi- 
croarray data also mean that subtle 
changes in experimental conditions 
may significantly alter the results, mak- 
ing it difficult for separate laborato- 
ries to compare experimental data. In 
addition, the lack of standard con- 
trols, the predominant use of relative 
signals (ratios), and the adoption of in- 
compatible data formats contribute to 
poor comparability between studies 12 
Despite the hard-wired variability in- 
troduced by chip manufacturing con- 
ditions, most of the published studies 
to date using microarray-based expres- 
sion analysis include only limited num- 
bers of replicates. 49 In fact, many stud- 
ies conduct the experiment only once. 
Considering the potential sources of as- 
say variation, the need for sufficiently 
replicated studies is underscored. 19 



Mlcraarray Data Normallutfon 

Because of variability of microarray data 
for single sample arrays and for fur- 
ther analysis of 2-color system arrays, 
each must be brought into the same 
scale to compare 2 or more arrays. How 
to perform this normalization of gene 
expression levels across multiple ar- 
rays, thus removing systematic varia- 
tion between the arrays and rendering 
different experiments comparable, re- 
mains an issue that is not yet fully re- 
solved. 10 Many of the early microarray 
studies in the literature simply ig- 
nored this issue. A more statistically rig- 
orous approach is needed. 

One difficulty has been that leading 
microarray manufacturers have not 
published statistical error models for 
their products. Thus, users are un- 
clear how much to adjust data for varia- 
tions in spot intensity, hybridization 
patterns, and intensity measurement 
sensitivity. Software does exist to al- 
low for array-to-array comparisons by 
using a scaling factor to normalize gene 
expression patterns across arrays. How- 
ever, in general, these algorithms as- 
sume that intensity differences be- 
tween arrays are linearly related with 
a zero y-intercept. 51 This assumption al- 
lows software to trim the tails off dis- 
tributions of expression from differ- 
ent arrays at statistical cutoffs and then 
simply move the distributions along an 
axis to a common level to provide com- 
parisons. However, this linear relation- 
ship often does not hold true. 31 When 
the average expression level of 1 array 
is higher than that of a second array, a 
longer tail will be trimmed from the sec- 
ond array. Thus, a greater number of 
genes from the first array will be 
counted as being expressed because 
their expression level is above the sta- 
tistical cutoff point. In this case, the 2 
arrays cannot be considered compa- 
rable. 

Although bioinformatic software has 
recently been developed that offers 
more statistically robust normaliza- 
tion, the cost of these commercially 
available programs (combined with the 
already expensive microarrays) has 
been prohibitive for many research- 
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ers. 50 Standardization of these pro- 
cesses awaits the development of im- 
proved methods of normalization 
leading to valid statistical models widely 
available to all researchers. 

To this end, Schadt et al 5> have de- 
veloped a standard nonlinear curve 
technique for normalizing the data in 
arrays that do not demonstrate a lin- 
ear relationship between data sets, this 
model performs well when the 2 
samples being compared demonstrate 
a low number of differentially ex- 
pressed genes. However, when expres- 
sion profiles of 2 samples vary to a 
greater extent, Schadt et al M recom- 
mend a rank-selection method. Using 
this method, genes expressed on an ar- 
ray are ranked from highest to lowest 
level of expression. Then, for the ar- 
ray expressing a greater number of 
genes, the genes with the lowest ex- 
pression levels are removed from the list 
until the 2 arrays Hst a comparable num- 
ber of expressed genes'Thls type of 
rank-selection method has gained sup- 
port from other groups, but it too has 
limitations. 90 Removing low- expres- 
sion level data points restricts the study 
to the more extreme and easily de- 
tected entities, a technique that blunts 
the genomic-scale potential of micro- 
array technology. 

Efforts continue to improve compa- 
rability between arrays. Jones 30 re- 
cently applied a statistical model to nor- 
malize spotted cDNA array data that 
takes into account not only the differ- 
ences in numbers of genes expressed be- 
tween arrays, but also the interarray 
variations in fluorescent dye intensity 
and mechanical error occurring in the 
printing process. Nevertheless, the is- 
sue of how to properly normalize ar- 
ray data has not been settled. Research- 
en must continue to demand statistical 
rigor in their comparisons before they 
can believe the mathematical results of 
their data. 

LOGISTICAL LOGJAM 

Microarrays deliver massive amounts of 
data on lens of thousands of genes. The 
result is an immense quantity of bio- 
logical information that must be ana- 
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lyzed, presented, and archived in a 
meaningful way. 

Data Analysis 

In human studies, the number of hy- 
bridizations that can be performed for 
any set of experimental conditions is of- 
ten restricted by the limited number of 
obtainable tissue samples and by the ex- 
pense of arrays. Restricted numbers of 
hybridizations for each experiment 
hamper the ability to assess the bio- 
logical significance of variation within 
or between given sets of conditions. 
Thus, for the assessment of thousands 
of genes in a setting of limited hybrid- 
izations, the importance of reliable and 
sophisticated algorithms for data analy- 
sis becomes amplified. 31 

A logical beginning is to examine the 
extremes, that is, genes with signifi- 
cant differential expression in indi- 
vidual samples. For example, a com- 
parison of 2 samples can be visualized 
in the form of a simple btvariate scat- 
terplot in which the expression profile 
of 1 sample (x-axis) is plotted against 
that of the second sample (y-axis). The 
distribution pattern generally demon- 
strates that the expression ratios clus- 
ter around the line in which x is equal 
to y (indicating comparable levels), with 
individual genes falling varying dis- 
tances from this line. Additional lines can 
be placed on the scatterplot to repre- 
sent various fold changes of expres- 
sion. Data points that fall above or 
below these lines represent genes ex- 
hibiting expression ratios greater or less 
than the specified fold change. Thus, one 
can begin by examining those genes that 
demonstrate a 10-fold or greater change 
in expression level. To expand the num- 
ber of genes under investigation, one can 
examine genes that demonstrate a 5-fold 
or greater change, or a 3- fold or greater 
change, and so forth. Many studies de- 
fine a 2- fold or greater change in ex- 
pression level to represent significant dif- 
ferential expression. The 2-fold 
threshold, however, as noted above, has 
been shown to be statistically invalid/ 6 
Although this simple technique can be 
efficient and effective for focusing on ex- 
panding sets of differentially expressed 
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sequences, again, such an analysis does 
not take advantage of the full potential 
of genome-scale experiments to en- 
hance our comprehension of cellular bi- 
ology that would be provided by an in- 
clusive analysis of the entire repertoire 
of transcripts in a cell as it goes through 
a biological process. 52 A more holistic ap- 
proach, which allows the deciphering of 
patterns from the entire data set, is 
needed. 

Data Organization 
and Presentation 

Statistical algorithms can be applied to 
detect and extract patterns within pro- 
filing data. It is a basic assumption of 
many gene expression studies that 
knowledge of where and when a gene 
is expressed provides information about 
the function of the gene. Therefore, an 
important beginning is to organize 
genes on the basis of similarities in their 
expression profiles." However, even 
this basic tenet deserves critical con- 
sideration. Similarity of gene expres- 
sion profile does not mandate similar- 
ity of function or mechanistic pathway, 
and it maybe purely coincidental. Nev- 
ertheless, the idea of clustering genes 
on the basis of their expression pat- 
terns is well established and cluster 
analysis has become the most widely 
used statistical technique applied to 
large-scale gene expression data. 51 

Although various cluster methods can 
usefully organize tables of gene expres- 
sion measurements, the resulting or- 
dered but still massive collection of num- 
bers remains difficult to assimilate. Thus, 
another important component of ge- 
nome-wide expression data explora- 
tion is the development of powerful data 
visualization methods and tools. Ap- 
proaches have been developed that pre- 
sent clustering results in simple graphi- 
cal displays such as dendrograms, which 
represent relationships among genes by 
a tree whose branch lengths reflect the 
degree of similarity in expression be- 
tween the genes. Similarity is mathemati- 
cally defined.*' The computed trees can 
be used to order genes in the original data 
table such that genes or groups of genes 
with similar expression level patterns are 
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Figure 3. Real-Time Polymerase Chain 
Reaction (PCR) 



A. Complementary DNA synthesis from source mes- 
senger or total RNA proceeds as with traditional re- 
verse transcriptase PCR. B, A dual-labeled fluoro- 
genlc probe (with a higher melting point than the PCR 
primers used for extension) h annealed to the target 
sequence between the forward and reverse PCR prim- 
ers. A reporter (R) ftuoroehrome (usually 6-carboxy- 
fiuorosceln) Is attached to the 5' end of the probe and 
a quencher (Q) molecule (usually 6-carboxy- 
tetramethyl-rhodamine) is attached to the 3' end. 
C, As Taq polymerase extends In the 5' to 3* direc- 
tion, the duaMabded probe begins to be displaced from 
the target sequence. D. As the Taq polymerase con- 
Hnues to extend, the 5' to 3' endonudease activity 
deaves the reporter (R) molecule from the probe se- 
quence such (hat Its emission spectra (51 8 nm) are no 
longer quenched by the second (Q) fluorescent dye. 
Fluorescence Is measured continuously throughout the 
PCR amplification in real time and ts proportional to 
the amount of PCR product generated In each cyde. 



placed adjacently. Clustering methods 
can also be combined with representa- 
tion of each data point with a color that 
quantitatively and qualitatively reflects 
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the original experimental observa- 
tions. 31 Visual assimilation is then more 
intuitive. 

Data Archiving and Mining 

Ultimately, successful interpretation of 
gene profiling studies is likely to be de- 
pendent on the integration of experi- 
mental data with external information 
resources. As multiple experiments in- 
volving multiple cell types and tissues 
from multiple laboratory groups accu- 
mulate, data archiving may well be- 
come the watershed issue. Ideally, all 
data, in a suitably standardized form, 
would be freely accessible in the pub- 
lic domain. Even assuming a willing- 
ness, to share the data, such Utopian 
goals would require a user-friendly and 
powerful database system and stan- 
dardization of correction and normal- 
ization procedures such that data points 
from various projects become compa- 
rable." The National Center for Bio- 
technology Information Entrez sys- 
tem (http://www.ncbi.nlrn.nlh.gov 
/Entrez/) does provide useful data in this 
regard, but current databases may be 
limited in scope or computability." A 
major focus of infrastructure develop- 
ment to support genomic-scale expres- 
sion studies will need to be in the area 
of electronic biological pathway data- 
bases and resources. 

CONFIRMATIONAL STUDIES 

The development of more sophisti- 
cated analytical algorithms and data- 
bases will help lend credence to the bio- 
logical significance of differential gene 
expression determined by microarray 
analysis, in the meantime, several stud- 
ies have begun to examine the sensi- 
tivity and specificity of microarray- 
based experiments. Sensitivity, defined 
as the minimum reproducible signal de- 
tected by a given array scanning sys- 
tem, has been reported for microar- 
rays to be approximately 10 mRNA 
copies per cell, which is slightly infe- 
rior to the sensitivity of Northern blot 
analyses. 11 * 7 Specificity studies showed 
that for a given probe any nontarget 
transcripts with more than 75% se- 
quence similarity may show cross- 



hybridization. 36 The problem of clone 
misidentiflcanon and the need for clone 
confirmation have also been ad- 
dressed." One study found that of 1 189 
bacterial stock cultures, only 62.2% 
were uncontaminated and contained 
cDNA inserts that had significant se- 
quence identity with published data for 
the ordered clones.* 9 Thus, the use of 
sequence-verified clones for cDNA mi- 
croarray construction is warranted. 

Additionally, potential gene candi- 
dates can be assessed for relevance to 
disease using parallel technologies. Sev- 
eral such alternative platforms have 
been used to bolster the importance of 
specific sequences first suggested in 
gene chip comparisons including (1) 
methods at the RNA level, (2) meth- 
ods at the protein level, and (3) func- 
tional studies. 

Methods at the RNA Level 

Reverse transcriptase polymerase chain 
reaction (RT-PCR) is a method often 
used to verify microarray data. Al- 
though RT-PCR is not well suited to 
quantitation, the relative technologi- 
cal ease of this assay and the ability to 
rapidly monitor multiple samples make 
it a useful technology.* 0 * 61 Hybridiza- 
tion data can be verified and multiple 
putative markers can be screened in a 
short period. 

Several other studies have used real- 
time quantitative RT-PCR (TaqMan, 
PE Applied Biosystems, Foster City, 
Calif). 15 - 62 Real-time PCR is a tech- 
nique that increases the quantitative 
ability of RT-PCR by providing accu- 
rate and reproducible information 
on RNA copy number (Figure 3). in 
this method, a fluorogenic probe 
(labeled at the 5' end with a reporter 
fluorochrome and at the 3' end 
with a quencher fluorochrome) is 
annealed to 1 strand of the target 
cDNA sequence between the forward 
and reverse PCR primers. As Taq poly- 
merase extends the forward primer, its 
intrinsic 5' to 3' nuclease activity dis- 
places and degrades the dual-labeled 
probe, releasing the reporter fluoro- 
chrome from the quencher label and 
allowing the detection of a fluorescent 
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signal that is proportional to the 
amount of PCR product generated in 
each cycle.* 11 

Northern blot analysis is also com- 
monly used as a confirmational tech- 
nique, as it is a standard specific and 
semiquantitative method. 15 -* 7,61 For 
mRNA expressed at moderate-to-high 
levels, and for which cDNA probes are 
available, Northern blot analysis works 
well, but it is not well suited for low- 
copy mRNA. 64 * 5 Furthermore, only a 
small number of genes can be ana- 
lyzed with this conventional method. 

Methods at the Protein Level 

DNA micro array technology is limited to 
the study of gene expression at the 
mRNA level. However, it has been es- 
tablished that mRNA levels do not nec- 
essarily correlate with protein levels. 
Moreover, the level of expression or even 
presence of a protein is not tightly linked 
to physiological consequences. An in- 
vestigation conducted by Winzeler et al,** 
for example, provides a cautionary tale. 
Their study demonstrated that genes up- 
regulated in yeast growing in minimal 
medium did not prove to be more im- 
portant for growth than genes that were 
not upregulated." They found only 2 of 
8 genes required for yeast growth in 
minimal medium to be induced. The les- 
son to be learned is that genes that are 
not differentially expressed may be of 
equal functional importance in disease 
states compared with those that are. 

Furthermore, the regulation of some 
genes maybe at the translational rather 
than the transcriptional level, which 
would preclude detection by DNA mi- 
croarrays. Posttranslational modifica- 
tion of proteins is also an important 
mode of regulation that cannot be de- 
tected by DNA microarrays. Protein ac- 
tivity, particularly receptor activity, is 
heavily dependent on phosphoryla- 
tion, for example. DNA and mRN A re- 
veal nothing about whether a given pro- 
tein is active, and can be deceptive when 
used to speculate about quantities of 
proteins. It has been demonstrated thai 
the correlation between mRNA and pro- 
tein abundance is less than 0.5, 67 em- 
phasizing that ideally, mRNA expres- 
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sion studies should be accompanied by 
analyses at the protein level." Radio- 
immunoassay and immu no histochem- 
istry have been used in a number of 
studies. 13 * 8 * 9 These techniques, how- 
ever, are not well suited to detecting low 
levels of expression, and they require 
the availability of an antibody specific 
for the protein to be studied. 

The field of proteomlcs, the large- 
scale parallel analysis of the proteins that 
are present in a cell, is developing rap- 
idly, but has problems of its own. Pro- 
teins vary in abundance by many orders 
of magnitude within a given cell, and 
there is no PCR equivalent for the ampli- 
fication of proteins. Moreover, proteins 
fold in many known (and unknown) 
ways that affect their function. The fea- 
sibility of the microarray analysis of pro- 
teins has begun to be explored . Antibod- 
ies attached to microarrays can be used 
to bind to and quantitatively detect pro- 
teins that have been tagged with fluo- 
rescent dyes. 70 Skeptics doubt the plau- 
sibility of identifying thousands of 
unknown proteins in this manner. 70 The 
diverse chemistry of various proteins 
poses serious difficulties, and it will be 
challenging to find antibodies for every 
protein. Thus, although it is important 
to incorporate protein analyses into 
expression profiling studies, current plat- 
forms are technically limiting. 

Functional Studies 

Confirming the role of a gene initially 
identified in a microarray experiment in 
animal models with transgene or knock- 
out studies provides a particularly pow- 
erful alternative platform. Transcript 
function, rather than mere presence, is 
addressed. However, this approach is ill- 
suited for high-throughput conditions. 
It may be ideal for an in-depth investi- 
gation of 1 or 2 genes of interest, but it 
is not practical for confirming large quan- 
tities of profiling data. 

Confirmational studies are useful to 
corroborate the biological signifi- 
cance of differential gene expression de- 
termined by microarray analysis. While 
improved databases and more reliable 
statistical models will help to lend 
greater authority to array data, alter- 



native platforms can be used to assess 
the relevance of genes first identified by 
array comparisons. It should be real- 
ized, however, that the alternative tech- 
nologies are not intended for large- 
scale analyses. Realistically, only 
selected sequences from the array data 
can be confirmed with other plat- 
forms in the short-term, a retreat from 
the initial purpose of the genome- 
scale investigation by microarray. 

CONCLUSIONS 

Microarrays can be expected to prove ex- 
tremely valuable as tools for the study of 
the genetic basis of complex diseases. The 
ability to measure expression profiles 
across entire genomes provides a level of 
information not previously attainable. Al- 
though complicated issues must be re- 
solved, the potential payoff is big. Mi- 
croarrays make it possible to investigate 
differential gene expression in normal vs 
diseased tissue, in treated vs nontreated 
tissue, and in different stages during the 
natural course of a disease, all on a ge- 
nomic scale. Gene expression profiles 
may help to unlock the molecular basis 
of phenotype, response to treatment, and 
heterogeneity of disease. They may also 
help to define patterns of expression that 
will aid in diagnosis as well as define sus- 
ceptibility loci that may lead to the iden- 
tification of individuals at risk. Finally, 
as specific genes are identified and their 
functional roles in the development and 
course of disease are characterized, new 
targets for therapy should be identified. 

Despite the problems of defining nor- 
mal, understanding tissue heterogene- 
ity, making arrays comparable, analyz- 
ing and archiving massive quantities of 
data, and performing confirmational 
studies in alternative platforms, expres- 
sion profiling with microarrays stands 
as a truly revolutionary technology. As 
we continue to delve into the possibili- 
ties, we will surely progress in our un- 
derstanding of current issues and com- 
plications. No doubt the ride on the 
high-throughput highway will be ex- 
hilarating. 

Author Contribution: Ms King and Or Slnha both con- 
tributed to the conceptualization and writing of this 
article. 
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Abstract Novel fled powerful technologies each as DHA 
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*tht eiprcssJon lewis of adnata geses Tlp-***'**int^f both, to 
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rcvofatfeabe biology^ Id eartfcAtar 1b the to af BntBBtdar 
medicine os they arc ciptiled to reveal gent ngubttoa nob 
UTofred to {bens progrtsam aj wcO os to ptapoufl pateatfel 
tajgicj far drug dbuivuy and ^ |, f wtt ** < * Hare, wo octttv At 
uuiiui stans of these ^T^—^tgEtt and hacjdisjai coast avawa bo 
which they have bceo applied b contort to toenails*) of bhmay 
sptdmtm. © 2000 Federatfea of Ebjopcan Blocferahal Sorf- 
ttlek Pofalbhcd by Elsevier Seiooco fcVV. AS rights reserved. 
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1* fjlttodoctfoo 

U a oow a little more duo 10 yean since the Human 
Genome Project was bunched and during this rcbthety abort 
period of time there have been rcatarkabfe advances is the 
oooitructJon of phytic*! and gcnrUc maps as weQ as in the 
identification of genes associated with human diseases (I J). 
Soon* the total srqumnr of the human genome wiD be deoJ* 
pbrrcd and, hopcfuBy, made available to researchers world* 
wMe for the benefit of mankind. 

Undoubtedly, the Human Genome Project has paved the 
way to the revolution in the tife oetcnoes that wo are eapcri- 
cociog today. Gradually, however, its focus is surfing to shin 
towards functional genomic*, on area of the post-genemic cm 
that deals with the functional analysis of geoe* tad their 
products (see the article by Ooffeau in this issue). Techniques 
of functional genomics Include method! for gene eapmslon 
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profiling 01 the tnuncript (DNA microarrayt |V5); ace also 
article by Braaaa sad Vila to tab bsvc); differential display 
(6); serial anafyea of gene c ap ituh io P-9L o*d proton bveb 
(proteomics) <II0-U|; tec also aitScte by Aadema aad Mara 
in thb cousX os weUju iranigteics (III phage display |l«l 
pTOoeduecs for studying pitJttin^utein toteroctioos ([I5J61; 
see aho articfc .by Legram aad 80% in thb bsnc) aad biois- 
foTmatio P?V 

Among the tecteioocB of functional genomics, both DMA 
nderoamyt aad refercscea therein) aad probomics 1 10- 
12) bold great pfvaibe tot the study of cornptea btafogjcal 
aysasas with appucatiotts in molecular medicine. These novel 
aad powerful gene expression profiling techaiojucs pencil the 
aasjysb of the capieasiun bvcb of thousands of gents stmo> 
tSAeoasJy both in health aad disease. Tbese technologies are 
coiiybiucBtaiy, aDow high a throus^miit v aad in coitibicatioo 
are cipnrifd to generate a vast arooont of gene and protein 
omusiou data that may bad to a better undemanding of the 
rtgubtory events ravoJvcd b oormal aad disease proccasea, la 
addaJon, these technologies offer a systematic approach- for 
searching for effective targets for drug cUseovcry and diagnos- 
tics. b 

Here, we review the current state of DNA mfefoarrays aad 
pvotcomics aad highlight come atudies io whkh they hove 
been applied to coimcrt to tat analysis of biopsy ojmJ iii m ii 

3L DNA ojuxfoaifoya 

The amount of Information that b now bccoxnmg Bvaitabb 
to rcsearehcri in the life adenses b capfnding, aad eveo 
though the data can be stored m oonvenbonal media, aew 
nsethods an being required to analyze targe sets of genes Is 
a Mgb-throughput fashion. For thb purpose the DNA array 
technology was developed. The method makes it pcasibb to 
survey thmnandt of genes in parallel, and has seven) areas of 
appticatieo. Oat b aprcsrion monitoring (IS), in which lac 
transcript levcb of genes arc measured m different physiolog- 
ical coodttibtts both in cultured ccfls aad tissues, to search for 
rcgubtary upresrion patterns. Uaderstaading patttras of ct- 
prcssed genes b czpocttd 10 improve our knowledge of highly 
complex networks that cms communicate in hitherto un- 
known ways, both m .health aad disease. Another area of 
application b pelymorphbro analysis f 19). In thb case, poly- 
morphic regbns of the genome are scanned to search for link* 
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age to diseases, and io reveal disease susceptibility genes and/ 
or inherited disease genes. A similar approach his bees used 
to analyze polymorphic regions of known genes, in particular 
to detenninc whether polymorphisms arc associated with aa 
altered function of the gem product, a fact that may iacrease 
the fuseeptibQiry to disease. Finally, various attempts have 
been made to utilise DNA arrays for sequencing (20). * 

Below we review toe technology cumotiy io use for micro- 
array-geoeratcd gene expression pattcro discovery as veil as 
some app&calioos. Table I provides a set of World Wide Web 
sites that conttia useful additional information (sec also the 
article by Brum* aod Vito in this issue). 

/. /. MirroarrCy techiotogy 

Micro arrays are usually nude by deposition of DNA spots 
do a solid support like a coelcd glass surface, that differs in 
several ways from conventional filter-based supports sash as 
charged oyton and oiuoccttulosc, The Batness of the glass 
surface makes it possible (i) to array molecules hi a psraUel 
fashioo, Co) io miniaturize the procedure and (Ui> to use fluo- 
rescent dyes for detection. There is no diffusion of the applied 
material into the support, thus allowing focusing for laser 
scanning cnierascopy. 

Two cumin procedures have been used to produce DNA 
chips: photolithography as developed and niarfccted prfanarQy 
by Affymctrii loc (Santa CUn. GA, USA) (20,21) and oa> 
chanical ssidding (22). netoUthos/aphy b well known in the 
computer chip industry and utilises an ultraviolet Cght sonros 
that passes through a nssfc that directs in a step-wise manner 
wherc a phoiochcmical reaction <obspnirtcotJde synthesis) 
takes place on a siliconircd glass surface. The mask can be 
pioduced with openings as small as a few osJcrossctcrs allow- 
ing a density of several hundred thousand probes per square 
ototimcter of glass. There b, however, an mbxrent length te- 
ttrtcuoa with this in situ synthesis technology bsnltiag the 
probes to about 25 nucf eotides to length. High-density arrays, 
en the other band, allow the use of multiple probes per gene 
P0]. 

Mechanical gndding methods are based oa ink-Jet or phys- 
ical deposition of the materia) using pins uuuiifacturcd whh 
very high preoaioo. There is direct surface contact and the 
transport of small amounts of liquid makes these systems 
nisceptibk to evaporation and ccnnuxUnation with dust par- 
tides. The gjrtddiog Instruments use en XYZ motion control 
based oa sup engines that can- be controlled with very nigh 
precision. The DNA containing snaterial can be spotted from 
96 or 384 weD plates to glass In predefined patterns. 

The arrayed probe can be ofigos (photolithography and 
gndding) or cDNAs (gridding). The hybridisation reaction 
conditions are quite different in the two coses, and special 
sample preparations am needed to optimally utilise them 
probes. With shorter probes, Lc of 20-50 nucleotides, the 
sample- is fragmented to avoid tertiary ftructurcs and to 
achieve optimal hybridisation (18,20). Polymerase chain reac- 
tion (PCR) amplified probes of 30O-2O00 nucleotides usually 
do not require fragmentation of the sample (23). 

The type of glass used as support, the coating substance, 
the coupling technique, the febelling system, and the fiuores* 
cent labeb used for detection arc all variables that must be 
optimised. A Dumber of coating substnnccs are comouraaOy 
avaitsbte io immobilize DNA to the surface. Two types of 
a tides arc available that use a coating procedure designed 



for printing of amine modified DNA: these include (i) so> 
la ted slides* which contain reactive aldehyde groups that ream 
with ajnioo-groups via a Schiff base formation as the printed 
DNA dries on the surranc of the slide (avsilabfe from Tele- 
chem Inc. CA. USA; Cell Associates loc, TX. USA; for 
coupling chemistry tec bttpfNnw.9n*yilJC Qmtou ti ou ny* 
coupling/) and (o) activated slides prepared by the covaicm 
attachment oft bydrophilic, polymeric amine reactive coating 
to sUaoc hasc-costnd sfides as described by Brier and Hohcisel 
(24) (avaitabte from Surmodics Inc. MS, USA). Amine-modi- 
cod DNA attaches covalently to the activated, porymtric tor- 
Caen. 

Three other types of slides are available that are based on 
more conventional cramobiTicitioTi technology routinely used 
in membrane bnsobilisaiioo: these include Q rtaaised slides 
which carry covalently anarnrrl primary amines on the sur-. 
tans that can form ionic bonds with the phosphate baiAbuua 
at neutral pH. In aavitiea, the rt4tcsJ-based coupling between 
thymidine residues on the DNA and carboni oa the sibyl 
amine of the mbstra te can be iisduocd with UV or hcat<pro- 
vtded by companies fike Tdechcm tee, CA, USA; Sigma 
AMrfch, tttfi, MO. USA; Corcicx toe, NY. USAMB) nitre- 
ocUulos»tesed porytnavcoated slides thai possess ths binding 
and bsunobihmtion properties of nitroccfiuloss that binds 
DNA in a oon-ocmsnst but tncvcrtible enaaxur (Schleicher 
and SehuctQ. and (Hi) porjMysmc<oatcd slides which require 
UV onmrnddng of the DNA (evnimbfe from Sigma AMrfch 
Inc.. HO. USA; sea ako htrpAwigrn fttnforctcdofrbrown/ 
pjmc<olVl^s£dr*»BtinJ). 

U. Quantitmkg tkt Jjfnabfrtm arrays 

A bncar tcspnass that covers two or three orders of mag- 
nituds Is often needed to detect low and high copy number 
transcripts en the same array. Io cases where this b not pos- 
sible it may be cteccssary to scan the chip at duTcrcnt wave* 
lengths, or to emptily the signal with an imiiiuiif sandwich on 
top of the bound sample PS b» the latter case, the first 
sca nnin g is carried out after hybridisation of the es befln d sam- 
ple, and the ceccmd after reaction with the labelled andbodlcs. 

It Is necessary to dnciimrnl the Snearity and reproducibility 
in each step of ths procedure, and s ometim es even from probe 
to probe to obtain iciiabu data. Often, a standard sample ts 
used to oonmaro whh the ctpestaeotal sanrple and this may 
compensate for d^Icrcssses m hybridtcatioo from probe to 
probe. 

2.S. SuBtdsfattatfaa 

Gonsparison of data obtained from mdependent arrays and 
from different laboratories rtquiiu sssatds/ d ts stio c. Both the 
Arfyiuetrix chips and the custom made eDNA chips use differ- 
ent methods for stance rdixattoo. The Affymeuis drips have 
. approximately 20 probes per gene and rttndarcfixauon is ci- 
ther based on the csprcsshsn level of selected suites. Ske actin 
and OAPDH, or on a aettjoj of the global chip Intensity to 
approximately 150 units per gene on the chip, la this way, 
chip data from different caaarunsjito can be compared to each 
other. In our hands, the data obtained with the two standard- 
isation methods cliffer only by approximately 10% (unpub- 
lished observations). 

"The curton>made cDNA or otigo arrays ate require stan- 
dard cm lion, but this is n comples problem. Io general, the 
standard used often rennets the purpose for which the array ' 



/£ CeCt ti aUFCBS Utters 4B (2000 J J-16 



was produced. For example, for expression mooiloring of 
breast cancer cells, a miiture of breast cancer cell tines may 
provide a good standard (261* Today, however, (here is oo 
golden standard thst can be used for all purposes and as a 
mult, it b difficult u> compare data from different laborato- 
ries and often it is necessary to use other technologies such as 
Northern hybridization, leaMinte PCR or uununostaifling to 
validate the signals. A minimum requirement a that labors- 
tones that produce arrays themselves should be able to repro- 
duce data from one chip generation to the next baaed on Che 
analysis of well-defined controls covering different genes and 
etprtssioa levels. In addition, rt is common to use spiking of 
samples with bacterial genes that hybridise to probes, spotted 
for control purpose on the arrays. 

2. d. Sample* for expression monitoring 

The analysis of relatively homogeneous ceQ populations 
(cloned cell lines* yeast, etc) has proven ouch stapler than 
the analysis of tissue biopsies aa the bitter often contain many 
cefl types (ephhcEai endothelial, inflammatory, nerve, onsets, 
and connective tissue ecus) that are present in variable 
amounts. Standardization may require mrcrocUsscctioo of. 
the tissue to isolate specific cell types (27,281. although the 
oumbcr of odb needed for the assay b well above a mi Dion. 
Sampling of specific cell types using laser capture mhrndiismv 
tien (LCM) (79) can be a ttrnf^ronrirrtiirtg task, and given that 
mRNA is prone to degradation toe processing time must he 
kept to a rrtinrrnnrn If only a email am oast of asstcrial is 
available, then a revcac transcription-PCIl step is n c cs s aai y 
for amplification, hot tab adds an additional eanspnearion 
due to the tack of linear amplification of all transcripts, la 
one of our laboratories wc have used prcparaboas of single 
ceil types from tumor biopsies to standardize pooling of sam- 
ples for generating profiles of gene expression at different 
stages of tumor development (manuscript in preparation). 

2 J. Motnfortnatic analysis of expression data 

2.5./. HlerarthitaJ duster algorithm* Hierarchical duster* 
iag algorithms can be divided into two typo: aggjoaacrativc - 
and a rime (Fig. I) |30J. The sggjomcreovc method b a bot- 
tom-up approach, where the algorithm starts with ft separate 
choters (for example 4000 geeea, where oatOQO) and succes- 
sively combinca dusters unul only one b left The dmafvc 
method, on the other hand, b a cop-down approach starting 
with one duster and successively splitting dusters to produce 
others. The algorithm used to form the dusters must also be 
denned; two widely used and simple algorithms arc the single 
linkage and average linkage methods, respectively. Single fink- 
age, also called nearest neighbor, defines the distance between 
two dusters as the minimum distance over aD pairs of clusters. 
Average linkage takes in consideration the average distance 
ova ell (30). 

A distance matrix must be calculated before clustering b 
performed, and il b the distances between the obtained gene 
expression profiles that arc used to form the actual dusters 
(30). Observations with small distsnon arc grouped together 
as described above. The two most commonly used distance 
measurements are tht Eudidean distance and the Pearson 
correlation coefficient. 




Pig t. Two main types of ehsttretg algorithms, the bkrerchicBl 
cad tht imKUemichiBaJ ajjorilhan. 



Pearson correlation coefficient : 



Euclidean distance : ED "«=» ^{X*-Xgk) % 



jr<» y the cacajuicencsl for the Ath variable en sampling unit L 
*>* b the variance of the Alb variable (30,31). 

2.XJ. Nosr-hterarxhUal thsster algorithms. In ooD-bicr* 
archical duster enarysb b b asnsmod that the data can be 
divided into a certain number of dusters and that they an 
wdl separated The advantage of thb approach b that large 
dsta sets can be clustered much faster than by using hkrarchi* 
cal ctastcTing because a tower gumbo? of clusters b asignod. 
The most umuuuu earthed for cma-euerarctucaJ cluster anal- 
ysb b A^eerans. However, a method termed celf«orgaaceed 
maps (SCM) has recently been appbed to cxcffcssion data 
grnrratrd rrom DMA chip arrays (32»33|. Tike *-meanj meth- 
od (M) identifies ft points that function as cluster centers. 
Each data point b then assigned to one of these centers in a 
way that mrmrntrrs the sum of the dbtamrt between aD 
points and their eastern, Thus, it b the distribution of points 
that decides the value of the means. One drawback of tab 
method b thai -a specific oumbcr of dusters b asstpscd, as 
the number of ctustcra b usually isbAsbtw* in large data 
sets. SOM b tbaflsr to the k-esemsa approach, hot it has a 
gcotttctncal configuration and the number of uodii prr^fintr 
thb configuration. Initially, data points arc napped onto the 
geometrical ccnB^urstfcn. When diutcring the data with 
SOM the po tiUon of n node migrates to fit the data points 
during ruccesarve Itratfoni 0233). 

J.JUL 5usf7vttr^c4cxr0le*/ttm Common to the clustering 
rrtrthods in which array data ant used b that they arc unsu- 
pervised. La, so predefined references are knows*. An aftcroa* 
tivc option b to ODoetruct a supcrvfaod classification method 
that remjircs at least two rcfcrcneca. For cancer dst sf fir a d o n . 
for example, the references could be the gene expression pro- 
files from normal and mvasrve tumor tissue. In this particular 
case, a vector representing gene expression over n genes ens bo 
used to describe each tissue (35,36). In a recant article by 
Oolub and co-workers (16), the authors analyned 6817 genes 
using 38 bone marrow samples. Based on these 38 samples 
they found that o vector based on 10 and 200 genes was. 
sufficient to distinguish bcrweeo acute myeloid leukctnia and' 
acute lymphoblastic leukemia. Thus, the authors were abte to 
construct a cancer classifier based on a low ounsber of gcaca. 
Usng a similar vector-based classifier approach. Brown and 
o n worse n pTJ analysed 2467 yeast genes in 79 dificrenl ex* 
periments and were able to classify genes into functional cat* 
cgories based on the expression data from DNA chip arrays. 
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2.6. Applications oforropt for expression monitoring 

One of Che main trtas for amy application is in the rimuW 
uneous monitoring of ibouuncU of transcripts io different 
biological sciuogs. The approach a being used to identify 
new networks and to' understand patterns of a pressed genes. 
A number of articles have been published using the amy 
technology aiming at identifying discasc*iwcriairrl alterations 
in humans. For this purpose clinical samples, human cell lines, 
and to a few cases animal models of human disease have been 
used. 

74./. Gene expression profiling of tissue biopsies, Cttl fat* 
and animal models of disease. 2.6 J. I. Tana biopsies. Ocas 
oprcssioo studies oa clinical samples have been peifonocd m 
breast and colon caaoer (26,11), as well as io athcrosderosis 
P9.4C). Genes of presumably* known functions have been 
identified and linked to the diseases; most of these data are 
now availnblc in the Internet (Table I), to a study of breast 
csoccr that used dinical specimens and cell lines* with as 
array containing approxuaatrry 5000 genes. Ferou and cot- 
leagues (261 identified a proliferation related gene duster m 
the cell line that was unregulated in the more agpesahre clin- 
ical breast tumor spmtnrns (26). In a similar study of colon 
cancer based en the Asymetrix arrays, 41 BSTs homologous 
to ribosomal proteins were found to be unregulated fa the 
tumor tissue (36). In this study, • muscle indes was used for 
ennccung lor the svmnxl cennponema as this showed a high 
index m the normal biopsica. Both the breast and the ootoo 
cancer studies lacked n correlation between gene capftaiioa 
tevels in the cell lines and io the dmfcal speewna. 

Hunan athcroscfrrosu tcsioos from arteria carats sampled 
from patients undergoing surgery have also been analyzed 
usiag the Aftrnsetrii capressioo arrays (40). One importani 
fladmg was a fivefold uprefjulation of the carry growth re- 
sponse gcoc Egr-I, a DNA bindiag protein that mfcienccs the 
transcription of grots eriooding growth factors, cytokines, ad- 
hesioo nmteeulet* and protrioa related to coagulation. This 
finding was corroborated by immonohbtochemistry and su»» 
mal experjuaeata, and identified Egr-I «* a possibto target far 
therapeutic intervention, 

2.6,1.2. CoH hxex. Several studies based on expression 
monitoring have been performed in human ceil lines fa areas 
as diverse aj cancer (41), ophthelnwlogy (4% sad the central 
nervous system (4JJ- Amya have also been used to study the 
effect of cytokines flnterfercms) (44], cytocaegaloviAis wfectien 
(45), and oncogene traaafection (46] on the overall patterns of 
gene expression. In one study, human foreskin fibroblasts 
were selected with human cytomegalovirus and the expression 
of approximately 6000 genes was monitored for up to 24 h 
{45} A total of 258 genes was found to be vprfgaXated more 
than four-fold these toctuded HLA-E (uprcgulated six-fold). 



a protein that protects against cytotoxic T-rynpbocytcs; RO/ 
SSA (52 kOa protein eoRNA; uprrguhtod 12-foldX a com- 
monly targeted autoaatigxn. u w*U as several components of 
the pathway that produce prostagUndio £2 from asacnidontc 
acid. Io another study, it was shown thai treatsnent of the 
human fi biuia j wnu cei) line HT 1080 with IFNs o. P and 
7 resulted in the uprcgulxtion of novel genes implicated in 
apoptosis (RAP46, Bag^l, scrambtasc), while genes like 
IGF-2 and ZoTO wen strongly downicgulated (44). 

The effect on gene cxprcssten of the fusion oncogene PAXJ- 
FKHR trans/acted Into N1H JTJ celts has also been studied 
usiag nitntmys (461. U «■» shown that the fusion gene, but 
not the wOd type control was able to activate a myogenic 
transcription program that included induction of a number 
of transcription factors such as MyoD, myogenic, Sis I, and 
Slug. 

2.6.13. Animal moaW tjstems. A common approach to 
the study of human diseases to to use animal modd systems. 
Ibis has been done for a range of diseases Including eceapha- 
famyelopathy, lysnphoma, renal tubuli (23), and hmg fibrosis 
(47). In the latter case, Atynxtrix chips were probed with 
pooted aampla obtained from groups of set co fedune 

variations and coat. Transcripts that were significantly altered 
in hug fihrosb (47) inrrorlrd extracellular matrix and inflam- 
matory response genes. Analysis of dusters oontainmg these 
genes In a time ccanae experiment with the SPOTFTRB PRO 
2.0 program revealed cSScren t temporal patterns of captesadoa 
that further subdivided these genca. 

2.6J. Toxicology ana* ohtg testing. From a toxioologtcal 
point of view, there are great expectations for expicasioo mon- 
itoring as the effect of drugs, both expected and unfor e see n 
aide effects, can be. monitored in anatvJs and eventually in 
humans (48*49). One proWem that has arisen from these 
stud lea, even In quite shank modd systems, Is the often* 
wncxptainrd changes b transcript IcveU observed. These 
changes arc sjufte reproducible, ssggmtmg a much more com* 
plea relationship among gene products than pr ev i o usl y 
thought (45). 

The combination of gene expression monitoring and testing 
of drugs oa cell tines and is animal models holds great prom- 
ise, A leeeot publication ihowed that the vasuoon fa the liver 
cannmden of genes encoding xesobjotte mitanofmng cn- 
tytnes, ghimthiene regulators, OKA repair carymcsv heat 
shock imneins and housekeeping genes is larger among inott- 
vsduxl ammals than that mtieduced by the amy assay itself 
(49). These studio revealed the uprcsndation of cytochremea 
even at low doses of iVnaphmoftavene treatment and showed 
a good oorrelation brtrwrn the amy and Northern hybridi- 
sation data. A recent screening using cONA nsicroamyt of 60 
human oct) tinea used by the National Cancer Institute for 
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drug discovery, has formed tbe basis for esttbasbing a data- 
base that caa be linked Co gene caprcssioo data and molecular • 
pharmacology (41,50). 

2.61 ifoUatlar ctaxrificatton of diseaitL Qaisficatioo of 
diseases can gcaerally be achieved by class discovery and dan 
prediction |J6). Class discovery refers to the ideotincauoa, 
based on gene caprcssioo, of previously uortcognired tub- 
typo of the disease. IfcJixucaJ follow-up naterial a available 
these classes can be related to signs and symptoms, disease 
course, treatment outcome, and morality. Class prediction 
refers to the ability to assign a particular patient to an already 
defined class* based on molecular esaoiinatiob of the diseased 
tissue or other specimens. 

Two recent examples have demonstrated the power of es- 
pmxieD arrays to classify hematological malignancies. Id oae 
of these studies, leukemias were classified aa A ML and ALU 
respectively, based en SO genes selected from as array carry- 
ing 6817 gents. In this study, 36 out of 38 patients were 
correctly classified, and two were tmocslaio, The SO genes 
used in the array included come that were known to difler 
between AML and ALU as wefl as new markers P6). 

Another study classified eVcel) rysnphotnas into two enoteo- 
ularly distinct classes that reflected different alngca Of fr«otfl 
' differentiation, the sxrminal center-fikc and activated B-ceD 
tike lymphoma groups, respectively. For this study the fym* 
pbochip sniooarray from Research Ornrtfra, which holds 
17 856 gents, was used. Follow-up for 12 years showed a sir 4 
osficaat oUfTerencc in survival among these groups (SI). 

It b envisaged that global swveyi of gent cxsmsslon wiH 
identify marker genes that, may be used to group patients Into 
motonilarry relevant categories; these enarkera arc expected to 
greatly cm prove the precision and power of clinical trials: 

2.7. Comthutans 

There b nn doubt that the massive parallel gene capreasion 
- information generated by tntooarmys wO) have a major fan- 
pact in the discovery and undcraunding of patterns of ex- 
pressed genes. In addition, the technology is o peeled to gen- 
erate novel and effective targets for drug discovery and 
provide, in combination with proteozntca, valuable toob for 
the entire preeeas of drug development and evaluation. 

One of the main challcnssa we foresee in the future wflt be 
to solve the problems posed by the analysis, interpretation 
and aeeess to the targe amount of Worm* lion that will be 
generated. Large studies of ESTs have not been pnblbned yet 
as data analysis requires tbe development of ecw t*omlbr- 
mane tools that can deal with the' huge amount of nxfonna- 
tion that is being created-. These studies arc eapurtrrl to Iden- 
tify new genes of importance to specific biological excesses 
end reveal new regulatory pathways through tbe anarysts of 
the eapression levels of individual ESTs in large numbers of 
samples. 

In the future, coupling of expression monitoring to trans- 
genic animal models may prove to be quite rewarding as the 
global effect of a gene knock-out or knock-in caa be moni- 
tored with both mtcroarrtys and proteomics toots (10). 

3. Prcteomta 

A complementary technology to DNA microarrays for 
monitoring gene capressioa is provided by proteomics, a 
term generally used to encapsulate oil of the technology cur- 



rently available to analyze global pattern of gcoe aprcston 
at the protein level Proteins are frequently the functions! 
molecules and, therefore, the most likely to reflect difTtrcnots 
in gcoe expression. Genes may be present, they may be mu- 
tated, bat they arc not noastarirr transcribed. Some m * »s nv 
grrt arc transcribed but not tranibtcd. aod the number of 
mRNA copies docs not necessarily reflect the number of func- 
tional protein molecules (52). In addition, prouomia ad- 
dresses problems that caonoi be approached by DNA analy- 
sis, eamsty, relative abundanoe of the protein product, post- 
transntmaal modification, subcellular localisation, turnover, 
mteracttoo with other proteins as well as ftinctional aspects. 

The proteomc has been defined by WtJJtms and colleagues 
as the complete set of presents e nc oded by the genome 
and recently* the tern has been broadened to include the set 
of proirini apteesed both in space and tee. There are two 
main appr oa ches to piotcoini ca: one b the ttpression modd 
in which all proteins arc analysed, and the other b the osO 
map ntodct in which only a selected set of proteins, tike com- 
plexes and orgsusenes, ait studied (54). 

The protecflrdc technology b eotnpltsw and comprisca n 
plethora of state of the art techniques to resolve (high reso- 
lution two^imfttriooal gtb)» quajttfratc (phosphoriirager, 
special scennersX identify aacVchatsjcte rim pjotains (mmrose- 
quencing. mass suoUium etryX as weO as to atom (two-dozen* 
sioaal nuljaaylaaide gej cSoctropkorcsis (2D PAGE) dam* 
bases; bTtp:tfbiobaaejosyc^^ htm*/neapajytegM^ 
apmi/kpTOt*tnp£tsBl) eonifriunicatc and enterhnk protein and 
DNA secjuenos and snapping mformanon (biomfornutio) 
(I0,I24J-«1 

The usefulness of the 2D PAGE tncboique for large-scale 
proteondc projects dtpmda very much on the ouinbcs* of pro* 
leins that can be resolved fin a complex protein mixture, for 
example a human oeO. rVoteomc profile data from a few lab- 
omtorte*. Including one of our own, have bjcScatcd tons only 
a fraction of the human gens arc fwttehetf-oo fas a given cell 
type and extensive analysis of whole ceO catrecta, organelles as 
weO as cortiapy purified subcellular fractious, suggest that 
individual cells may not o press more thao 6900 primnry 
translation p i od uc t a 1991. To this number one has to add 

(phosjAorytatton, s^ycosylauon, densethytauon* acetycxtion, 
myrunoybnon, pnlmitoybtion, sulfation, ubsqumation, ctcX 
the latter bemg rather common and entsnjnve in many pro- 
teins [60,61). Thus, so we stand today the 2D PAGE technol- 
ogy is not abb to reserve and depict fas a single gel aD of the 
proteins thought to be present in a nsanmratian ceil 

Currently, there b a great deal of interest fas protnumia as 
applicatrons of this taclmnlogy am expected to reveal gene 
regulation events mvotved in disease piugiisilon aa wefl as 
potential targets for drug dUw»cry aod cUagnostics. More- 
over, the technology b bound to have a great Impact in agri- 
culture* tosicnlogy aod the industry to general. 

J./. Pttlttn ttpBctiam: fAc 2D PAGE technology 

For the past U years, high resolution 2D PAGE baa been 
the technique of choice for analysing the protein composition 
of cells, tissues and fluids* as well as for studying changes in 
global patterns of gene eapression dieted by a wide array of 
effectors (12.62-64). The techoique, which was originally de- 
scribed by O'Farrcfl (65,66) and Kfosc (67), separates proteins 
both in terms of their uoctcctric point (pi) and enoreeutar 
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weight and provides the highest lesotudcm for protein snaJy- 
tis. Usually, one chooses a arodtu'oo of interest, for example 
the addition of serum to non-differentiated human keiatmo- 
crtes. or compare oormai a fid timiuformed cetb (Fig. 2A»B), 
and let we eeus reveal the global pro trio respoose as «0 de- 
lected proteins cos be analyzed both qualitatively (posttnuu- 
latioftal modifications) and quantoUtnreJy (relative ibttodaoot, 
coordinated expression. Fig, 2Q in relation to each other 
QJ>.«| and refercnoa therein; sec also http:0biobaic.da/ 
cp-bin/oelis). 

For many yean the 2D PAGE technology relied oa the use 
of carrier ampholytes (amphoteric compounds) to esabKsh 
the pH gradient, but this technique hae proven to be difficult 
because of the lack of reproduabflity created by uncontrol- 
lable variations io the batches of ampholytes used to generate 
the pH gradients. Lately, however, with the introduction of 
famoobilized pH gradients (IPG*) l«V0L which arc integral 
part of the polracrylamide matrix, it has been possible to 
obtain focusing patterns that' can be easily reproduced by 
the non-expert. IPGs avoid some of the problems associated 



with earner ampholytes tech as cnthodk drift and 
gov allow a higher loading capacity for iniuuuicuaieti 



r runs, 

and provide utticascd charge rcsorutiea when o arrow pH gra- 
dients COJU pH unit/an) arc used &70.7II and references 
there*} In our hands, however, carrier acopbotyut (3.5-10, 
Fig JA) and broad range ITOt (Fig. 3B) resolve similar num- 
ber of ("Qmttomnc labelled pulyucptidti (about 2J00) as 
dhatiated with the ceaaralJoa of whole protem extracts from 
fabsDed human axratfoocytcs ffi\ It has been proposed that 
narrow range, overlapping fPQ gradients viewed tide-by-tide 
nay provide a tohitioo to the problem of resolving and de- 
picting the protcomc of a given cetl type. Recently, however, 
Corthali and co-workers found this solution unrealistic, as ll 
wiO require the running of a huge number of gab (7I| 

Very basis pofypeptrdes have proven difficult to resolve, 
although both carrier ampholytes (non-equilibrium pH gra> 
dieot electrophoresis, NEPHOE) (50.731 and IPOs (V-12 and 
4-12) (PO) and references thereto) have bees shown to sepa- 
rate basic proteins. 
One of the most important steps io the 2D PAOE techno*- 
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Fig. 1. Noo-cuJtund normal human ttcratinaryu pratrfos Kpariied using (A) osrrirr aeapboryto p.MO) and (B) IPOi (J- 10). A lew protdnt 
oft {odicoted for reference. B b from BjeSavb* cl al [721 



ogy coooems sample preparation at wy often wine proteins 
cannot be properly dissolved by the ryns cohition originally 
developed by OTarrcU t&5). Thus, there b pressing need to 
develop protocols for optimising cample enhMantioo. To- 
wards this aim, Rabilloud aad co-workers hove made use of 
the high loading capacity of IFGt to tensive tnembraac pre- 
trial for ftntctural anafysa\ and to doing eo have iinpiovcd 
their solubility by using a rombmitiflo of detergents aad cha* 
otropes |74 p 75). It hat bora shown that the addition of thio- 
urea, CHAPS and nuTobcttia surfactants to the lysis solution 
containing urea resulu 10 • much improved sohibifiatioa as 
wcO as transfer to the second d uutasicia SDS get As far as 
aadcar protcms are ooaeenad, <Mrg aad coQeaguta have 
improved considerably the cr parados of very basic proteias 
by first prccipiiatiag the samples with acetone prior to sofubl- 
bzanoa in the lysis solution f70). The problems associated 
with the estractfoa of tissue nrap^t, on the other haadV an 
much more complex and have not ben addressed yet la a 
systematic wshJoa. 

J/./. Detection An important tunitatioa of the 2D 
PAGB technology b the tack of very smsitive proocdura to 
detect those proteins that are present m very low abundance. 
In addition, detection procedures arc needed that can be ap- 
plied to a large number of resolved proteins whose abundance 
may span through seven or eight orders of nsagnJtude. 
dearly, the sensitivity of sflvcr nitrate aad Coomassle Bine 
ruining is inadequate, and only metabebe labelling with spe- 
cific isotopes may reveal enough proteias to warrant proteo- 
mic projects. Furthermore, the use of paorphor-unagiag tech- 
oology may enhance the sensitivity and (meanly of detection. 
Limitations of the radiotabdling approach include (t) lack of 
labelling of some proteins due to tow turnover, (il) problems 
axscoitcd .with safety regulations aad disposal, and (tii) diAV 
cullies in obtaining fresh human biopsy material for hbelliag 
eapcriroents. IdeaDy, one would like to have a highly sensitive 
fluorescence- based protein detectioo technique able to support 
all rypes of studies irrespective of the sample, or the cad point 
of the analysis. Preferably, the dye should not alter the mo- 
lecular weight and p/ of the proteins if it a to be added prior 
to electrophoresis, and should support quantitative studies 



involving pp****^** having catrcnm differences in their copy 
cum ben. Uafoitaaatefy, bo sum Ideal dye is available on 
the Batrket yet, a lt h o ugh Olford GlyooScscnce has developed 
tluonsont LTO-FAOB (htrp:Mewxaa^cmVpTOtccm/ 
homexoaTl, a technology not available to the srirntgr coxa- - 
tsuaity. l Ounrtsoxn ce ee^apoaotb such at SYFRO Orange, 
SYFRO Rod aad SYFRO Ruby have bora used to analyse 
whole protem rysatcs from bacterial aad osammahan ceQs, but 
their sensitivity (M og) is sbghUy rower than that of silver 
citrate [76,77). Soma advantages over siher naming intrude 
short atahdag laws and the fact that the grit do oot need to be 
fixed prior to staining, to addition, Hnln or no dcttsmtng it 
reauiredL 

For low abundance proteias of ksowo cd entity, d et e c t io n 
docs not pose a problem as Cats aad co worke rs have shown 
that 2D TAOB itnrnuaobtottiag in combmatioo with en- 
hanced chrrnihimincscence (ECL) can detect as fitth? as 100- 
300 tnnlrrutrt per cell in unfractionated ccSuav extracts ((D). 

3.12. QmtiWlaL Even though there are several toob 
available for the quantitation of protein spots; there b at 
present no available procedure for quaatiuting aO of the pro- 
teins resolved in a compla rail cure, rart of the problem bet 
ia the huge dynamic range of protem expression, lack of rev 
oration, post-tteastatfeaal modifications, ttainiag behavior of 
the protein*, as well aa ia the fact that many abundant pro- 
teins streak over lea abundant components interfering with 
the racasurcencma. At present, Buorcsccnt technology seems to 
be way ahead; as with the fiuoresoence stain Sypro Ruby 
there b a bacar response with respect to the sample aroount 
over a wide range of abvadance ([77) and references thesern). 
Quantitative guorceaeaoe auaearemcats can be performed 
with OCiVeamtra based systems as well as with laser scaaaer 
systems Q77) and references therein), lo some cases, reckon)* 
beJfing in combination with sdnallauon counting offers a rea- 
sonable alternative for quantitaung a small number of pro- 
teins m. 

J./J. Identification* Methods of protein aJentitkatian 
have included oexnuaoblotbag f78,791> Edman peptide se- 
quencing (JB0.8I) and refcrcflota thereto), amino acid compo- 
sition (8233}» aad more recently the use of matria*asslsted 
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obuio • 0k caauiaina infonutioo about the protcfe uvdlu ueka to athcr dies to the WorM Wide Web. 



tuer doorpUon/ioointioo (MALDI) nan spectrometry (84) 
and clnctrospray ionization (ESI) 115). Tbs tot ter tcctm iquaa, 
whkh rely on the comparison of peptide man fia^rpriat^ vt 
fan aad require only pieomol amounts of prottiiu. limited 
peptide sequencing can be performed using tandem mass spec- 
trometry or post-source decay, but ft U to ftfl) not possible to 
make N- and O terminal sequence idestxficatioo. For a review 
en Edmao peptide sequencing sec reference (8 1). Ander sen 
aad Mana review protein identification by macs apectjomctry 
etscwheie in this issue. 

J.2. Making comprekauitc 30 PACE daubeu* 

Advances is hardware and software development (Elm, 
Oeflab. Mtlanle, Quest. Tycbo and Kepler) for tetanias 
and image analysis of 2D gels, as weD as the development 
of sensitive toob for rapid protein identification, have cata- 
tyicd the establishment of comprehensive proieomic 2D 
FAOE databases (64,861 These dafnh«n aim at utcrfadag 
protein information with forthcoming DMA mappiag and se- 
quence data from genome projects, and offer a global ep- 
proacb to the study of gene expression both in health and 
disease (http-7/biobaiejJkJtg^bin/ceJij; htlptfcapasy.hcugcxh' 
sprot/sprot-top.html) GM0.55J and references therein). Besides 
annotating genomes, these data bases are expected to address 
problems thai cannot be approached by DMA analysis, 
namely, rebtive abundance of the protein product. posMreos- 
lational modifications, subcellular exaltation, turnover, in* 
teraction with other proteins as well as functional aspects. 
The first prototype databases were built by Oefii and Bravo 
rgT), who pioneered the use of protein identification tech- 
niques to establish comprehcniivc 2D FAOE databases 188} 



Fig. 4 thews a. region of the synthetic master 2D PAGB 
image fisocftectrfe focusing, IEF) of p'S^ethtoaine labelled 
proteins from fresh aofrcsdsejed human keratmoeyto as de- 
picuri to the World Wide Web (ht»:/7biob»dticjWnf 
ottu). totems tUgged with a ltd csm correspond to known 
polypeptides that haw ben identified by one or • combina- 
tJoo of techniques that include (i) 2D gd uninucoblottmg 
: antibodies end the ECL deteeriiro procecmre, 
j of CuomaAik tmffiaot Bhre stained pro- 
tdns H9). (HI) «*a spceuometiy of tryptsc peptto (90) aad 
fiy)romtgrirfon with known human proteins rjadtvidiiaJ pro- 
terns and organdie components) and (v) overlay teebjilquee 
(59) and <vi) transient oprcsstoo in mammanan celb (91). 

To date, 1237 polypeptides have been iaantUbd o this data- 
base (IEF and KEFHOE) of the 9159 that have been resolved 
end catalogued. In addition to ("S>nctbioiiutt labelled pro- 
tctns. the database eentains a few polypeptides that lack me- 
uioaiBC. but are revealed by rfrser staining. Gaowssfe bra- 
bant Bloc or by tobeOing with e mature of I© (^Qmrno 
adds. It should be stressed that tome protons migrate both 
in IEF and NEFHOB gets (for caempto, o-enolase. tnose- 
phosphate uoraeresc and ctongfttion factor 2) and they may 
serve at uutdmarks to align the gels and m references to nor- 
mal on quantitations in both pH directions. 

toformttioa gathered cm any given polypeptide, known or 
uaknowe. can be retrieved by clicking on the ce*rcapondmg 
spot. En thii case 14-1-2 sigma. else known as stmtifin (Fig. 4) 
(92). A file containing aD of the information enured for this 
particular protein, mostly obtained from experiments per- 
formed in our laboratory, to shown in Fig. 3 (only a fraction 
of the file is presented). Foes for known protons contain links 
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to a subset of Medline (http://www.otHjdm.alh.goW 
PubMod/), Swiss-Prot (hup ://upasy Jtou^ch/tprot/iprot-<op. 
html) and PDB (http^/www.cmbJ^kndclbcrg^c^db/). Other 
links ieehide OMIM (htrp:fjwww orht nhn nth goW OtnhsA 
OeneCsrds (hUpJ/biomformatkrwrirmsnn or itfcards). 
Gcae (http^/vwwjicbi4itn.oibwgovAfruOtscriadc3^tiiiO end 
ether Web cites such as CySPID (cytoskctetat protein data- 
base; btip 7/parHa med.yahvcdu/ ~ pjMBBrAcytosMhAodei. 
html), mcubolic pathways (compiled by K£GO: hnptf 
www.flcooxw.sdjp/keshA the cytokine explorer (JbnpM 
kbounigjnluouiixdu :443/cytokioca^xplorc7.htarX histology 
images (bltp://biosun.bieba3c. dk/~ p d^ or rh afaucrogsphi. 
bust), etc. lo the future, as new databases and related Web 
sites become available, h will be possible to navigate through- 
out various databases containing corapierocotary ituTormatfoo 
(U. oudcic aed and protein sequence, genome nmpping, dis* 
cases, protein structure. poA-franitatsonaJ modifications, aattV 
bodtcs, rigrjsJhng pathways, histology, etc). Gcarly. 
bases aBew easy aoous to a targe body of data: 
proicto is identified in givco databases, aJ) or the ittfcrroetioo 
aosusoutalcd cast be easily icti i oc d and made avtilsbfe to the 
researcher. 

Today, 188 information categories arc available to the 
World Wide Web version of the keraiiaocytc database, These 
include cellular localization, pathways, protein* affected in ' 
psoriatic keratiDOcyto, proteins expressed ia norma) urotbc* 
Uttm, kemtinoeytcs, fibroblasts. He La otlls and bladder caro- 
noma oeD line RT4, levels in fetal human tissues, partial ami* 
no arid sequences, abundance, cytoskcletal proton*, catcrum- 
binding proteins, snnuuns, chapcronins* heat shock proteins, 
etc. 

Functions to query the databases iodude search by came, 



protein number or keywords (Ffe 4), molocular weight and 
p/ v aa wcO as by organelle or ocOubr cenrpontnt. By cficUoj 
on any of ike orsjanrilnc oeDutar structures or uiiuuoututa it 
b possible to pet a protein fist as wcO a» their relative posi- 
tions on the csattcr osage. lo addition, ooi can retrieve a list 
of all knows protein* icccedsd.m the database, Moreover, we 
provide protocols and videos of preparative steps thai ceo be 
used to reproduce the data displayed aa wcD as a gallery of ZD 
gcb sad hnrnwiftblftlf 

Aarbus 2D PAOB databases available at http:JMobatt.dkJ 
cgitiofebs include traojitional ceU carcinomas (TCCs), squa- 
mous ceTJ caieutenat (SCCaX ones, fibroblast*, and moose 
kidney cells. Other databases available in the Internet caa be 
fouod to bftpJAeipaffy.hr^ bttp:// 
— - ha-V" ^hflTT' -vfc*>>Hip«»tehi; and bttp^/user* 
pefr^ehsraioJu-berr^ Abo, several 

prateomic tools for protest htmtinratmn and ctera ctnrfcav 
tioo, primary structure analysis, eecondary structure praa> 
boa, tertiary structure ami DMA trensmtioQ into protdo ere 
available at the EXPASY proteomic server (bttp^Awwwxipa- 
sy^b>Vrww/toob). 

SJ. Application* 

To date, there have beco thousands of reports illustrating 
the osefuuess of the 2D PAOB and proteoms technologies in 
msay areas of biology. Because of space Imitations, however, 
only a few of the application areas are highlighted below: 

IS. I. Omar. A great deal of research has beco devoted 
to the crtictdauoB of the pathways that control cell protifer- 
a una in normal ceils, and hence, the detrjrm nation of the 
means by which attentions of these pathways lead to abnor- 
mal growth characteristics anoYor empsutic irsjisformaoon 
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and cancer. Most studio have focused on oocogxscs, tumor 
suppressors, cell cycle rrgulaud proteins and figna) transduc- 
Hod Dolecuto id various ceil type of different spam (93-96). 
but ooly in a few cases there ha«c bets systematic attempts to 
analyze the proteia pbeaotype of pain of norratl. and trans- 
formed oet) types using a protccmic approach |60.68.97-99j. 
So Tar, only very few studies have made use of biopsy material 
due to problems related to the eel) heterogeneity. 

Already in 1982, Celts and coworkers started a protcomic 
approacb to the study of ceU traosfbnnation using dosed eel! 
. tines (68 t °7,W|. Their results showed that transformatioa it- 
suited in the abnormal caprcssioa of norma) genet, rather 
than in the expression of new does (98} la addition, their 
studies raised a word of caution concerning the widespread 
use of protein information derived from stadia* of different 
cell types from various species. Today, we art. wefl aware that 
cultured ecltr undergo important changes when placed in cul- 
ture due to different environmental factors and growth con- 
ditions (100) and accordingly, current efforts using the protco- 
mic approach are being directed to the study of oon •cultured 
cells and/or tiuue biopsies. Among the cancer projects cur* 
rcnDy underway, those centered cm butemia and hcmatoJog- 
rcel malignenctea, breast cancer, colorectal cancer and bladder 
cancer are briefly mentioned below. 

J.JL/.A Levkemta and hmuobfiatt maHgnwtckx Studies 
by Hannah and colleagues on childhood IcukcmJa and ether 
hematological fnrfgnancioi have yielded so far nsvcraJ 
marten that rnerode Oplft, also fcnown as — '»»— an 
oncoprotein that has been unpGcated in signal traiuductfob 
(101,107). In childhood leukemia, phosphorylation of CpIS 
was shown, to correlate with ■ high content of ocOs b the 5- 
phase suggesting a role in prouTcntioo. The group also 
identified om&-HI (oudecside diphosphate kinase A), a 19 
kDa protem that Is uprcgulatad m normal lymphocytes 
treated with cnitogens as well as in Imkcmia oelb from 
patients with acuta leukemia (103). 

J.J. /.J. Brtnit cancer. Systematic studies of dinicsJ bieasi 
tumors of different histopathologica) types by Franacn and 
co-worken (104-1061 bave revealed several proteins, including 
FCNA, bspfiO, hsp90 and cojrctkulin that are highly dcsvgo- 
taicd in invasive carcinomas end thai may serve as prognostic 
eaarfcert. These studies have made urn of fresh clinical tumor- 
tisruci of different subtypes and have paid special attention to 
sample preparation. 

J.J. /.J. Colorectal cancer. Studies 'of JungbJut and col- 
leagues |I07*I09) on sets of nmcrosooe^caln/ normal colon 
mucosa and colorectal carcinomas have revealed several pro- 
teins that an deregulated in the turners. Downregubted pro- 
terns included the liver fatty add binding protem, the smooth 
muscle protein 22-a, and c^coxytccase 2. Uprefutatod pro- 
teins included the heat shock protein 70 as well as several 
members of the 5*100 family of calcranvbinding proteins (S- 
100A9, S-100 AS, SI00AU and S-I00A6). Some of these find- 
ings have been confirmed by inxmuoobirlochenucn) studies 
POI). 

J. 3. 1.4. Bladder cancer. Cdb and colleagues [110-112] 
have explored the possibility of using protcomc expression 
profile* of bladder tumors as fingerprints to subclaasify niito- 
paihological types, and as a cutting point for searching for 
protein markers that may form the basis for diagnosis, prog- 
nosis and treatment. To achieve these goals they have ana- 
lyzed the protcomc expression profiles of hundreds of fresh 



tumors as wcD as random biopsies and cystectomies |l 10-1 12], 
and have cstauusfaed TCC and SOC proteornic databases that 
may provide a solid in/rastnirture to support future studio 
0 1 13); br^J/biobw^Wc^bin/adii). In the long run, a prac- 
tical goal of these studio b to identify a complete set of 
protein biomarkcrs that may be useful to classify hiaopaiho- 
logical grades, and that will provide with specific probes for 
the objective drsgnoss, prognosis and trcntmtat of these le- 
sions. So far, these studio have revealed markers for TCC 
progression (1101 a marker in the urine of patients bearing 
SCCk f 114,1 IS), and have red to the d e velo pmen t of a novel 
strategy for identifying pitmajignint squamous looms (1 12). 
The approach makes use first of pmtcomie trjchoofogjo to 
reveal and identify proteins' that arc differentially accessed 
so pure SCCk and normal urothefium. Thereafter, specific 
eouoocfio against the differcntiaDy expressed proteins are 
load to mununostain aerial cryostat sections of bmpsics (Im- 
munowalkmg) obsainnd from SOC patients that have under- 
gone removal of the bladder chat to wvaxhre disease (cystec- 
tomy). Since bladder cancer is a field disease (lid) - that is 
targe pan of the bladder tmrngbatrtt of dmb^ 
-ho expected that the urothefium of these patients may 
exhibit a spectrum of abnormalities ranging from metaplasia 
to invasive uxseese* 

•JLJLf. Jfoarr dbemte. Heart failure is among the leading 
causes of mortality in the Western Hemisphere and therefore, 
efforts are being devoted to the elucidation of the enolecular 
events leading to cardiac dysfunction QII7) and fefcrecmcs 
therein). So far, research on dusted cnjclrncayopathy (DCM) 
(117*1191 has revealed that approximatety 100 proteins ore 
deregulated, mostly downrca^datco\ in DCM aa compared to 
their normal counterparts. These i nc lu de cytoskcfctal and my- 
ofibrillar proteins, polypeptides associatod with enitoefcondria 
and mvohed in energy p r od uction , as wdl as interna asso- 
ciated with the stress response. These atactics have been expe- 
dited by the establishment of protcomic 2D PAOE cmubaso 
of the human heart (ventrmto and atrium) |I20) 

In addition to the global analyib of protem expression 
patterns in human heart rirsrasn, cardiac antigen oprcssion 
following cardiac tranrphmtattou has also been studied using 
techniques from proteomka For example, using 2D PAOE 
(cardiac proteins) in comhtntroo with Western bnmunoWot- 
ting (pacicnt sera), tt has been possible to identify antigens 
that featt with autoantioodio present both hi DCM 
(121,122) and m y e c a rdJl is (123). In this way, antigens associ- 
ated with the antibody response that may be involved in acute 
or chronic organ rejnetion have been cbsuaetcrisjxl 

Protctmnc s tud ies using natmal models of smart disease 
have also been carried out in sm-cflbn to unravel the raote> 
ular events lading to exustisc cfecssc. Recently, two cUflerent 
studies mvotvmg targe pacewnduced heart failure in 

the dog (124) and bovine DCM (1251 yudded comlar results ea 
those observed In human DCM. le^ercstiee/y. the most strik- 
ing result in the bovine DCM study was the finding of n 
seven-fold decrease in the caprcssion level of ubtquitin C-ter- 
rninsl hydrolase, as inappropriate ubiquination of proteins 
has been suggested as an Otologic factor in heart failure 1126). 

J.J.J, roxxcomgy. Chango m the envimnment as well as 
the growing interest of the pharmaceutical industry have 
stimulated the development of novel testing approaches based 
cm the recent technical advances both m genomics and pn> 
tccmics. Pioneering studies by Anderson and Anderson (127) 
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haw highlighted the usefulness and potential of the proteornic 
approach lo identify quantitative changre in rat liver expres- 
sion profiles aimriafrd with toxicity of drugs and other xeao* 
biou'c*. The data, which arc bong systematically stored in the 
rodent molecular efforts database, arc expected to yield tm- 
portaat bformatJoo u to the molecular mechanisms under- 
lying toxic responses. Likewise, the potential of protcomna 
have beca recently exemplified is studies of glomerular iieph* 
rotosictty in rat* (128), and of stimulated occupational jet Aid 
exposure in mice tune 1)29). to particular, the studies of Stcn> 
er's group have shown a remarkable correlation between de- 
creased levtb of calbindio D-2B, urinary calcium wasting la 
the urine, and intra tubular corticomedutUuy mtrificn lions in 
the kidney of rats and human treated with cyclosporin A 
l»30J. 

In the future, protcomJcs to b high-throughput mode is 
expected to have a major impact in the prr-diaica) safer/ 
tesu'ng of drugs. These studies will be facilitated by the estate 
lis run est of 2D PAGE databases of frequent target tissues 
(kidney, liver) as well as of cell Uses and fluids. 

J. 3. 4. Neurological disorders. The Crcoufetd-iaeob dis- 
ease (CJD) has been the subject of intensive analysb using 
protMmis. These studies have led to the Identification of 



two members of thr)4-)»3 family of proteins to the cerebral 
spinal a&id (CSF) of CJD pattmti (131J. The presence of these 
proteins in the CSF has been used to differentiate CJD from 
other dementia both with high sensitivity and specificity 
{112,133). These proteins, however, are present in the CSF 
of patients suffering from other neurological disorders not 
revolving dementia, limiting its clinical vaf&e (134,135). 

$.4. Cancbaieni 

. Today, there b oo technology in sight that matches the 
resolvng power of 2D PAGE, a trrhnfcjue that will continue 
to enjoy a centra) position in proieomlc projects for some disc 
in the near furore. There is considerable room for improve- 
ment, bowever, in particular as tar as sample preparation and 
solubility, choice of pH gradient and detection methods are 
concerned. Also, we need to improve the separation or very 
basic as well as very low and high oiolecular weight polypep- 
tides. In general, one expects researchers to first use wide IPG 
gradients to obtain an overview of the protcome profiles, and 
then proceed with a more debited analysis using narrow pH 
gradients, which provide higher resolution and sample load- 
ing, thus increasing the possibility of vouaJoing the lesser 
abundant proteins. The latter can be fectHteted by the avail- 
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ability of specific antibodies, as well as by the use of cxtm- 
lioo procedure* aad subcellular fractionation otttbods cur- 
rently at hand 1 1 36-1)8). 

There arc still maoy additiotul challenges, however, thai 
oust be addressed before a complete Human Frotcomc Proj- 
ect cao be imptemcnted |IJ9J. These include: automation to 
allow high-throughput sample analysis |l(0^ improved quao- 
liuiion capabilities, better inst r u men tation and software for 
peptide seqotDOog using raao spectrometry, more sophisti- 
cated image analysis systems to support gel comparisons 
and daubasiag as well as improved bio informs tic capabilities 
overall (12]. In addition, we need to deal with the problem of 
tissue oeO heterogeneity as more and more protectant projects 
will make osc of biopsy material in the future. 

<t Transcript and protein trrets: DNA e&hnairays ami 
proUunhs applied to the same samples 

As mentioned in Sectioa I, both DNA mlcroarriys and 
protcomics are oonplcsnestaiy technologies. To date* how- 
ever, there have hero only * Gmted number of studies in 
-which both technologies have been compared by. applying 
them to the same sample (141,142). Notably, the pioneer stud- 
ies of Anderson and Seilhamer (52) showed that there is not a 
good correlation between mRNA and protein levels in human 
Over, implying that geoe •based expression data may be of 
limited vatoe m the process of drag discovery. The study, 
which compared the levels of 19 gene products, yielded a 
correlation coefficient of 0.48 between mRNA end protem 
abundance, a vatoe that b half way between perfect and no 
correlation. 

Recently, Oratofl ct al. (maauscript to preparation) carried 
out a oseroarray and prottnniic study of bladder cancer In 
which they compared the transcript aad protein cspiession 
levels of pain or noninvasive and invasive low grade fresh 
TCCs. Even though they oould only oompare the levels of 
•bout 40 wc&» resolved and focused abundant proteins* h 
was clear that fai most cases there was ■ goad correlation 
between traoscripl and protein levels. Only in a few cases 
.they found discrepancies, and la soma of thorn maunon* 
they oould oot eliminate the possibility that this was due to 
rttcssengef stability, post^uanscriptional spleens, port-erantla- 
tional modtneatioas, protein foexumg problems, ckgradauon, 
as well as the choice of methods used to assess protein cm* 
prcssJoo levels (staining versus mdiotsbclling). For example; 
in oac tumor pair they found that the levels of keratin 8 
transcripts were much higher in the invasive tumor (compare 
Fig. 6C and D), while the protein levcb were much lower 
(compare Fig. 6A and B). Inununoblotting analysis using ker- 
atin B specific antibodies revealed that the discrepancy was 
due to degradation, as several related products of lower ap- 
parent molorutsr weights and more acidic p/s, could be visu- 
alized (rig. 6E). The identity of one of the erosrrcactieg 
peptides (indicated with arrows in Fig. 6E) to keratin 8 
was further confirmed by MALDl-ttmc-of-6j^t (TOF) (Fig. 
6F). From these studies it was dear that when comparing 
mRNA and protein levels there are other 'factors that 
need to be taken into consideration when interpreting the 
dsu. 

When comparing transcripts and protein apression profiles 
of matched sample pairs one often gets the expression that 
there ore more changes in the sbundaaoe of the mRNA tran- 



scripts as compared to die proteins. Considering that the cur* 
rest 2D FACE technology depicts mainly the more abundant 
proteins, it would seen possible that most of the dtsagas 
affecting protein levels may involve low ab u ndan ce polypep- 
tides. 

S» Gene expression profiling teehnfnnes: perspectives 

Novel and powerful tathriqoxs are row available to arulyte 
the global gene expression patterns of cultured cells aad tis- 
sues obtained from normal aad diseased subjects. Each of 
these technologies baa its own advantages and ItmJmtioaa, 
but fn combination they should provide us with a detailed 
gene expression phxrotype at both the transcription and 
translation level. A major challenge in the near future will 
be to define a base kmc for the normal gees expression pho- 
ne type of a given ceO type, tissue or body rhxal This ii sot a 
trivial task, however, as It wOl require the analysis of hun- 
dreds or even thousands of samples. 

Besides improvement on Use red! vidua! techniques them- 
selves (see above), there are ctSl major mutations that toast 
be addressed before these technologies can provide me 
expected outcome in molecular medicine, These m erode: 
G) technical problems assodsted with the analysis of expres- 
sion premies derived from tissocs that are composed of diner- 
cm ceil types, (5) lack of woctdum for Identifying tarfets 
that tie in the pathway of disease, and (ifi) need for bmtofor- ' 
matscs tools for rapidly ntmr-t the function of the putative 
targets. The letter, b of paramount importance to the phar- 
foaoeutical mdustry as the ntaemxation of disease deregulated 
targets alone is sot suftnxeat to start a costly drug screening 

Ashler as tissue hctcrogeoeity if concerned, the recent ad- 
vent of LCM horns great promise aa with thb technique H to 
possible to isolate rpeofic populations of ocfn from a tissue 
section using direct microscopic observation (29). However, 
even though the technique has been used for UNA analysis 
it is stiO not ready for enost preteomk projects as the number 
of cells that can be obtained is too small to generate reason- * 
able protein pranks to terms of the number of proteins that 
can be vtsuafixed (143). 

As DNA escfcarraya and pjuteonika generate more data m 
the future it will become a matter of priority to develop simple 
and rapid strategics to vaKdsta the. van amount of informa- 
tion that will be assented, eorticularlr in tissue biopsies. Thb 
we behave can be accomplished b part by esaaing use of 
ipeciBe antibodies m coabmauen wfotamnoMiioihcautai 
(U2). At present, there is no technology at hand that may 
allow us to prepare antibodies at will, aMteug* phage anti- . 
body libraries (14) show much prcsnfse. 

Finally, we would Hkc to empltasise that biology In this 
Mffltonium wiO be characterised by the study of complex 
biological phenomena. DNA microa/iays and protcomics 
are just tome of the technologies of functional grmrraics, 
and only their integration may oOow us to tackle the great 
edmpleaity tmdcrlying biological processes. 
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applications of DNA micromrays include uncovering unsuspected associations between 
genes and specific clinical features of disease that are helping devise novel molecular based 
disease classifications. Most published tumor studies using DKA microBrreys have either 
examined a pathologically homogeneous set of tumors to Identify dimcoDy relevant 
subi>pes, for example survivors vs non-survivors, or pathologically distinct subtypes 
belonging to the seme lineage, for example limited stage vs advanced Stage tumors to 
identify molecular correlates, or rumors of different lineages to identify molecular 
signBrares for each lineage. 

One of the landtnark studies thst have anracted much interest with respect to the 
potential contribution of DNA mJcmarroys to uncover novel classes of tumore, is an 
analysis of diffuse large tVcell lymphoma, the most common subtype of ron-HodfeaJn* 
lymphoma". Large B<eJI lymphoma is a cueicalry teterogeneoua disease. Only 40ft of 
patients have a good response to current therapy with a prolonged survival. A systematic 
charscteriaatioo of gene expression in this disease using DMA c ai cr uaj i uys uncovered a 
diversity in gene expression that reflected variation in tumor proliferation rate, host 
response and c&ffmmtotion state of the rumor. Two itmleeuMy distinct forms of oMuce 
large B-ceJI lymphoma woo uumvutd which had gene txpzmm patterns mdfcative of 
difTereot atages of B-cell differentiation. One type espresso* acnes ctaraeteffstk of 
germinal cento B cells and bad a sJgruftcantry better overall survival than the second type, 
which expressed genes, normally induced during ra vitro activation of peripheral blood D 
calla. The analyeSa oSererom identified prcvioualy undetected and cUidcaSly significant 
subtypes of lyinpuome. 

Studies to ctacrify trreast caawmn based on gene agrpmerhm profiles revealed that lbs 
tumors could be classified (mo a basal eplihciJaMiix group, an X^B2<rvcrcaprcesug 
ejoup and a normal breast-like group* M . A hxsrdna) epilbefial/cstrogen receptV'pcriove 
group could be divided into at least two wbgroupa, each vim a distinctive expression 
profile Survival anil vies on a euboohort of patients with lccaOly'odvanccd breast canoer 
uniformly treated, m a prospective study, showed significantly difrcrent outcomes for 
patients belonging to the various groups. Including a poor prognosis for the oasaMike 
subtype and a significant difTeience m outcome for the two estrogen receptor-positive 
groups. In an ic4epenuem study of 38 invasive breast centers* soiling molectjuv 
mfYcrcnoes between ductal card noma specimens were unoovered that ted to a suggested 
new classification lor eioogen-ieoeptor negative breast cancer 11 . Similarly, a study of 58 
node-oegative breast caiciiusmas disemdant for BR statue also unco v er ed a list of geses 
which discruTunaied tumors cecontlng to ER siatus a . Artificial neural neiworta could 
accurately predict ER status even after excluding top discriminator genes, Including BR 
itself. Only a small proportion of the 100 most mrportam BR discrtmmamr genes arc 
regulated by estradiol in MCF-7 cells. 

An Informative approach to analyse DNA microarray data in clinical smdiea is to divide 
such dam into a training set to uncover associations between specific genes and certain 
clinical features of the disease; and • testing set to validate these associations. However 
sinee both training and testing sets are derived from the same poof of pauenta whose 
samples were available to u^.iirvestigatom the extent to which such -r^^T may 
apply to other patients aoi included In the study, who may have different characteristics* 
cannot be inferred. Beer ct al. have undertaken a study of lung cancer in which the 
association chey observed between a set of genes and patient survival was validated with a 
testing set of tumors they had available and further validated with an independent set of 
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tumors for which micros/ray dais was collected by another group no: associated with thrt 
study 19 . Such cj tensive validation dearly indicated the robustness of the associatica 
uncovered bttweco o set of genes and survivd in lung erierioearcirioine. 

The aumcms published stodies using DNA miroarrays justify the ose of ihb 
technology for uncovering patterns of. gene expression that ore clinically informative. 
However it is substantially aore difficult to develop an uoderstnuring of disease at o 
mcchanisnc level using DNA mrcmnays, For eaott of the published studies it is unclear 
how well UNA levels reported cerretae with protein levels. A leek of comlation may 
Imply that rite predictive property of the gcne(e) is rndefeadesi of gene function. In studies 
of lung cancer, Chen ct el. collected both ONA mlcfoairvy and 2-D PAGE data* which 
allowed them to compare mRNA end protein levels to the same tumore w . The integrated 
intensities of 165 protein sputa representing protein produca of 98 genes were analysed in 
76 lung adeTtoaoctmsnas and 9 uiuuTccted luhg tissues using 2-D gels. For the same ftS 
samples, mRNA levels were determined using oUgonucleotide mic roafroys . Oafy 2 J of 98 
genes C21.4%) had a statistically dtnjftcani orrrclasion between protein and mRNA levels 
(r >0J44S; F 4K05). The inRNA/|potdo correlation coefficients also varied between 
isoforms of the same protein, bleating potentially isofbrm-spcoftc mechanisms for the 
regulation of protein abundance. 
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Despite the advances In our understanding of the molecular basis of 
gaps remain both m our understanding of cancer pathogenesis and In the 
effective strategies for early diagnosis and for ueasmeiiL A protcens 
uvesugatmj diseases such es center may overcome same of uSo tonStatkuu* of 
approaches'. DNA mlcroarrays have limited utility for the analysis of biological fluids and 
for uncovering directly, in the Raid, osseytbte bmmareera. Ncretroui aheraunns may occur 
m proteiot that ore not reflected m changes as (he RNA level. 

Unlike DNA mlesearrays thai provide one measure of gene expression, namely RNA 
levels, there is a need to implement protein oricroarray strategies that address the tunny 
different feeiures of proteins including determination of their levels in blologsed samples, 
and efeenrdaseJon of their selective mteractiona with otherwatelna. antibodies, drugs or 
verioos small ligandt. Arrays that incorporate s*tiboeues JMt or rccomttnam protaira 
obtained using cDNA caprcssion librartes^ 41 or phage-dlspley libraries** have been 
utilized for different types of protdo based assays. With other types of imcsosrreya, whole 
tissue-derived samples have been directly arrayed onto slides, to asscaa the reactivity of 
total protein lysates with specific Kgand 44-15 . Two practical applications of protein 
mtcroaurays were presented by Kodadee* nVaignatrd protein function array end protehv 
detecting arrsy. With protein function arrays, o large amount of protein is spotted on a solid 
fuppon at a defined location and tested to charoeterise either a biochemical activity or a 
molecular Interaction. The protdn*deuxting array consists or an arrayed set of protein 
Uganda used to profile gene expression and draw denatures rmficsuvc of the cellular state. 
The whole process of assembling the protdo array requires cortodemtions rdated to the 
nature of the support, the type of iinmobUisaiion, as well as die moJecular architecture of 
the particle being attested, four mdn different supports have been optimised to perform 
assays of thfs magnitude; chemically modified glass slides (potyrUlyancs. poryiMehides, 
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these samples. Patterns that distinguish between cancer patients and oormaJ subjects with 
rtmirtnble accuracy, have been reported for several types of cancer 1 . The coupling of 
protein amy* with mass spectrometry tcchnolosjcj is likely to become a powerful analytic 
tool with which to profile protein expression. Svch ah approach, known as frotsroChte 
(Gphergen Inc. USA), was successfully applied to snm> prostate asd ovarian cancers* 
end, more recently, bead and neck cancan*. Result) from these studies revealed die 
involvement of proteins in carcinogenesis processes and specific ally identified protean 
fingerprints from which cancer biomarltcrs were e* crated Die major drawbacks of direct 
analysis of tissues or biological fluids by MAU>1 if ihs preferential detection of proteins 
with a lower molecular mass and the difficulty m Identifying- the pvocstne corresponding to 
the masses observed. Anther mehnologica) inarroveoiertts coord enhance the otiHty of 
direct mass sueeeminetriii analysts of tf stuff and bioJcsncal fluids* 




Tltor* t. OxpruccfcOvc ctponntsufar opdmhotlcBaf POA etE«Dtactto&> (Rcpmtoed tan Minis ft at.. 
2002). 

Cancer tissue profiling studies that have utioted protein tmorsajreys are tegmatag to 
emerge: As • model to better understand how patterns of protein expression shape the tissue 
rmcroenvironmeot, iXhescvtc et a), analysed protein expression in tissue derived from 
squamous ceil carcinomas of the oral cavity through an antibody nncrearray approach for 
higMhroosjiput proteomic anoJysis* 9 . Udlfadag laser capture mteradeuectlon to proeore 
total . protein from specific nucroscopk cellular populations, they demon streied that 
ooantltativcv end potentially qualitative, differences in c gqncaaio n psBcras of mntUpro 
proteins within epithelial cells leproduciWy correlated with oral cavity tumor proprcssion. 
OifferendsJ expression of multiple proteins was found m toomal cells surrounding ami 
adjacent to regions of disused epithelium that c&ectly correlated with tumor progression of 
the epithelium. Most of the proteins identified m both cell types were involved in signal 
transduction pathways. They hypothesised therefore that 
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commWicaiiora involving complex cellular signaling between epithelium and stroma prey 
a key rmiAdfertflng oral cavity cancer progression. 

A clinically relevant application or protein micreaniyi is the identification of proteins 
thai induce an antibody response in autoimmune disorders 10 . Mieroamys were produced 
by attaching several honored proteins and peptides to the surface of derivatized gloss sfides. 
Arrays were iocubaxed with patient serum, end fluorescent labrts wen; used to detect 
autoantibody binding to specific proteins in sutohamuKc diseases, including systemic lupus 
cryihernatosm and rheumatoid arthritis. Such w dctoanay s represent a powerful toot co 
study immune responses».iD a variety of diseases including cancer. 

A reverse phase protein array approach thai tramoMUscs the whole repertoire of a 
tissue's proteins has been tievdoped*. A high degree of seasiuviry, precision and b'nearir/ 
was schjeved making it possible w quantify the plmspherytated status of signal proteins in 
human tissue eeD siibpopolations. Using this approach Fsweleoret al *have Ic^cudraaJly 
' analyzed the state of pro»sarvival checkpoint proteins at the ebhmueople transition stage 
from patient matched histologically normal prostate epithelium to prostate innwptlhcHsJ 
oeophuia end to invasive prostate cancer. Cancer progression was associated win Increased 
phosphorylation of Akl, suppression of apoptosis pathway* as wed as decreased 
pruuphoryjatum of ERJC At the transition from histologically fiorrna) epfthefiom to 
Itttraepithettal neoplasia, a statistically significant surge to phosphoiylattd All ami a 
concomitant suppression of downs&cam apoptosis pathways preceding the tasitfon farto 
invasive carciooma were observed. 

A major challenge In snaking brochfps for global analysis of proteia expression is the 
current tack of comprehensive acts of genome scale capture agents such as antibodies. Aa a 
result, blochips diet target specific classes of proteins such as ttinascs or cytokines era much 
easier to produce, that would have clinical utility. Another isnportant consideration to 
protein oiicroarrays is thai proteins undergo numerous poat-transiaiianej motffteetions eg 
phosphorylations, grycosylatfons, vmcb ore highly important so their functions, so they can 
determine activity, stability, localisation and turnover. To address the need for 
coToprehensivt analysts of proteins in their modified forma, several approaches to lha liquid 
based separation of cell and tissue rysates were investigated in order to obtain protein 
fractions with •educed complexity or pore Individual proteins 91 . The separation piji d uus 
can be arrayed in a manner thai allows the probing of protein constituents of colli and 
tissues to uncover specific targets. IV example, using a combination of anion exchange 
and reverse phase LC Mtdoa-Ourpidc ct at have obtained soma 2000 tnaMdusJ protein 
fractions that have been utilised to produce microsmya that Interrogate cancer oell 
proiecmes. Fractions thsl react with specific probes are within the reach of 
chromatographic and gel based separation techniques for resolving their rndjvidua) protein 
constituents aad of mass speetrancmc techniques for identification of their constituent 
proteins. The LC procedures allow sufficient protein amounts to be resolved for the 
construction of large numbers of mtcmariays from a given cell or tissue source. 

Protein microarraya of different types are Hkely to become commercially ovailable for 
assays of broad sets of proteins and may well rival or as least complement DWa 
microsmya aa tools far global expression analysis. 
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SUMMARY 



In conclusion, array, based technologies have emerged thai ccnoibutc to p/ofilini 
tissues at die genomic, oansoiptcmic and proteorrttc torch. Analytical toob are needed to 
mine the vast amount of data generaledt UUrmately the molecular analysis of cancer at s 
genome and proteomc scole will allow better classifscatfon of disease and tailored 
individuate^ therapy for individual patients, 
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Amplification and overexpression of putative oncogenes 
confer growth advantages for tumor development. We 
used a functional genomic approach thai integrated 
simultaneous genomic and transcript microarray, proteo- 
mics, and tissue microarray analyses to directly identify 
putative oncogenes in lung adenocarcinoma. We first 
identified 183 genes with increases in both genomic copy 
number and transcript in six lung adenocarcinoma cell 
lines. Next, we used two-dimensional polyacrylamide gel 
electrophoresis and mass spectrometry to identify 42 
proteins that were overexpressed in the cancer cells 
relative to normal cells. Comparing the 183 genes with 
the 42 proteins, we identified four genes - PRDXi, 
EEFIA2, CALR, and KCIP-l - in which elevated protein 
expression correlated with both increased DNA copy 
number and increased transcript levels (all r>0.84, two- 
sided / > <0.05). These findings were validated by South- 
ern, Northern, and Western blotting. Specific inhibition of 
EEFIA2 and KCIP-l expression with siRNA in the four 
cell lines tested suppressed proliferation and induced 
apoptosis. Parallel fluorescence in situ hybridization and 
immunohistochemical analyses of EEF1A2 and KCIP-l in 
tissue microarrays from patients with lung adenocarcinoma 
showed that gene amplification was associated with high 
Protein expression for both g i>nPR smH fhqt protein 



Introduction 

In lung adenocarcinoma, as in other types of cancer, 
gene amplification and the consequent overexpression of 
the amplified oncogene play an important role in the 
development of tumors, because their overexpression 
confers a growth advantage. The ability to identify 
putative oncogenes that are activated during tumorigen- 
csis could facilitate the choice of molecular genetic 
targets for diagnosis and therapy of the disease. This 
concept has been exemplified by HER-2, which was first 
found to be amplified in neuroblastomas and subse- 
quently shown to be associated with poor prognosis in 
breast cancer (Ross and Fletcher, 1999). Now, HER-2 
aberrations are used as a predictor of response to 
therapy, and treatment of HER-2-positive breast cancer 
with the monoclonal anti-HER-2 antibody trastuzumab 
has been shown to improve prognosis (Ross and 
Fletcher, 1999). Emerging evidence of common ampli- 
cons in lung adenocarcinomas (Luk et al., 2001; Jiang 
et al. y 2004; Tonon et al. % 2005) suggests that additional 
oncogenes remain to be identified; however, conven- 
tional techniques are ineffective in pinpointing such 
oncogenes. Parallel measurement of DNA copy number 
a n d mRNA — levels — i n cDNA microarrays perm i ts 
changes in copy number to be compared with transcrip- 
tion levels on a gene-by-gene basis to generate lists of 
candidate genes within the defining amplicons (Hyman 
et al. y 2002; Pollack el a/., 2002). However, use of 
transcript patterns does not allow assessment of the 
expression of protein products or identification of proto- 
oncogenes. Another approach, identifying differentially 
expressed proteins by proteomic analysis and then 
comparing the proteins present with mRNA expression 
in cDNA microarrays from the same specimens, can 
clarify the extent to which changes in transcript patterns 
reflect changes in their cognate, proteins and post- 
transcriptional mechanisms (Chen et aL, 2002), but this 
approach cannot be used to identify oncogenes driven 
by extensive increases of their gene copy number. 
Moreover, using individual microarrays or proteomic 
approaches alone cannot distinguish the cancer-driving 
oncogenes that directly propel tumor progression from 
the larger number of passenger genes that may be 
concurrently over-represented but are not biologically 
relevant in tumor development. 



overexpression was related to tumor grade, disease stage, 
Ki-67 expression, and a shorter survival of patients. The 
amplification of EEF1A2 and KCIP-l and the presence of 
overexpressed protein in tumor samples strongly suggest 
that these genes could be oncogenes and hence potential 
targets for diagnosis and therapy in lung adenocarcinoma. 
Oncogene (2006) 25, 2628-2635. doi: 1 0. 1 038/sj.onc. 1 209289; 
published online 12 December 2005 
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In this study, we used a comprehensive approach that 
integrated simultaneous comparative genomic hybridi- 
zation (CGH) and transcript microarray with proteomic 
analyses of six lung adenocarcinoma cell lines. We 
directly and specifically identified four putative onco- 
genes thai could have been activated through amplifica- 
tion and consequent elevation of transcript expression. 
We used small interfering RNA (siRNA) to inhibit the 
expression of two of these four genes in the lung cancer 
cell lines, which further implicated them in oncogenesis. 
We then explored the clinical significance of these 
findings by assessing the expression of these two genes 
in tissue microarrays of human lung cancer specimens. 
Our findings underscore the power of integrated 
functional genomic analyses for identifying putative 
oncogenes in lumorigenesis; such activated genes could 
be useful as targets for diagnosis or therapy in lung 
cancer. 



Results 

Simultaneous global genomic and transcript analyses 
identify 183 genes with increases in genomic copy 
numbers and transcript expression levels 
To identify genes in which increased DNA copy number 
might contribute to increased transcript in lung adeno- 
carcinomas, first we used CGH with microarrays of six 
lung adenocarcinoma cell lines. We identified 587 genes 
showing increases in DNA copy number across all six 
cell lines (Supplementary Table IS), which were 
distributed as 90 amplicons on all chromosomes except 
for chromosomes 13 and Y (Supplementary Table 2S). 
A subsequent transcript test with the identical arrays of 
the same cell lines revealed 275 genes that showed 
increased mRNA levels (Supplementary Table 3S). 
Using random permutation tests across all cancer cell 
linos, we identified 183 genes (31%) that showed 
elevated transcript levels from the 587 genes that were 
over - represented i n th e genom e (Tabl e I ), suggesting 



of their cognate proteins. To extend these findings 
beyond genomic over-representation to expression of 
the protein products of those genes, we next assessed 
protein expression in the same cell lines by two- 
dimensional polyacrylamide gel electrophoresis (PAGE) 
and found that 42 different proteins, representing 42 
individual genes, were significantly increased in the 
cancer cell lines (Table 2; Supplementary Figures IS and 
2S). Some of these proteins were identified as having 
multiple isoforms, and all individual isoforms exhibited 
increases in expression ranging from 4.6 to 12.8 times 
their expression in normal lung tissue cells. In compar- 
ing protein level of the 42 genes with changes in their 
cognate genomic and mRNA expression from the global 
microarray analyses, we found that four (9.5%) of those 
42 genes - PRDX1, EEFIA2. CALR, and KCIP-I - 
showed statistically significant correlations between 
elevated protein expression and increases in both copy 
number and mRNA expression (all r>0.84; / > <0.05) 
(Table 2) in the cancer cell lines. These findings imply 
that the abundance of these four proteins is attributable 
to the amplification and consequent elevated transcrip- 
tion of their cognate genes. 



Validation of copy number, transcript, and protein 
expression ofPRDXl, EEF1A2, CALR, and KCIP-1 
in lung cancer cell lines 

To confirm our findings from the high-throughput 
analyses, we next used Southern, Northern, and Western 
blotting to assess DNA, RNA, and protein levels for the 
four genes identified in the six cell lines. For compar- 
ison, we arbitrarily chose one gene, NFKBI^m which an 
increase in protein level did not correlate with genetic 
changes. Overall, we found excellent concordance 
between the CGH microarray and Southern blotting 
analyses, transcript array and Northern blotting ana- 
lyses, and proteomic and Western blotting analyses for 
all five genes (Figure I). For example, KCIP-1 showed 
fivefold amplification in five of the six cancer cell lines. 
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that elevated transcript levels of the 183 genes may 
reflect their genomic over-representation in the cancer 
cells. These findings are consistent with previous reports 
linking genomic changes with altered transcript patterns 
in breast cancer (Hyman et aL, 2002; Pollack et a/., 
2002). However, our finding that only 31% of the genes 
showing increased DNA copy numbers had cognate 
increases in transcript expression in lung adenocarcino- 
mas is different from the overall rates of 40-60% 
reported for breast cancer (Hyman et aU 2002; Pollack 
et <//., 2002). This discordance may reflect methodologic 
differences between studies or biological differences 
between breast cancer and lung adenocarcinoma. 

Proteomic analyses identify four genes for which protein 
abundance was associated with increases in the cognate 
gene and transcript levels 

Analysis of transcript patterns is insufficient for under- 
standing the expression of protein products and the 

effect of genomic o\er-:epresenumon on the expression 



whereas NFKBI showed no such increase in any of the 
cell lines. As for transcript expression, Northern blotting 
of EEF1A2 showed high expression in five of the six 
cancer cell lines; again, levels of NFKBI transcript were 
not increased in any cancer cell line as compared with 
normal bronchial epithelial cells. The results of Western 
blotting were also consistent with the results of the 
proteomic experiments; for example, five of the cancer 
cell lines exhibited strong protein bands for PRDXI as 
compared with normal cells. These findings provide 
strong support for the validity of the results derived 
from the high-throughput techniques in this study. 

These parallel analyses also revealed close correla- 
tions in the extent of changes in gene copies, transcript, 
and protein of each of the four genes in the cancer cell 
lines. For example, in the five cancer cell lines that 
showed at least fourfold increases in EEFJA2 copy 
number, expression of transcript and protein was also 
increased by at least a factor of four as well (relative to 
their expression in normal cells) (Supplementary Figure 
3S). The protein abundance of the four genes showing 
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(Supplementary Figure 4S). In contrast, EEF1A2 and 
KCIP-I protein levels remained unchanged in mock- 
treated control cells and in cells transfected with a 
scrambled siRNA sequence. At 48 h after siRNA 
transfection, the percentage of proliferation of the 
transfected cancer cells was reduced to 15-30% as 
compared with 91-100% of cell proliferation of the 
same cell lines treated with PBS or scrambled siRNA 
(Supplementary Figure 5S). Apoptosis of siRNA- 
transfected cells was 27-34%, whereas only 4% of the 
same cell lines treated with PBS or scrambled siRNA 
showed apoptosis. These results strongly support an 
oncogenic role for the identified genes in lung cancer and 
confirm their potential usefulness as therapeutic targets 
for the disease. 



Amplification and protein expression of KCIP-I and 
EEF1A2 in lung tissue 

To further validate these findings and to assess the 
possible clinical significance of the four potential 
putative oncogenes identified from the cell lines, we first 
applied fluorescence in situ hybridization and immuno- 
histochemical analysis, in parallel to commercially 
available human lung tissue microarrays (Ambion, 
Austin, TX, USA) to evaluate the status of two of these 
four genes in lung cancer tissue specimens. (Commer- 
cially available antibodies to PRDX1 or CALR were 
not suitable for use in immunohistochemicai analysis 
when this report was written.) Overexpression of KCIP- 
1 and EEFIA2 protein in the tumors was concordant 
with amplification of the corresponding genes 
(/ > = 0.0003 for KC1P-1 and P = 0.001 J for EEFIA2). 
For example, 16 (35%) of the 46 lung adenocarcinomas 
in the microarray showed amplification of KCIP-L and 
strong cytoplasmic staining for 1CC1P-J protein was seen 
in 18 tumors (39%) (Figure 2). We next examined 
whether overexpression of these genes was associated 
with increased cell proliferation by analysing Ki-67 
e xpre ss ion in contiguou s se ctions of the tissue micro - 
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corresponding increases in both DNA copy number and 
mRNA provides further evidence that these could be 
oncogenes, the activation of which is reflected by 
genomic amplification and consequent increases in 
transcript level in lung adenocarcinoma cell lines. 

Specific inhibition of EEF1A2 and KCIP-I expression by 
siRNAs led to decreased cell proliferation and induction of 
apoptosis 

To further prove the oncogenic function of the identified 
genes in iung tumorigenesis, we used siRNAs to inhibit 
the endogenous expression of EEFIA2 and KCJP-1 
protein in four lung cancer cell lines (HI 563, H229, 
H522, and SK-LU). Transfection of the cancer cells with 
specific siRNAs reduced the level of EEFIA2 and 
KCIP-I protein by 70 90% 48 li after transfection 



arrays. Positive Ki-67 expression was found to correlate 
with positive expression of both KCIP-I (/ > = 0.02) and 
EEFIA2 (P = 0.0l). To extend these findings, we then 
studied 1 1 tissue microarray blocks comprising normal 
and tumor tissue specimens from 113 patients with 
pathologic stage I non-small-cell lung cancer who had 
undergone curative surgery (Wang et aL 9 2005).. 
Immunohistochemicai analysis showed that EEF1A2 
was expressed in 32 cases (28%) and KCIP-I in 29 cases 
(26%). Univariate and multivariate Cox proportional 
hazards models were used to detect possible associations 
between EEFIA2 and KCIP-I expression and clinico- 
pathologic variables. Expression of EEFI A2 or KCIP-I 
was associated with short overall survival time 
(/> = 0.OOI2 for EEF1A2 and P = 0.0026 for KCIP-I) 
(Supplementary Figure 6S). Age at diagnosis, histologic 
type of cancer, degree of tumor differentiation, and 
smoking history were not associated with survival time. 

Although only two genes were validated in the lung 
tissue microarrays (because available antibodies to 
the other two genes were not suitable for use in 
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Tabic 2 Proteins showing significant ovcrcxpression in cancer cell lines relative to those in normal bronchial epithelial cell lines and their 
correlation coefficients with increas ed DNA copy number or mRNA values" 
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Fatty acid-binding protein 5 



"Only the gene showing statistically significant increased protein expression with increases in both genomic copy number and transcript 
simultaneously will be considered as potential putative oncogene in lung adenocarcinoma cells. V, Spearman correlation coefficients between 
proteins and genomic or mRNA values are based on all six cancer cell lines; bold indicates /><0.05, if r> 0.84000. Mw, molecular weight; p/, 
isoelectric point. 



immunohistochemical analysis), these findings are con- 
sistent with those from our cell lines, demonstrating 
again thaL genomic amplification and consequent 
increases in amounts of transcript may be, at least in 
part, driving the abundance of proteins in these lung 
tumors. The association between expression of these 
genes and that of Ki-67. a known indicator of poor 
prognosis in lung cancer (Martin et aL, 2004), suggests 
that activation of these genes may be an indicator of 
tumor aggressiveness. These results also suggest that 
expression of EEFIA2 and KC1P-1 proteins in stage I 
non-small-cell lung cancer may be useful as a marker for 
distinguishing patients with relatively poor prognosis 
from those who might benefit from adjuvant treatment. 



Discussion 

Our current study illustrates the power of integrated 
functional genomic analyses for identifying putative 
oncogenes and for evaluating their potential clinical 
significance. Among the four identified oncogenes, three 
genes {PRDXh CALR, and KC1P-1) have been im- 
plicated in lung tumorigenesis. PRDXI is an antioxidant 
protein involved in regulating cell proliferation, differ- 
entiation, and apoptosis. Kirn et al. (2003) found 
PRDXI expression to be elevated in both lung cancer 
and adjacent normal lung tissue, suggesting that 
activation of PRDXI may enhance proliferation in lung 
cancer. CALR has a major role in Ca 2+ binding and the 
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Figure I Confirmation by Southern, Northern, and Western blot 
analyses of increased DNA copies, transcript levels, and protein 
levels in the four genes identified in high-throughput analyses. For 
comparison, wc arbitrarily chose one gene, NFKBt, in which an 
increased protein level did not correlate with genetic changes. The 
Moiling results arc consistent with the results from ihcCGH array, 
transcript array, and protcomic analyses. Nor, indicates normal 
bronchial epithelial cell line. All the experiments were repeated at 
least three limes with each cell line. Means of normalized to /?-uctin 
signal intensities on Southern, Northern, and Western blots, along 
with 95% confidence intervals, were calculated (0-actin signals are 
not shown in ihc figure; two different normal bronchial epithelial 
cell lines were used in the confirmation and only one normal cell 
line is shown in the figure). 



noted that 90.7% of the genes showing high protein 
expression did not show corresponding increases 
in both DNA copy number and transcript, a finding 
consistent with that of others that transcriptional, 
translational, and post-translational regulatory mecha- 
nisms can greatly influence the abundance of protein 
in lung tumorigenesis (Chen et aL, 2002). For example, 
NFKB1 is a critical arbiter of immune responses, 
cell survival, and transformation and is often activated 
in several types of tumors (Chen et aL 2002). De- 
regulation of NFKBI is thought to be modulated 
through phosphorylation of Ser337 by protein kinase 
A (Chen et aL, 2002). Tn our study, 68.8% of the 
genes showing over-representation in the genome 
did not show elevated transcript levels, implying 
that at least some of these genes are 'passenger' genes 
that are concurrently amplified because of their 
location with respect to amplicons but lack bio- 
logical relevance in terms of the development of lung 
adenocarcinoma. 

Although the potential oncogenes we identified here 
are likely to be important, certainly other oncogenes 
could be involved in the development of lung adeno- 
carcinoma. The oligo microarray we used consists of 
22000 probes, which represent only about 60% of the 
human genome. Moreover, each probe was designed for 
the 3' region of expressed sequence tags of the selected 
genes. Also, our results were initially derived from 
cancer cell lines, although the findings were later 
confirmed in human tissue samples. Our ongoing study 
using microarrays with information on more genes 
and the development of high-resolution proteomic 
analyses for use with larger numbers of specimens will 
allow more comprehensive analyses of the molecular 
consequences of gene amplifications. Such expanded 
analyses will very likely lead to the identification of 
additional oncogenes. 

Some of the results of our current study were 
comparable to those of other studies of lung cancer. 
For fixnmp l e , genomic c o py number a nd protein leve l s 
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transcriptional regulation of other genes and was 
-recently found to be overexpressed in 73% of 40 lung 
adenocarcinomas (Oates and Edwards, 2000). KCIP-1 
belongs to the 14-3-3 family, which participates via the 
M APK and Wnt signaling pathways in the regulation of 
many cellular processes including cell proliferation and 
differentiation as well as tumorigenesis (Thomas et aL, 
2005). KCIP-I was recently found to be expressed in all 
12 iung tumors tested in a single-institution study (Qi 
et al. % 2005). Interestingly, EEFJA2 was originally 
considered a putative oncogene in ovarian cancer on 
the basis of its being amplified in 25% and over- 
expressed in 30% of the same set of ovarian tumors 
(Anand et aL, 2002); functional analyses have estab- 
lished its oncogenic role in cellular transformation (Lee, 
2003). Our discovery that EEFIA2 may be a putative 
oncogene in lung adenocarcinoma demonstrates the 
power of our functional genomic strategy for rapidly 
identifying potential oncogenes. 

Although the main focus of this study was to 
s|x*u!ically identify putarive oncogenes, it she u id be 



of KCTP-I were previously found to be amplified and 
overexpressed in primary lung cancers by cDNA clone- 
based CGH array analysis (Jiang et aL, 2004) and 
proteomic analysis (Chen et aL, 2002), respectively. Our 
functional genomic approach, which integrates simulta- 
neous CGH, transcript microarrys, proteomic analyses, 
and siRNA, allows us not only to quickly identify 
potential oncogenes but also to explore their significance 
as diagnostic and therapeutic targets in tumor progres- 
sion - more than could be achieved by any technique 
alone. 

Genes identified in this way may serve as promising 
targets for diagnosis and therapy in lung adenocarci- 
noma. Further research on the clinical implications of 
such genes is needed; experiments now underway in our 
laboratory include overexpression or the genes in 
normal cells, disruption of the function of these genes 
in cancer cells, and investigation of how interactions 
among these genes (or interactions with other known 
oncogenes) may mediate the expression of the trans- 
formed phenotype. 
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Figure 2 f /TFM J» amphficaUon is associated with high EEFIA2 protein expression in lung adenocarcinomas, (a) Cells from a lung 
adenoc^ranonia sample in which EEFlA2 \s amplified show more green signals (EEFIA2) than red signals (chromosome 20 
Lcn tromenc probe) i [original magnification, x 400). (b) Fmmunohistochemicai staining of cells from the same tissue sample as in panel 
a shows strong EEF I A2 sta.nmg in the cytoplasm, (c) A lung adenocarcinoma sample with two copies of EEFIA2 and chromosome 20 
T" Pf ? ,nd,cal ' n 8 no EEFIA2 amplification (original magnification, x 400). (d) Immunohistochcmical staining of cells 

from the same tissue sample as in panel c shows negative staining for EEFIA2. • 



Materials and methods 

Cell lines 

Six human lung adenocarcinoma cell lines (H23, H229, H1792, 
SK-LU-I. H522. and HI563) were obtained from the 
American Type Culture Collection (Manassas, VA, USA). 
Two norma l bronch i al ep i thel i a l cel l l i nes we r e obtained fr o m 



Clontech (Palo Alto, CA, USA). Genomic DNA, mRNA, and 
protein were derived from a single harvest of these cells. 

DNA and RNA profiles by microarray analysis 
Genomic DNA labeling and hybridization were performed as 
described previously (Barrett el «/.. 2004) with Agilent's 
Human 1A Oligo Microarray (V2) (Agilent Technologies, 
Palo A|lo. CA, USA), which, contains 22000 unique 60-mer 
oligos. Details of the protocol for analysing transcripts are 
available at http://www.cheni.agilem.com. Map positions for 
arrayed genes were assigned by identifying the DNA sequence 
represented in the UniGene cluster and matching it with the 
Golden Path genome assembly (http://genome.ucsc.edu/; Mat 
7, 2004 Freeze). Microarray images of DNA copy number and 
expression were analysed by using AgilcntCGH Analytics and 
Feature Extraction software. DNA copy number profiles that 
deviated significantly from background signal ratios (measured 
from nonnal control cell hybridization, as described elsewhere; 
Barrett et at.. 2004) were interpreted as evidence of true 
differences in DNA copy number. The criteria for defining 
genomic ovcr-rcprcscnialion and amplicons are described 
elsewhere (flyman rf a/.. 2002): details are iziven in the . 



Supplementary Information. An increase in mRNA level was 
defined as a twofold increase in signal ratio relative to that of 
the control (log 2 > I). 

Quantitative two-dimensional PAGE and mass spectrometry 
Analysis of proteins by two-dimensional PAGE and their 
idenUhcation by mass spectrometry were performed as" 
previously described (Shen et ai % 2004). Briefly, protein pellets 
were solubilized in rehydration buffer, after which the first- 
dimension isoelectric focusing was carried out with a Protean 
IEF Cell (Bio-Rad Laboratories) and the second-dimension 
separation was carried out with Bio-Rad's Ready Gel Precast 
Gels and the Bio-Rad Criterion Cell apparatus. Protein spots 
were visualized by silver-based staining, and all gels were 
assessed with Bio-Rad's PDQuest 2D gel image analysis 
software. Selected spots were subjected to in-gel tryptic 
digestion and analysed on a Voyager-DE PRO matrix-assisted 
laser desorptipn ionization/time-of-flight mass spectrometer 
(Applied Biosystems, Foster City, CA, USA). The mass list of 
the 20 most intense raonoisotopic peaks for each sample was 
entered in the MS-Fit search program (v3.2,l) (http:// 
prosr^tor.ucsf.edu/ucsflitmI4.0/msfit.htm) and searched in 
the National Center for Biotechnology Information protein 
database. 

Southern. Northern, and Western blot analyses 
Southern, Northern, and Western blot hybridizations were 
performed according to standard protocols. cDNA clones for 
the tested genes were purchased from Invitrogen (Carlsbad, 
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CA. USA) and prepared as probes for the blot hybridizations. 
Antibodies used were obtained as follows: PRDXI, CALR, 
NFKBI, KCIP-L and /J-actin from Santa Cruz Biotechnology 
(Satua Cruz, CA. USA); and EEFIA2 from Upstate Biotech- 
nology (Waltham. MA. USA). 

Fluorescence in situ hybridization and imnnmohistochcmical 
analyses of limy tissue microarrays 

Fluorescence in situ hybridizations and immunohistochemical 
analyses of KCIP-I and EEFIA2 were carried out as described 
elsewhere (Jiang et aL 2002; Wang et aL, 2005) with Lung 
Tissue Microarrays (Amnion, Austin, "TX, USA) and II 
homemade microarray blocks containing tissue samples from 
1 1 3 patients with pathologic stage I non-small-ccll lung cancer 
(Wang et at., 2005). DNA probes specific for KCIP-I and 
EEFIA2 wore obtained by screening a Human BAC Clone 
library (Invitrogcn) by polymerase chain reaction as described 
previously (Jiang ct aL. 2002). The antibodies used for the 
immunohislochcmical analyses were the same as those used 
for the Western blotting. Cell proliferation of the lung tissues 
was assessed with a Ki-67 monoclonal antibody from Santa 
Cruz Biotechnology. Definitions of the cutoff value for a 
positive result of each antibody are shown in Supplementary 
Information. 

siRNA transjection, cellular proliferation assay, and apoptosis 
analysis 

Translations were carried out by using siPORT Lipid 
Transfeclion Agent (Ambion) with siRNAs targeting KCIP-I 
or EEFIA2 or with a scrambled siRNA duplex (siControl) 
(Dharmacon Inc., Lafayette, CO. USA), with PBS used as a 
negative control (Jiang et aL, 2002). Cells were fixed 24, 48, or 
96 h later and subjected to further tests. All siRNAs were 
prepared by using a transcription-based method with Silencer 
siRNA according to the manufacturer's instructions (Am- 
bion). Sequences of the individual siRNAs are listed in 
Supplementary Table 4S. Inhibition of cell growth by the 
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siRNAs was determined by MTT staining, and cell growth 
rate was plotted against the percentage of viable cells in the 
saline-treated controls (a value arbitrarily set at 100%) (Jiang 
et aL, 2002). Apoptosis was analysed by fluorescence cell 
cycle analysis of terminal deoxynucleotidyl transferase- 
mediated dUTP nick-end labeling with FITC-labeled dUTP 
(Boehringer Mannheim Biochemicals, Mannheim, Germany) 
(Jiang et aL. 2005). 

Statistical analyses 

Relationships between gene copy number and mRNA level 
were examined as described elsewhere (Hyman et aL. 2002, 
Supplementary Information). Correlations between protein 
abundance and DNA copy number and mRNA expression of 
the corresponding genes, were evaluated with the Spearman 
correlation coefficient. Fisher s exact test and £ 2 -tests were 
used to analyse associations between amplification and 
expression of the candidate genes with various histopathologic 
variables of the samples in the tissue microarrays. Univariate 
and multivariate analyses were carried out with Cox's 
proportional hazards model to determine which independent 
factors might have a joint significant influence on survival. A 
P-value s£0.05 was considered statistically significant; all 
statistical tests were based on a two-sided significance level. 

Acknowledgements 

This work was supported by National Institutes of Health 
Grant CA If 3707-01, an institutional research grant from The 
University of Texas MD Anderson Cancer Center, a Devel- 
opmental Project/Career Development Award from The 
University of Texas Specialized Programs of Research 
Excellence in Lung Cancer P50 CA 70907, and an M Keck 
Center for Cancer Gene Therapy Award (all to FJ). We thank 
Christine F Wogan of the Department of Scientific Publica- 
tions for editorial review of this manuscript. 



2635 



Anand N, Murthy S, Amann G, Wemick M, Porter LA, 
Cukier IH et aL (2002). Nat Genet 31: 301-305. 

Barrett MT, SchefTcr A, Ben-Dor A, Sampas N, Lipson D, 
Kmcaid R et aL (2004). Proc Natl Acad Sci USA 101: 
17765-17770. 

Chen G. Gharib TG, Huang CC, Taylor JM, Misek DE, 
Kardta SL et aL (2002). Mol Cell Proteomics 1: 304-313. 

Hyman E, Kauraniemi P, Hautaniemi S, Wolf M, Mousses S, 
Rozenbium E et aL (2002). Cancer Res 62: 6240-6245. 

Jiang F, Caraway NP, Li RY, Katz RL. (2005). Oncogene 24: 
3409-3418. 

Jiang F ( Lin F, Price R, Gu J, Medeiros U, Zhang HZ et aL 

(2002). J Mo! Diagn 4: 144-149. 
Jiang F, Yin Z, Caraway NP, Li R, Katz RL. (2004). Neoplasia 

(New York) 6: 623-635. 
Kim HJ, Chac HZ, Kim YJ, Kim YH, Hwangs TS, Park EM 

ct aL (2003). Cell Biol Toxicol 19: 285-298. 
Lee JM. (2003). Rcprod Biol Endocrinol I: 69-73. 



Luk C, Tsao MS, Bayani J, Shepherd F, Squire J A. (2001). 

Cancer Genet Cytogenet 125: 87-99. 
Martin B. Paesmans M. Mascaux C. Berghmnns T rnihw P 



Meert AP et at. (2004). Br J Cancer 91: 2018-2225. 
Oates J, Edwards C. (2000). His to pathology 36: 341-347. 
Pollack JR, Soriie T, Perou CM, Rees CA, Jeffrey SS, Lonning 
PE et aL (2002). Proc Natl Acad Sci USA 99: 1 2963- 1 2968. 
Qi W, Liu X, Qiao D, Martinez JD. (2005). Int J Cancer 113: 
359-363; 

Ross JS, Fletcher JA. (1999). Semin Cancer Biol 9: 125-138. 
Shen J, Person MD, Zhu J, Abbruzzese JL, Li D. (2004). 

Cancer Res 64: 90 1 8-9026. 
Thomas D, Guth ridge M, Woodcock J, Lopez A. (2005). Curr 

Top Dev Biol 67: 285-303. 
Tonon G, Wong KK, Maulik G, Brcnnan C, Feng B, Zhang Y 

et aL (2005). Proc Natl Acad Sci USA 102: 9625-9630. 
Wang H, Zhang Z, Li R, Ang KK, Zhang H, Caraway NP 

et aL (2005). Int J Cancer 1 16: 285-290. 



Supplemenlary Information accompanies the paper on Oncogene website (http://www.nature.com/onc) 



Oncogene 



